AI

Tailoring foundation models for your business needs: A comprehensive guide to RAG, fine-tuning, and hybrid approaches

Foundation models (FMs) have revolutionised AI capabilities, but adopting them for specific business needs can be challenging. Organizations often struggle with balancing model performance, cost-efficiency, and the need for domain-specific knowledge. This blog post explores three powerful techniques for tailoring FMs to your unique requirements: Retrieval Augmented Generation (RAG), fine-tuning, and a hybrid approach combining […]

Tailoring foundation models for your business needs: A comprehensive guide to RAG, fine-tuning, and hybrid approaches Read More »

How Rufus doubled their inference speed and handled Prime Day traffic with AWS AI chips and parallel decoding

Large language models (LLMs) have revolutionized the way we interact with technology, but their widespread adoption has been blocked by high inference latency, limited throughput, and high costs associated with text generation. These inefficiencies are particularly pronounced during high-demand events like Amazon Prime Day, where systems like Rufus—the Amazon AI-powered shopping assistant—must handle massive scale

How Rufus doubled their inference speed and handled Prime Day traffic with AWS AI chips and parallel decoding Read More »

New Amazon Bedrock Data Automation capabilities streamline video and audio analysis

Organizations across a wide range of industries are struggling to process massive amounts of unstructured video and audio content to support their core business applications and organizational priorities. Amazon Bedrock Data Automation helps them meet this challenge by streamlining application development and automating workflows that use content from documents, images, audio, and video. Recently, we

New Amazon Bedrock Data Automation capabilities streamline video and audio analysis Read More »

GuardianGamer scales family-safe cloud gaming with AWS

This blog post is co-written with Heidi Vogel Brockmann and Ronald Brockmann from GuardianGamer. Millions of families face a common challenge: how to keep children safe in online gaming without sacrificing the joy and social connection these games provide. In this post, we share how GuardianGamer—a member of the AWS Activate startup community—has built a

GuardianGamer scales family-safe cloud gaming with AWS Read More »

MIT announces the Initiative for New Manufacturing

MIT today launched its Initiative for New Manufacturing (INM), an Institute-wide effort to reinfuse U.S. industrial production with leading-edge technologies, bolster crucial U.S. economic sectors, and ignite job creation. The initiative will encompass advanced research, innovative education programs, and partnership with companies across many sectors, in a bid to help transform manufacturing and elevate its

MIT announces the Initiative for New Manufacturing Read More »

Principal Financial Group increases Voice Virtual Assistant performance using Genesys, Amazon Lex, and Amazon QuickSight

This post was cowritten by Mulay Ahmed, Assistant Director of Engineering, and Ruby Donald, Assistant Director of Engineering at Principal Financial Group. The content and opinions in this post are those of the third-party author and AWS is not responsible for the content or accuracy of this post. Principal Financial Group® is an integrated global

Principal Financial Group increases Voice Virtual Assistant performance using Genesys, Amazon Lex, and Amazon QuickSight Read More »