AI

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

Generative AI question-answering applications are pushing the boundaries of enterprise productivity. These assistants can be powered by various backend architectures including Retrieval Augmented Generation (RAG), agentic workflows, fine-tuned large language models (LLMs), or a combination of these techniques. However, building and deploying trustworthy AI assistants requires a robust ground truth and evaluation framework. Ground truth […]

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval Read More »

Accelerate AWS Well-Architected reviews with Generative AI

Building cloud infrastructure based on proven best practices promotes security, reliability and cost efficiency. To achieve these goals, the AWS Well-Architected Framework provides comprehensive guidance for building and improving cloud architectures. As systems scale, conducting thorough AWS Well-Architected Framework Reviews (WAFRs) becomes even more crucial, offering deeper insights and strategic value to help organizations optimize

Accelerate AWS Well-Architected reviews with Generative AI Read More »

Dynamic metadata filtering for Amazon Bedrock Knowledge Bases with LangChain

Amazon Bedrock Knowledge Bases offers a fully managed Retrieval Augmented Generation (RAG) feature that connects large language models (LLMs) to internal data sources. It’s a cost-effective approach to improving LLM output so it remains relevant, accurate, and useful in various contexts. It also provides developers with greater control over the LLM’s outputs, including the ability

Dynamic metadata filtering for Amazon Bedrock Knowledge Bases with LangChain Read More »

Markus Buehler receives 2025 Washington Award

MIT Professor Markus J. Buehler has been named the recipient of the 2025 Washington Award, one of the nation’s oldest and most esteemed engineering honors.  The Washington Award is conferred to “an engineer(s) whose professional attainments have preeminently advanced the welfare of humankind,” recognizing those who have made a profound impact on society through engineering innovation. Past recipients

Markus Buehler receives 2025 Washington Award Read More »

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

Increasingly, organizations across industries are turning to generative AI foundation models (FMs) to enhance their applications. To achieve optimal performance for specific use cases, customers are adopting and adapting these FMs to their unique domain requirements. This need for customization has become even more pronounced with the emergence of new models, such as those released

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1 Read More »

Reduce conversational AI response time through inference at the edge with AWS Local Zones

Recent advances in generative AI have led to the proliferation of new generation of conversational AI assistants powered by foundation models (FMs). These latency-sensitive applications enable real-time text and voice interactions, responding naturally to human conversations. Their applications span a variety of sectors, including customer service, healthcare, education, personal and business productivity, and many others.

Reduce conversational AI response time through inference at the edge with AWS Local Zones Read More »