AI

Build reliable AI agents with Amazon Bedrock AgentCore Evaluations

Your AI agent worked in the demo, impressed stakeholders, handled test scenarios, and seemed ready for production. Then you deployed it, and the picture changed. Real users experienced wrong tool calls, inconsistent responses, and failure modes nobody anticipated during testing. The result is a gap between expected agent behavior and actual user experience in production.


Building an AI powered system for compliance evidence collection

Compliance audits require comprehensive evidence trails, often involving hundreds of screenshots across multiple systems. Your compliance teams likely spend hours manually navigating through GitHub repositories, AWS consoles, and internal applications, capturing screenshots at each step. This manual process is time-consuming, error-prone, and difficult to reproduce consistently across audit cycles. This post demonstrates how we automated


Accelerating software delivery with agentic QA automation using Amazon Nova Act

Quality assurance (QA) automation is critical for modern software delivery. It catches regressions before production, validates user journeys at scale, and enables confident feature releases. But traditional QA automation solutions are brittle and demand specialized programming knowledge, decelerating software delivery. Automation frameworks rely on implementation details including UI selectors, element identifiers, and structural references to


AWS launches frontier agents for security testing and cloud operations

I’m excited to announce that AWS Security Agent on-demand penetration testing and AWS DevOps Agent are now generally available, representing a new class of AI capabilities we announced at re:Invent called frontier agents. These autonomous systems work independently to achieve goals, scale massively to tackle concurrent tasks, and run persistently for hours or days without
