AI

Secure short-term GPU capacity for ML workloads with EC2 Capacity Blocks for ML and SageMaker training plans

As companies of various sizes adopt graphic processing units (GPU)-based machine learning (ML) training, fine-tuning and inference workloads, the demand for GPU capacity has outpaced industry-wide supply. This imbalance has made GPUs a scarce resource, creating a challenge for customers who need reliable access to GPU compute resources for their ML workloads. When you encounter

Secure short-term GPU capacity for ML workloads with EC2 Capacity Blocks for ML and SageMaker training plans Read More »

Overcoming reward signal challenges: Verifiable rewards-based reinforcement learning with GRPO on SageMaker AI

Training large language models requires accurate feedback signals, but traditional reinforcement learning (RL) often struggles with reward signal reliability. The quality of these signals directly influences how models learn and make decisions. However, creating robust feedback mechanisms can be complex and error prone. Real-world training scenarios often introduce hidden biases, unintended incentives, and ambiguous success

Overcoming reward signal challenges: Verifiable rewards-based reinforcement learning with GRPO on SageMaker AI Read More »

Agents that transact: Introducing Amazon Bedrock AgentCore Payments, built with Coinbase and Stripe

We’re in the midst of a fundamental shift in how software gets built and used. AI agents are moving beyond assistants that wait for instructions. They call APIs, access MCP servers, coordinate with other agents, and complete complex multi-step tasks on behalf of users. As agents take on increasingly diverse tasks, the ecosystem around them

Agents that transact: Introducing Amazon Bedrock AgentCore Payments, built with Coinbase and Stripe Read More »