Cloud Computing

Unlock efficient model deployment: Simplified Inference Operator setup on Amazon SageMaker HyperPod

Amazon SageMaker HyperPod offers an end-to-end experience supporting the full lifecycle of AI development—from interactive experimentation and training to inference and post-training workflows. The SageMaker HyperPod Inference Operator is a Kubernetes controller that manages the deployment and lifecycle of models on HyperPod clusters, offering flexible deployment interfaces (kubectl, Python SDK, SageMaker Studio UI, or HyperPod

Unlock efficient model deployment: Simplified Inference Operator setup on Amazon SageMaker HyperPod Read More »

Smithy Java client framework is now generally available

Smithy Java client code generation is now generally available. You can use it to build type-safe, protocol-agnostic Java clients directly from Smithy models. With Smithy Java, serialization, protocol handling, and request/response lifecycles are all generated automatically from your model. This removes the need to write or maintain any of this code by hand. In this

Smithy Java client framework is now generally available Read More »

AI infrastructure efficiency: Ironwood TPUs deliver 3.7x carbon efficiency gains

At Google, we are committed to being transparent about the environmental impact of our AI infrastructure, publishing metrics on the lifetime emissions of our chips — from manufacturing to powering these chips in the data center. Today, we are updating these metrics for our seventh-generation TPU, Ironwood, which demonstrates an approximately 3.7x improvement in Compute

AI infrastructure efficiency: Ironwood TPUs deliver 3.7x carbon efficiency gains Read More »

Introducing Looker self-service Explores for faster ad-hoc analysis

By design, Looker is the enterprise semantic platform which ensures that every data set meets a high standard of accuracy by acting as a single source of truth and providing long-term consistency of your metrics. Today, we are introducing a complement to this governed framework: self-service Explores, to accelerate high-velocity, ad-hoc analysis. Self-service Explores allows

Introducing Looker self-service Explores for faster ad-hoc analysis Read More »

How a leading consumer insight brand uses Dataproc to hyper-personalise faster

At RVU, we have a clear and vital mission: empower people, transform industries.  For our market-leading home management and switching brands — Confused.com, Uswitch, Tempcover, Money.co.uk, and Mojo Mortgages — transparency and accurate information are everything. Today’s consumer expects more than a simple comparison table; they want personalized recommendations tailored to their unique circumstances.  Delivering

How a leading consumer insight brand uses Dataproc to hyper-personalise faster Read More »

Conversational Analytics now available for Looker Embedded environments

Looker Embedded analytics are at the heart of many next-generation data products, enabling monetization with live metrics and customizable user experiences. In the AI era, users expect apps to be highly interactive and conversational, and for data to be contextual, accessible and intuitive. Today, we are delivering conversational analytics in Looker Embedded environments with the

Conversational Analytics now available for Looker Embedded environments Read More »

Envoy: A future-ready foundation for agentic AI networking

In today’s agentic AI environments, the network has a new set of responsibilities. In a traditional application stack, the network mainly moves requests between services. But as discussed in a recent white paper, Cloud Infrastructure in the Agent-Native Era, in an agentic system the network sits in the middle of model calls, tool invocations, agent-to-agent

Envoy: A future-ready foundation for agentic AI networking Read More »

Introducing Veo 3.1 Lite and a new Veo upscaling capability on Vertex AI

We are introducing Veo 3.1 Lite, Google’s most cost-effective video model on Vertex AI.  Alongside this new model, we are also launching a new, standalone Veo upscaling capability on Vertex AI to help you enhance your existing video assets. Choosing the right Veo model for your workload When integrating video generation into your applications, matching

Introducing Veo 3.1 Lite and a new Veo upscaling capability on Vertex AI Read More »