Cloud Computing

Fast and efficient AI inference with new NVIDIA Dynamo recipe on AI Hypercomputer

As generative AI becomes more widespread, it’s important for developers and ML engineers to be able to easily configure infrastructure that supports efficient AI inference, i.e., using a trained AI model to make predictions or decisions based on new, unseen data. While great at training models, traditional GPU-based serving architectures struggle with the “multi-turn” nature

Fast and efficient AI inference with new NVIDIA Dynamo recipe on AI Hypercomputer Read More »

Deliver intuitive shopping experiences with Conversational Commerce agent

Consumer search behavior is shifting, with users now entering longer, more complex questions into search bars in pursuit of more relevant results. For instance, instead of a simple “best kids snacks,” queries have evolved to “What are some nutritious snack options for a 7-year-old’s birthday party?”  However, many digital platforms have yet to adapt to

Deliver intuitive shopping experiences with Conversational Commerce agent Read More »

Deliver intuitive shopping experiences with Conversational Commerce agent

Consumer search behavior is shifting, with users now entering longer, more complex questions into search bars in pursuit of more relevant results. For instance, instead of a simple “best kids snacks,” queries have evolved to “What are some nutritious snack options for a 7-year-old’s birthday party?”  However, many digital platforms have yet to adapt to

Deliver intuitive shopping experiences with Conversational Commerce agent Read More »

Our approach to carbon-aware data centers: Central data center fleet management

Data centers are the engines of the cloud, processing and storing the information that powers our daily lives. As digital services grow, so do our data centers and we are working to responsibly manage them. Google thinks of infrastructure at the full stack level, not just as hardware but as hardware abstracted through software, allowing

Our approach to carbon-aware data centers: Central data center fleet management Read More »

Automate app deployment and security analysis with new Gemini CLI extensions

Find and fix security vulnerabilities. Deploy your app to the cloud. All without leaving your command-line.  Today, we’re closing the gap between your terminal and the cloud with a first look at the future of Gemini CLI, delivered through two new extensions: security extension and Cloud Run extension. These extensions are designed to handle critical

Automate app deployment and security analysis with new Gemini CLI extensions Read More »

Automate app deployment and security analysis with new Gemini CLI extensions

Find and fix security vulnerabilities. Deploy your app to the cloud. All without leaving your command-line.  Today, we’re closing the gap between your terminal and the cloud with a first look at the future of Gemini CLI, delivered through two new extensions: security extension and Cloud Run extension. These extensions are designed to handle critical

Automate app deployment and security analysis with new Gemini CLI extensions Read More »