Cloud Computing

Shaping the future together with our partners: The potential of agentic AI

Partners have always been central to the Google Cloud ecosystem, becoming more and more instrumental in bringing Google’s AI innovations to enterprises. I am inspired by how partners have already built more than 1,000 agentic use cases across every domain to solve deeply entrenched pain points for our shared customers. The emergence of agentic AI […]

Shaping the future together with our partners: The potential of agentic AI Read More »

AI/ML-ready Apache Spark with Dataproc

Apache Spark is the cornerstone for large-scale data processing, model training, and inference for AI/ML workloads. Yet, the complexities of environment configuration, dependency management, and MLOps integration can slow you down. To accelerate your AI/ML journey, Dataproc now delivers powerful, ML-ready capabilities for Spark. Available on both Dataproc on Compute Engine clusters and Google Cloud

AI/ML-ready Apache Spark with Dataproc Read More »

Tzafon selects Google Cloud to build next generation agentic machine intelligence

Tzafon, a San Francisco-based startup and AI R&D lab, is partnering with Google Cloud to utilize Google’s AI-optimized infrastructure and cloud services, which will help Tzafon deliver automation at large scale. The Tzafon team aims to do this by building systems and models that can support multiple, autonomous AI agents that are capable of working

Tzafon selects Google Cloud to build next generation agentic machine intelligence Read More »

Build with more flexibility: New open models arrive in the Vertex AI Model Garden

In our ongoing effort to provide businesses with the flexibility and choice needed to build innovative AI applications, we are expanding the catalog of open models available as Model-as-a-Service (MaaS) offerings in Vertex AI Model Garden. Following the addition of Llama 4 models earlier this year, we are announcing DeepSeek R1 is available for everyone

Build with more flexibility: New open models arrive in the Vertex AI Model Garden Read More »

How Renault Group is using Google’s software-defined vehicle industry solution

It’s funny to think of Renault Group, the massive European car manufacturer, as a software company, but in many ways, it is. Renault Group subsidiary Ampere Software Technology is dedicated to developing and integrating advanced software solutions for intelligent electric vehicles, aiming to create software-defined vehicles (SDVs) with enhanced customer experiences and new services.  Ampere

How Renault Group is using Google’s software-defined vehicle industry solution Read More »

How to integrate your Cloud SQL for MySQL database with Vertex AI & vector search

Search is a critical component of many modern applications – whether searching for products in an online storefront, finding solutions to your customers’ support cases, or building the perfect playlist. But traditional keyword searches often miss the deeper meaning of data. Vector embeddings, however, capture the complexities of your data, enabling highly accurate and powerful

How to integrate your Cloud SQL for MySQL database with Vertex AI & vector search Read More »

Implementing High-Performance LLM Serving on GKE: An Inference Gateway Walkthrough

The excitement around open Large Language Models like Gemma, Llama, Mistral, and Qwen is evident, but developers quickly hit a wall. How do you deploy them effectively at scale?  Traditional load balancing algorithms fall short, as they fail to account for GPU/TPU load status, leading to inefficient routing for computationally intensive AI inference with its

Implementing High-Performance LLM Serving on GKE: An Inference Gateway Walkthrough Read More »