Cloud Computing

Network Performance Decoded: Much ado about headers, data and bitrates

We are happy to drop the third installment of our Network Performance Decoded whitepaper series, where we dive into topics in network performance and benchmarking best practices that often come up as you troubleshoot, deploy, scale, or architect your cloud-based workloads. We started this series last year to provide you helpful tips to not only […]

Network Performance Decoded: Much ado about headers, data and bitrates Read More »

How to secure your remote MCP server on Google Cloud

As enterprises increasingly adopt model context protocol (MCP) to extend capabilities of AI models to better integrate with external tools, databases, and APIs, it becomes even more important to ensure secure MCP deployment.  MCP unlocks new capabilities for AI systems; it can also introduce new risks, such as tool poisoning, prompt injection, and dynamic tool

How to secure your remote MCP server on Google Cloud Read More »

BigQuery under the hood: Scalability, reliability and usability enhancements for gen AI inference

People often think of BigQuery in the context of data warehousing and analytics, but it is a crucial part of the AI ecosystem as well. And today, we’re excited to share significant performance improvements to BigQuery that make it even easier to extract insights from your data with generative AI.  In addition to native model

BigQuery under the hood: Scalability, reliability and usability enhancements for gen AI inference Read More »

GKE network interface at 10: From core connectivity to the AI backbone

It’s hard to believe it’s been over 10 years since Kubernetes first set sail, fundamentally changing how we build, deploy, and manage applications. Google Cloud was at the forefront of the Kubernetes revolution with Google Kubernetes Engine (GKE), providing a robust, scalable, and cutting-edge platform for your containerized workloads. Since then, Kubernetes has emerged as

GKE network interface at 10: From core connectivity to the AI backbone Read More »

How California is transforming public services with Google Cloud

State and local governments across the nation face a myriad of challenges, including strained budgets, aging infrastructure, and a complex regulatory landscape. In California, these challenges are compounded by a rapidly growing population and increasing demand for public services. To address these issues, the state is turning to technology as a catalyst for change. California,

How California is transforming public services with Google Cloud Read More »

Setting new expectations: Benchmarking high-performance trading with C3 machines

Trading in capital markets demands peak compute performance, with every microsecond impacting critical decisions and market outcomes. At Google Cloud, we’re committed to providing global markets with the cutting-edge infrastructure they need to create and participate in digital exchange ecosystems. Our industry investments enable a purpose-built, cloud-native market infrastructure solution leveraging a global network that

Setting new expectations: Benchmarking high-performance trading with C3 machines Read More »

Gemini and OSS text embeddings are now in BigQuery ML

High-quality text embeddings are the engine for modern AI applications like semantic search, classification, and retrieval-augmented generation (RAG). But when it comes to picking a model to generate these embeddings, we know one size doesn’t fit all. Some use cases demand state-of-the-art quality, while others prioritize cost, speed, or compatibility with the open-source ecosystem. To

Gemini and OSS text embeddings are now in BigQuery ML Read More »