Cloud Computing

Pirates in the Data Sea: AI Enhancing Your Adversarial Emulation

Matthijs Gielen, Jay Christiansen. Background: New solutions, old problems. Artificial intelligence (AI) and large language models (LLMs) are here to signal a new day in the cybersecurity world, but what does that mean for us, the attackers and defenders, and our battle to improve security through all the noise? Data is everywhere. For most organizations, the access

Empower your teams with self-service Kubernetes using GKE fleets and Argo CD

Managing applications across multiple Kubernetes clusters is complex, especially when those clusters span different environments or even cloud providers. One powerful and secure solution combines Google Kubernetes Engine (GKE) fleets and Argo CD, a declarative GitOps continuous delivery tool for Kubernetes. The solution is further enhanced with Connect Gateway and Workload Identity. This blog post

Data loading best practices for AI/ML inference on GKE

As AI models grow more sophisticated, the model data needed to serve them grows increasingly large. Loading the models and weights, along with the frameworks needed to serve them for inference, can add seconds or even minutes of scaling delay, impacting both costs and the end-user’s experience. For example, inference servers such as Triton, Text Generation Inference

65,000 nodes and counting: Google Kubernetes Engine is ready for trillion-parameter AI models

As generative AI evolves, we’re beginning to see the transformative impact it is having across industries and our lives. And as large language models (LLMs) increase in size (current models are reaching hundreds of billions of parameters, and the most advanced ones are approaching 2 trillion), the need for computational power will only

Unlocking LLM training efficiency with Trillium — a performance analysis

Rapidly evolving generative AI models place unprecedented demands on the performance and efficiency of hardware accelerators. Last month, we launched our sixth-generation Tensor Processing Unit (TPU), Trillium, to address the demands of next-generation models. Trillium is purpose-built for performance at scale, from the chip to the system to our Google data center deployments, to power

Emerging Threats: Cybersecurity Forecast 2025

Every November, we start sharing forward-looking insights on threats and other cybersecurity topics to help organizations and defenders prepare for the year ahead. The Cybersecurity Forecast 2025 report, available today, plays a big role in helping us accomplish this mission. This year’s report draws on insights directly from Google Cloud’s security leaders, as well as

How Deutsche Bank built a new retail data platform on Google Cloud

Getting insights into customers’ preferences and needs is crucial for any modern business, and that’s especially true for a retail bank. Insights from customer data help deliver improved customer experiences through custom-tailored products, better services, and higher levels of automation. But to gain these customer insights, you need your input data to be

Efficiency engine: How three startups deliver results faster with Vertex AI

Have you heard of the monkey and the pedestal? Astro Teller, the head of Google’s X “moonshot factory,” likes to use this metaphor to describe tackling the biggest challenge first, despite being tempted by the endorphin boost of completing more familiar tasks. It’s a challenge startups know well. When you’re re-inventing the industry standard, it’s

Honoring our 2024 Google Cloud Partner All-stars

At Google Cloud, we’re fortunate to partner with organizations that employ some of the world’s most talented and innovative professionals. Together, we’re reshaping industries, driving customer success, and pushing the boundaries of what’s possible. Our partners are more than collaborators — they’re the change-makers defining the future of business. The Google Cloud Partner All-stars program celebrates these
