Cloud Computing

Unlocking LLM training efficiency with Trillium — a performance analysis

Rapidly evolving generative AI models place unprecedented demands on the performance and efficiency of hardware accelerators. Last month, we launched our sixth-generation Tensor Processing Unit (TPU), Trillium, to address the demands of next-generation models. Trillium is purpose-built for performance at scale, from the chip to the system to our Google data center deployments, to power […]

Unlocking LLM training efficiency with Trillium — a performance analysis Read More »

Emerging Threats: Cybersecurity Forecast 2025

Every November, we start sharing forward-looking insights on threats and other cybersecurity topics to help organizations and defenders prepare for the year ahead. The Cybersecurity Forecast 2025 report, available today, plays a big role in helping us accomplish this mission. This year’s report draws on insights directly from Google Cloud’s security leaders, as well as

Emerging Threats: Cybersecurity Forecast 2025 Read More »

How Deutsche Bank built a new retail data platform on Google Cloud

Getting insights into customer’s preferences and needs is crucial for any modern business — and that’s especially true for a retail bank. Insights from customer data help deliver improved customer experiences through custom tailored products, better services, and higher levels of automation. But to gain these customer insights, you need your input data to be

How Deutsche Bank built a new retail data platform on Google Cloud Read More »

Efficiency engine: How three startups deliver results faster with Vertex AI

Have you heard of the monkey and the pedestal? Astro Teller, the head of Google’s X “moonshot factory,” likes to use this metaphor to describe tackling the biggest challenge first, despite being tempted by the endorphin boost of completing more familiar tasks. It’s a challenge startups know well. When you’re re-inventing the industry standard, it’s

Efficiency engine: How three startups deliver results faster with Vertex AI Read More »

Honoring our 2024 Google Cloud Partner All-stars

At Google Cloud, we’re fortunate to partner with organizations that employ some of the world’s most talented and innovative professionals. Together, we’re reshaping industries, driving customer success, and pushing the boundaries of what’s possible. Our partners are more than collaborators — they’re the change-makers defining the future of business. The Google Cloud Partner All-stars program celebrates these

Honoring our 2024 Google Cloud Partner All-stars Read More »

A student of Geoff Hinton, Yan Lacun, and Jeff Dean explains where AI is headed

Ben and Ryan are joined by Matt Zeiler, founder and CEO of Clarifai, an AI workflow orchestration platform. They talk about how the transformer architecture supplanted convolutional neural networks in AI applications, the infrastructure required for AI implementation, the implications of regulating AI, and the value of synthetic data.

A student of Geoff Hinton, Yan Lacun, and Jeff Dean explains where AI is headed Read More »

How PUMA leverages built-in intelligence with BigQuery for greater customer engagement

Leveraging first-party data, and data quality in general, are major priorities for online retailers. While first-party data certainly comes with challenges, it also offers a great opportunity to increase transparency, redefine customer interactions, and create more meaningful user experiences. Here at PUMA, we’re already taking steps to seize the opportunities presented by signal loss as

How PUMA leverages built-in intelligence with BigQuery for greater customer engagement Read More »

How an insurance company implements disaster recovery of 3-tier applications

A good strategy for resilience will include operating with high availability and planning for business continuity. It also accounts for the incidence of natural disasters, such as earthquakes or floods and technical failures, such as power failure or network connectivity. AWS recommends a multi-AZ strategy for high availability and a multi-Region strategy for disaster recovery.

How an insurance company implements disaster recovery of 3-tier applications Read More »

How to build custom nodes workflow with ComfyUI on Amazon EKS

ComfyUI is an open-source node-based workflow solution for Stable Diffusion and increasingly being used by many creators. We previously published a blog and solution about how to deploy ComfyUI on AWS. Typically, ComfyUI users use various custom nodes, which extend the capabilities of ComfyUI, to build their own workflows, often using ComfyUI-Manager to conveniently install and manage their

How to build custom nodes workflow with ComfyUI on Amazon EKS Read More »