Cloud Computing

Parallelstore is now GA, fueling the next generation of AI and HPC workloads

Organizations use artificial intelligence (AI) and high-performance computing (HPC) applications to process massive datasets, run complex simulations, and train generative models with billions of parameters for diverse use cases such as LLMs, genomic analysis, quantitative analysis, or real-time sports analytics. These workloads place big performance demands on their storage systems, requiring high throughput and I/O […]

Three steps in mapping out your modern platform strategy

As AI adoption speeds up, one thing is becoming clear: the developer platforms that got you this far won’t get you to the next stage. While yesterday’s platforms were awesome, let’s face it, they weren’t built for today’s AI-infused application development and deployment. And organizations are quickly realizing they need to update their platform strategies

When to use supervised fine-tuning for Gemini

Have you ever wished you could get a foundation model to respond in a particular style, exhibit domain-specific expertise, or excel at a specific task? While foundation models like Gemini demonstrate remarkable capabilities out-of-the-box, there can be a gap between their general knowledge and the nuanced understanding required for specific applications. Supervised Fine-Tuning (SFT) emerges

An advanced LlamaIndex RAG implementation on Google Cloud

Introduction: Retrieval Augmented Generation (RAG) is revolutionizing how we build Large Language Model (LLM)-powered applications, but unlike tabular machine learning where XGBoost reigns supreme, there’s no single “go-to” solution for RAG. Developers need efficient ways to experiment with different retrieval techniques and evaluate their performance. This post provides a practical guide to rapidly prototyping and

You can now sign Microsoft Windows artifacts with keys protected by Cloud HSM

To build trust in the software world, developers need to be able to digitally sign their code and attest that the software their customers are downloading is legitimate and hasn’t been maliciously altered. Keys used to sign code are the cryptographic equivalent of crown jewels for many organizations, and protecting them is of utmost importance. 

How Banfico built an Open Banking and Payment Services Directive (PSD2) compliance solution on AWS

This post was co-written with Paulo Barbosa, the COO of Banfico. Introduction: Banfico is a London-based FinTech company, providing market-leading Open Banking regulatory compliance solutions. Over 185 leading Financial Institutions and FinTech companies use Banfico to streamline their compliance process and deliver the future of banking. Under the EU’s revised PSD2, banks can use application

Meet the AI native developers who build software through prompt engineering

On today’s episode we chat with Crystal Xu, chief of staff at FSH Tech. She explains how she learned to build and deploy apps and services inside her company using Python and Java, without ever getting a traditional computer science education or training to write code. Instead, Xu works with GenAI systems like ChatGPT, Cursor,

Introducing Valkey 8.0 on Memorystore: unmatched performance and fully open-source

Editor’s note: Ping Xie is a Valkey maintainer on the Valkey Technical Steering Committee (TSC). Today, we’re thrilled to announce Valkey 8.0 on Memorystore in preview, making Google Cloud the first major cloud platform to offer Valkey 8.0 as a fully managed service. Building upon the launch of Memorystore for Valkey 7.2 in August 2024,

AlloyDB supercharges PostgreSQL vector search with accuracy, speed, and 1B+ scale

In our 20 years of experience integrating AI into real-world applications, an important theme emerges: the key to building enterprise gen AI applications is having a trustworthy, scalable data foundation that supports the scale and performance needs of the largest workloads. When you’re building a gen AI or search application, you need high-quality results in