Home / Blog / VMware VCF 9.1: The AI-Native Private Cloud Pivot
Dillip Chowdary

VMware VCF 9.1: The AI-Native Private Cloud Pivot

By Dillip Chowdary • May 05, 2026

Broadcom has officially released VMware Cloud Foundation (VCF) 9.1, marking a significant milestone in the company's pivot toward on-premises AI infrastructure. As enterprises struggle with the rising costs and data sovereignty issues of public cloud AI offerings, VCF 9.1 provides a robust, "AI-Native" alternative for building high-performance clusters in the private data center. The release introduces several breakthrough features, including intelligent memory tiering and native support for the latest AMD MI350 GPUs, positioning VCF as the "Operating System for the AI Factory."

Intelligent Memory Tiering: Breaking the RAM Bottleneck

The most technically significant feature in VCF 9.1 is Intelligent Memory Tiering. In the world of Large Language Models (LLMs), memory capacity is the primary bottleneck. High-bandwidth memory (HBM) is both expensive and in short supply. VCF 9.1 addresses this by allowing the system to seamlessly tier data between high-speed RAM and ultra-low-latency NVMe storage. The platform's orchestrator uses AI models to predict which data will be needed next, pre-fetching it from NVMe to RAM just in time for processing. This effectively "virtualizes" the memory wall that has limited the scale of private AI clusters.

This "Virtual Memory" approach for AI workloads allows organizations to run significantly larger models on their existing hardware. Broadcom claims that VCF 9.1 can support models up to 3x larger than previous versions without a corresponding increase in physical DRAM. For enterprises looking to deploy 175B+ parameter models on-premise, this is a game-changer that significantly reduces the total cost of ownership (TCO) for AI infrastructure. The system also includes DRAM-NVMe balancing, which ensures that the most critical weights of a model always reside in the fastest memory tier.

AMD MI350 Support: Expanding the GPU Ecosystem

VCF 9.1 also brings native, day-zero support for the AMD Instinct MI350 series of GPUs. This is a strategic move by Broadcom to reduce the industry's reliance on NVIDIA hardware. By providing a "First-Class" experience for AMD silicon, VMware is enabling a more competitive and diverse GPU market. VCF 9.1's hardware abstraction layer allows IT teams to manage AMD and NVIDIA GPUs within the same cluster, providing the flexibility to choose the best hardware for each specific workload. This "Vendor-Neutral" approach is a key part of VMware's value proposition in 2026.

The integration includes advanced features like Direct-GPU-to-Storage paths, which allow the AMD MI350s to pull data directly from NVMe drives without involving the CPU. This significantly reduces latency and increases throughput for data-intensive tasks like training and fine-tuning. For organizations that have already invested heavily in the AMD ecosystem, VCF 9.1 provides the professional-grade orchestration they need to scale their AI efforts. The release also includes enhanced support for RoCE v2 (RDMA over Converged Ethernet), ensuring high-speed interconnectivity between nodes in a massive MI350 cluster.

Focus on On-Premises AI Clusters

Broadcom's strategy with VCF 9.1 is clear: win the "Private AI" market. While the public cloud is great for experimentation, many enterprises are finding that running production AI workloads at scale is both cheaper and more secure on-premise. VCF 9.1 is designed to make building and managing these private AI clusters as easy as possible, providing a "Cloud-Like" experience within the company's own data center. The platform now includes Self-Healing Infrastructure agents that can detect failing hardware components and re-route AI inference tasks before they impact the user experience.

The platform includes a new AI Infrastructure Planner, a tool that helps engineers design their clusters based on their specific model requirements and performance goals. It can calculate the optimal balance of CPUs, GPUs, and storage, and even predict the power and cooling requirements of the final setup. This reduces the risk of "Over-Provisioning" and ensures that enterprises get the maximum value from their hardware investments. Furthermore, VCF 9.1 introduces Workload-Aware Scheduling, which can prioritize certain AI models over others based on business value, ensuring that the company's most important intelligence always has the resources it needs.

Data Sovereignty and Compliance

Beyond performance and cost, VCF 9.1 is being driven by the global trend toward Data Sovereignty. Many industries, particularly finance and healthcare, are prohibited from moving sensitive data to the public cloud for AI processing. A Modern Private Cloud powered by VCF 9.1 allows these organizations to leverage the latest AI breakthroughs while maintaining full control over their data and staying compliant with local regulations. Broadcom has even added "Sovereignty-Aware" scheduling, which ensures that certain workloads are never moved to hardware that doesn't meet specific security or residency requirements. This "Hardened Enclave" approach is essential for the 2026 regulatory landscape.

Conclusion: The Private Cloud Strike Back

VMware VCF 9.1 is a powerful signal that the private cloud is far from dead; in fact, it is evolving to meet the most demanding challenges of the AI era. By solving the memory wall with intelligent tiering and embracing a multi-vendor GPU ecosystem, Broadcom is giving enterprises the tools they need to build their own "AI Factories." As we move further into 2026, the choice between public and private cloud is no longer about capability, but about economics, control, and sovereignty. With VCF 9.1, the private cloud is making a very strong case for itself as the foundation of the sovereign AI era.

Stay Ahead

Get the latest technical deep dives on AI and infrastructure delivered to your inbox.