Accelerating AI Workloads: Mastering vGPU Management in VMware Environments

Explore 2025 Session Recap – INVB1158LV

Are you looking to maximize AI/ML performance in your virtualized environment? At VMware Explore 2025, I attended a compelling session — INVB1158LV: Accelerating AI Workloads: Mastering vGPU Management in VMware Environments — that unpacked how to effectively configure and scale GPUs for AI workloads in vSphere.

This blog post shares key takeaways from the session and outlines how to use vGPU, MIG, and Passthrough to achieve optimal performance for AI inference and training on VMware Cloud Foundation 9.0.


vGPU Configuration Options in VMware vSphere

🔹 1. DirectPath I/O (Passthrough)

  • A dedicated GPU is assigned to a single VM or containerized workload.
  • Ideal for maximum performance and full GPU access (e.g., LLM training).
  • No sharing or resource fragmentation.

🔹 2. NVIDIA vGPU – Time Slicing Mode

  • Shares one physical GPU across multiple VMs.
  • Each VM gets 100% of GPU cores for a slice of time, while memory is statically partitioned.
  • Supported on all NVIDIA GPUs.
  • Useful for efficient GPU sharing, especially for model inference and dev/test setups.

✅ Example profiles: grid_a100-8c, grid_a100-4-20c

🔹 3. Multi-Instance GPU (MIG)

  • Available on NVIDIA Ampere & Hopper (e.g., A100, H100).
  • Splits GPU into isolated hardware slices (compute + memory).
  • Offers deterministic performance and better isolation.
  • Best for multi-tenant AI inference, production-grade deployments.

✅ Example profiles: MIG 1g.5gb, MIG 2g.10gb, MIG 3g.20gb
✅ Assignable via vSphere UI with profiles like grid_a100-3-20c


Time Slicing vs. MIG – When to Use What?

ModeBest ForSharing Type
Time SlicingLLM training, dev/test environmentsTime-shared
MIGProduction inference, multitenancySpatial (hardware)
PassthroughMaximum performance for single workloadNot shared

Smarter vMotion for AI Workloads in VCF 9.0

One of the standout improvements presented during session INVB1158LV was the vMotion optimization for VMs using vGPUs. With vSphere 8.0 U3 and VMware Cloud Foundation 9.0, the way vMotion handles GPU memory has been completely reengineered to minimize downtime (stun time) during live migration.

Instead of migrating all GPU memory during the VM stun phase, 70% of the vGPU cold data is now pre-copied in the pre-copy stage, and only the final 30% is checkpointed during stun. This greatly accelerates live migration even for massive LLM workloads running on multi-GPU systems.

📊 Example results with Llama 3.1 models:

  • Migrating a VM using 2×H100 GPUs (144 GB vGPU memory) saw stun time drop from 24.5s to just 6.3s.
  • Migrating a large model on 8×H100 (576 GB) now completes in 21s, compared to 325s for a power-off-and-reload approach — that’s a 15× improvement.

These enhancements make zero-downtime AI infrastructure upgrades and scaling possible, even for large language model deployments

Deploying a Minimal VCF 9.0 Lab – Insights from Explore 2025

I had the pleasure of attending the excellent session “Deploying Minimal VMware Cloud Foundation 9.0 Lab” by Alan Renouf and William Lam at VMware Explore 2025. It was packed with practical advice, hardware insights, and field-tested tips on how to stand up a fully functional VCF environment—even on a tight budget.

Whether you’re a home lab enthusiast, enterprise architect, or just VCF-curious, here’s a recap of the key takeaways.


Key Changes: VCF 5.x vs VCF 9.x

VCF 5.x:

  • Required 4+ ESXi hosts
  • Monolithic installer
  • vSAN required
  • 3-node NSX cluster
  • 10GbE NICs mandatory

VCF 9.x:

  • More modular design
  • Only 2–3 ESXi hosts required
  • 1 x 10GbE NIC sufficient
  • Support for singleton appliances
  • Flexible storage (vSAN ESA, FC, NFS)

VCF 9.0 Tips & Tricks (with real CLI guidance)

Here’s the juicy part—real-world deployment tips and overrides:

1. Minimum ESXi Host Requirements

  • For vSAN/FC: 3 ESXi hosts
  • For NFS: 2 ESXi hosts
  • ⚠️ You can install VCF Installer + SDDC Manager even on a single ESXi host (great for nested labs!)
> cat /home/vcf/feature.properties

feature.vcf.internal.single.host.domain = true

> echo 'y' | /opt/vmware/vcf/operationsmanager/scripts/cli/sddcmanager_restart_services.sh

2. NIC Validation Bypass

If your ESXi host doesn’t have a 10GbE NIC:

> cat /etc/vmware/vcf/domainmanager/application.properties

enable.speed.of.physical.nics.validation = false

> echo 'y' | /opt/vmware/vcf/operationsmanager/scripts/cli/sddcmanager_restart_services.sh

3. vSAN HCL Override

VCF Installer will fail validation if your SSD or controller is not on the vSAN ESA HCL. Install a “mock” VIB to bypass:

esxcli software vib install -v /tmp/vsan-mock.vib

4. Offline Depot HTTPS Requirement

By default, the VCF installer requires HTTPS:

cat /opt/vmware/vcf/lcm/lcm-app/conf/application-prod.properties

lcm.depot.adapter.httpsEnabled=false

systemctl restart lcm

5. Basic Auth Requirement

You don’t need a full-blown web server:

python http_server_auth.py --bind 192.168.1.100 --user myuser --password mysecurepassword --port 443 --directory /myrepo

Reference Hardware for Minimal Lab

Here’s an example BOM shared by the presenters:

  • MinisForum MS-A2 w/ AMD Ryzen 7945HX (16c/32t)
  • 128GB DDR5 (2x64GB SODIMM)
  • 3x M.2 NVMe SSDs
  • 10GbE SFP+ NIC + 2.5GbE onboard
  • MikroTik 5-port 10GbE switch (for under $200)

This setup is small, powerful, and flexible enough for a complete VCF 9.0 deployment.


Deployment Walkthrough – TL;DR

Here’s the summarized 8-step flow:

  1. Install ESXi (kickstart from USB)
  2. Deploy VCF Installer VM
  3. Connect to Offline Depot
  4. Run Installer with JSON
  5. Configure vSAN ESA
  6. Deploy vCenter
  7. Update Storage Policies
  8. Deploy SDDC Manager, NSX, Fleet Manager, Automation, etc.

Summary

This session truly showcased how far VCF has come in terms of flexibility and accessibility. More info: VMware Cloud Foundation (VCF) 9.x in a Box.
All trademarks belong to their respective owners.

Automagically Deploy VMware Cloud Foundation 9.0 in Your Homelab with the Holodeck Toolkit

VMware Explore 2025 featured an exciting session, [CODEQT1247LV] Automagically Deploy VMware Cloud Foundation 9.0 in Homelab Using VMware Cloud Foundation Holodeck Toolkit. The presentation showcased how the VMware Cloud Foundation Holodeck Toolkit makes it easier than ever to deploy a fully functional nested VMware Cloud Foundation (VCF) 9.0 lab—in just a few commands.

For IT professionals, architects, developers, and homelab enthusiasts, this toolkit is a breakthrough. Instead of struggling with the complexity of a VCF deployment, Holodeck offers a repeatable, automated process that saves both time and effort.


What Is the VMware Cloud Foundation Holodeck Toolkit?

The Holodeck Toolkit is a PowerShell-based automation framework designed for building nested VCF environments on VMware ESXi or a vSphere cluster. It’s ideal for:

  • Proof-of-concept deployments
  • Technical demos and enablement sessions
  • Training environments
  • Hands-on testing of VCF 5.2.x and VCF 9.0 features

With just four PowerShell commands, users can deploy an entire VCF management domain and optional workload domains, complete with NSX, vCenter, SDDC Manager, and vSAN ReadyNodes.


Key Features of VMware Holodeck for VCF 9.0

The session highlighted several powerful features included in Holodeck environments:

  • VCF 9.0 and 5.2.x support
  • Management Domain with 4 nested ESXi hosts, vSAN, vCenter, NSX, and SDDC Manager
  • Optional Workload Domain with 3 nested ESXi hosts and Supervisor services
  • vSAN ESA & OSA support
  • NSX Edge Clusters deployable in both management and workload domains
  • Provision-only mode for greenfield VCF Installer + ESXi testing
  • Custom CIDR support for flexible network design

This makes Holodeck an ideal solution for homelabs and VCF training environments, but it is not intended for production workloads.


The HoloRouter Appliance: Backbone of the Lab

At the core of every Holodeck deployment is the HoloRouter appliance, a Photon OS–based virtual appliance that provides critical networking and infrastructure services for the nested VCF environment:

  • DNS, DHCP, NTP
  • Firewall and BGP routing
  • Proxy services for VCF Installer
  • L3 routing between nested VLANs
  • Webtop GUI for easy access and management
  • Job scheduling and automation
  • Integrated PowerShell + PowerCLI

The HoloRouter ensures smooth connectivity between nested VCF environments and external networks, enabling realistic lab testing.


Pre-requisites

VCF 9.0Single SiteDual Site
CPU2448
Memory325 GB768 GB
Disk1.1 TB2.5TB

Links:

Building your GenAI Agents on VCF with Private…

At today’s VMware Explore general session, Chris Wolf showcased Intelligent Assist for VMware Cloud Foundation — bringing AI-powered assistance directly to our users.

Building your GenAI Agents on VCF with Private…

Today at VMware Explore’s general session you saw Chris Wolf demonstrate Intelligent Assist for VMware Cloud Foundation, providing AI-powered assistance for our users. In this blog, we’ll take a step behind the curtain to see how these capabilities are running in VCF, using AI features that […]


Broadcom Social Media Advocacy

Advancing AI for enterprises: Announcing…

Today at hashtag#VMwareExplore, Broadcom and AMD announced an expanded collaboration to deliver private, secure, and high-performance AI infrastructure for enterprises.

Advancing AI for enterprises: Announcing…

Artificial Intelligence (AI) is rapidly transforming industries, and Generative AI (Gen AI) is pushing the boundaries of what’s possible, creating new content and redefining value creation. However, enterprises face significant challenges in AI adoption, especially concerning privacy, data […]


Broadcom Social Media Advocacy

Unleashing the Power of Private AI: New…

VMware Private AI Services Now Included in VCF Subscription
Big news from today’s VMware Explore general session with Chris Wolf 🎉
What was once sold separately is now part of the platform — VCF Private AI Services are now included in your VMware Cloud Foundation subscription.

This marks a major step forward in unleashing the power of Private AI, fueled by Broadcom and NVIDIA innovations.

Unleashing the Power of Private AI: New…

Enterprises can get tremendous productivity and business transformation from AI. With VMware Private AI Foundation with NVIDIA, Broadcom and NVIDIA aim to unlock AI and unleash productivity with lower TCO. Recently with VCF 9.0, Broadcom and NVIDIA released several features in VMware Private AI Foundation with NVIDIA to further our mission of providing private and … Continued The post Unleashing the Power of Private AI: New Innovations from Broadcom with NVIDIA appeared first on VMware Cloud…Read More


Broadcom Social Media Advocacy

The Communities Lounge at VMware Explore 2025

The Communities Lounge at VMware Explore 2025

The Communities Lounge is the main place for the VMware community to meet and reconnect at Explore, right at the center of the event. I like to think of it as one of the great train stations of old cities. It carries a timeless feeling: the place where journeys cross, where you meet faces […]


Broadcom Social Media Advocacy

VMware Explore 2025 in Las Vegas | Day 1 Recap

VMware Explore 2025 in Las Vegas | Day 1 Recap

VMware Explore 2025 in Las Vegas | Day 1 Recap

VMware Explore 2025 in Las Vegas Day 1 Recap | August 25, 2025 If you heard the collective buzz across the Venetian Convention and Expo Center today, that was Day 1 of VMware Explore in Las Vegas coming to life. You crushed it—140 sessions across all four tracks, endless hallway conversations, and a few “just … Continued The post VMware Explore 2025 in Las Vegas | Day 1 Recap appeared first on VMware Explore Blog.


Broadcom Social Media Advocacy

Exploring Beyond VMware Explore: Best Things to Do in Las Vegas (Beyond the Casinos)

I hope you’re enjoying yourselves in the spectacular city of Las Vegas! While the Strip is world-famous for its casinos and neon lights, there’s so much more waiting for you beyond the roulette tables. If you find yourself with a bit of spare time, here are some gems you won’t want to miss.


Welcome to Fabulous Las Vegas Sign

Kick off your adventure at the iconic “Welcome to Fabulous Las Vegas” sign, located at the southern end of the Strip. It’s the ultimate photo stop and a must-see landmark for every visitor. Arrive early in the morning if you’d like to beat the crowds and grab your selfie without waiting in line.


Ride the Deuce Bus Along the Strip

One of the easiest—and most affordable—ways to explore Las Vegas is the Deuce bus, which runs 24/7 along the Strip. For just a few dollars, you can travel all the way from the Las Vegas sign to downtown. Along the ride you’ll pass mega-resorts, quirky hidden gems, and plenty of photo-worthy spots.

💡 Tip: Tickets are simple to get—just download the RideRTC app


Downtown Las Vegas – Fremont Street

At the end of the bus ride, you’ll land in downtown Las Vegas, the birthplace of the city’s casino culture. Don’t miss Binion’s Casino, where you can take a free photo with one million dollars in cash (yes, real bills!).

By night, Fremont Street transforms into a festival of sound and light, with live music stages, street performers, and the jaw-dropping Fremont Street Experience—a massive canopy that comes alive with LED animations and booming music.


Take a Break From the Strip

When the neon starts to feel overwhelming, step into nature. Just a short drive away lies Red Rock Canyon, a breathtaking desert escape with scenic drives and hiking trails. Or head to Lake Mead, the vast reservoir created by the Hoover Dam. Whether you swim, kayak, or simply relax by the shore, it’s the perfect place to recharge away from the Strip.


Day Trips Beyond Las Vegas

Looking for a true bucket-list experience?

  • Grand Canyon, Arizona – Standing on its rim and gazing into its endless depths is a moment you’ll never forget.
  • Antelope Canyon, Arizona – If your journey takes you further east, its smooth sandstone walls, painted in red and gold, create one of the most photogenic spots on Earth.

💡 Tip: To easily browse and book tours, check out the GetYourGuide app


Now Is the Time to Explore!

Las Vegas isn’t just about entertainment—it’s a gateway to adventures, culture, and some of the most stunning natural wonders in the American Southwest. So pack your curiosity, grab your camera, and start discovering!

Session Scheduling Is Now Live: VMware Explore 2025 Las Vegas

If you’re planning to attend VMware Explore 2025 in Las Vegas, now’s the time to act—Session Scheduling Is Now Live! 🚨

Don’t risk landing on a waiting list for the most sought-after sessions. Some of them fill up quickly, especially those focused on VMware Cloud Foundation 9.0 (VCF), Private AI, and hands-on labs led by community legends.

🎯 Why I’m Excited About Explore 2025

As someone who’s both a cloud infrastructure nerd and an AI enthusiast, VMware Explore is the perfect blend of both worlds. From the latest features in VCF 9.0 to deep dives into AI inference at scale, this year’s agenda is packed with innovation. I’m especially hyped for the sessions that explore how to run enterprise-grade workloads (like LLMs!) on VMware infrastructure, even in constrained environments like home labs.

🧠 My Personal Session Picks

I’ve handpicked 10+ sessions I consider absolute must-attends—whether you’re an AI practitioner, SDDC architect, or storage performance junkie.

📝 I shared my detailed picks already in this post:
👉 My Top 10 Sessions You Can’t Miss at VMware Explore 2025

Here’s a quick refresher:

Top 10 Sessions You Should Lock In Right Now

1️⃣ Deploying Minimal VMware Cloud Foundation 9.0 Lab
📎 CLOB1201LVWilliam Lam
Perfect for home labbers and resource-conscious architects.

2️⃣ Real-World Lessons in Rightsizing VCF for On-Prem AI Workloads
📎 INVB1300LVFrank Denneman & Johan van Amersfoort
A deep and practical look at balancing AI infra needs.

3️⃣ Unleashing AI Inference at Scale with NVIDIA Dynamo
📎 INVB1070LVDenneman & Kelly
Mind-blowing scale potential for LLMs on VMware Private AI.

4️⃣ Encryption, Keys & Secrets for Kubernetes – Part 1 & 2
📎 CMTYQT1284LV & CMTYQT1285LVWaldemar Pera
Security-first deep dive for modern hybrid cloud architectures.

5️⃣ Six Innovations Redefining Storage and DR in VCF
📎 CLOB1028LVDuncan Epping
From ransomware recovery to S3, this is storage future today.

6️⃣ vSAN Express Storage Architecture Deep Dive
📎 CLOB1067LVEpping & Nicholson
If you thought vSAN couldn’t get faster—you’ll be surprised.

7️⃣ Mastering vGPU Management in VMware for AI
📎 INVB1158LVShawn Kelly & Justin Murray
GPU slicing, vMotion, best practices—all in one session.

8️⃣ Security & Resilience for Sovereign AI
📎 CLOB1262LVBob Plankers & Justin Murray
Vital for organizations building compliant, sovereign AI solutions.

9️⃣ What’s New in VCF 9.0 Security & Compliance
📎 CLOB1261LVBob Plankers
Stay audit-ready with the latest VCF 9.0 compliance updates.

BONUS: SPECIAL INVITE – Homelab Meets SAM2: Fine-Tune Like a Pro
📎 CMTYQT1227LV
I’ll be co-presenting this one! We’ll show how we fine-tuned SAM2 for video segmentation using just a homelab.


📌 Why You Should Go to VMware Explore

Whether you’re an enterprise architect, a cloud engineer, or just a tinkerer with a homelab and ambition—VMware Explore is where the future of infrastructure gets built. It’s not just about announcements—it’s about ideas, connections, and the spark that fuels your next big project.

🌐 Register now and lock in your schedule:
👉 https://brcm.tech/3YYd1yg

Don’t miss out. I’ll see you there—maybe at the SAM2 session! 😉