WilliamLam.com

  • About
    • About
    • Privacy
  • VMware Cloud Foundation
    • VMware Cloud Foundation 9
  • VKS
  • Homelab
    • Hardware Options
    • Hardware Reviews
    • Lab Deployment Scripts
    • Nested Virtualization
    • Homelab Podcasts
  • VMware Nostalgia
  • Apple

Modding an NVIDIA RTX 4000 Ada (20GB VRAM) into Minisforum MS-A2

10.08.2025 by William Lam // 2 Comments

The Minisforum MS-A2 has been my go to platform for running an optimized and fully functional VMware Cloud Foundation (VCF) 9.0 lab environment. From vSAN ESA to NSX VPC to vSphere Kubernetes Service to VCF Automation to Data Services Manager, it runs it all!

Over the past couple of weeks, I had also been experimenting with running VMware Private AI Services (PAIS) using the MS-A2. For those with an eye for detail, may have noticed that the NVIDIA GPU requirement was actually being satisfied by an ASUS NUC 14 Performance, which includes an NVIDIA GeForce RTX 4070 (8GB VRAM). The ASUS NUC was purely for validation and prototyping purposes to ensure that I could actually run PAIS before investing in a more capable and certainly more expensive NVIDIA GPU šŸ¤‘

One of the lesser known capabilities of the MS-A2 is the ability to add a half-height PCIe device and from the very beginning when I had first heard about the MS-A2, my plan and hope was to be able to add an NVIDIA GPU to the system!

[Read more...]

Categories // Private AI Services, VMware Cloud Foundation Tags // Minisforum, VCF 9.0

MS-A2 VCF 9.0 Lab: Deploying Model Endpoint with DirectPath I/O using VMware for Private AI Services (PAIS)

10.07.2025 by William Lam // Leave a Comment

In this final blog post, we will now deploy several AI model endpoints (downloaded from Hugging Face), configure our private data source which can be a shared location (Google Drive, Confluence, Microsoft Sharepoint, or S3-compatible endpoint) or using local filesĀ and then consuming them using an AI Agent that is built using VMware for Private AI Services (PAIS).

As mentioned in the very first blog post of this mini-series, my goal was to get hands experience with PAIS but without the need to have anĀ NVIDIA GPU capable of vGPU, which would also require an NVIDIA AI for Enterprise (NVAIE) license.

Luckily, we can use an NVIDIA GPU via DirectPath I/O, thanks to the backend plumbing the PAIS Engineering team have built and had shared with me 😊

For my proof of concept, I am using an ASUS NUC 14 Performance, which has an NVIDIA GeForce RTX 4070 mobile GPU (8GB VRAM). The ASUS NUC 14 is running alongside my Minisforum MS-A2 setup, is only used to deploy the completions model endpoint. The use of the ASUS NUC 14 is purely for prototyping and experimentation purposes to demonstrate that anyone can play with PAIS within their lab environment. I plan to use a more powerful NVIDIA GPU setup, which I will share more details at a later point for those interested.

References:

  • Running Completion or Embedding Models by Using Model Endpoints
  • Adding Context to Model Responses by Using Knowledge Bases
  • Deploy an Agent for a Generative AI Application

Requirements:

  • VCF Automation (VCFA) Organization configured with Namespace
  • VMware Private AI Services (PAIS) enabled
  • Data Services Manager (DSM) configured with VCFA
  • Authentik IdP configured with OIDC Public Client Application
  • Harbor instance configured for AI model storeĀ 
  • PAIS instance deployed

[Read more...]

Categories // Private AI Services, VMware Cloud Foundation Tags // VCF 9.0

MS-A2 VCF 9.0 Lab: Deploying VMware Private AI Services (PAIS)

10.01.2025 by William Lam // 2 Comments

In this blog post, we will now deploy an instance of VMware Private AI Services (PAIS) that will use the Vector Database that was provisioned earlier and the OIDC Client Application that we had also setup earlier using the Authentik Identity Provider (IdP).

Requirements:

  • VCF Automation (VCFA) Organization configured with Namespace
  • VMware Private AI Services (PAIS) deployed
  • Data Services Manager (DSM) configured with VCFA
  • Authentik IdP configured with OIDC Public Client Application
  • Harbor instance configured for AI model storeĀ 

[Read more...]

Categories // Private AI Services, VMware Cloud Foundation Tags // VCF 9.0

  • 1
  • 2
  • 3
  • Next Page »

Search

Thank Author

Author

William is Distinguished Platform Engineering Architect in the VMware Cloud Foundation (VCF) Division at Broadcom. His primary focus is helping customers and partners build, run and operate a modern Private Cloud using the VMware Cloud Foundation (VCF) platform.

Connect

  • Bluesky
  • Email
  • GitHub
  • LinkedIn
  • Mastodon
  • Reddit
  • RSS
  • Twitter
  • Vimeo

Recent

  • Every Mini PC & SFF Hardware Announced at CES 2026 01/21/2026
  • Improved Workaround for NSX Edge Deployment & Upgrade to VCF 9.0.2 running AMD Ryzen CPUs 01/20/2026
  • Disable HTTP Range Requests on Synology WebStation, Apache or Nginx 01/14/2026
  • Quick Tip - Correlating VCF Component to Bundle ID/Name 01/08/2026
  • TLS Chain of Trust when using SSL Inspection with VCF Download Tool (VCFDT) 01/07/2026

Advertisment

Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.

To find out more, including how to control cookies, see here: Cookie Policy

Copyright WilliamLam.com © 2026

 

Loading Comments...