Home AINVIDIA is unlocking AI computing at scale and inviting partners to accelerate the expansion of AI infrastructure

NVIDIA is unlocking AI computing at scale and inviting partners to accelerate the expansion of AI infrastructure

by OmarAli
NVIDIA is unlocking AI computing at scale and inviting partners to accelerate the expansion of AI infrastructure

As AI moves from model development to production inference, computational needs are accelerating and shifting toward continuous AI factories that generate tokens at scale. This transformation requires access to large-scale, multi-tenant accelerated computing that can come online quickly, remain highly utilized, and support the economics of token-scale AI services.

Emerging AI companies have historically had limited access to capital-intensive infrastructure, and even long-term commitments have not been enough to unlock funding for computing power.

To address this issue, NVIDIA is introducing a new business model that opens up computational access to the rapidly growing AI ecosystem of startups, modelers, enterprises, research organizations and regional AI players.

banner

This new model enables AI Clouds to procure NVIDIA infrastructure for AI native, enterprise and ISV customers through economic alignment with a revenue share and credit support model. Under the partnership, AI Clouds will sell NVIDIA-based cloud services, with NVIDIA receiving both standard product revenue and a share of cloud revenue for supported capacity. This structure accelerates the adoption of NVIDIA platforms in the high-growth, high-conviction AI-native sector and provides NVIDIA with a recurring, usage-based revenue stream.

For model builders, inference providers, agent platforms, and companies scaling AI, this can mean faster access to accelerated full-stack computing without having to wait for site selection, power procurement, construction, and hardware deployment.

NVIDIA AI factory capacity is aligned with demand

The initiative is already taking shape: AI cloud companies are building DSX AI factories designed to serve customers and workloads in different regions.

Sharon AI and Firmus are among the first companies to partner with NVIDIA on this new business model.

Sharon AI deploys up to 40,000 NVIDIA Grace Blackwell GB300 GPUs.

“This strategic collaboration with NVIDIA marks a pivotal moment in Sharon AI’s mission to deliver sovereign, large-scale AI computing infrastructure,” said James Manning, co-founder and CEO of Sharon AI.

Firmus is building a DSX AI factory campus in Batam, Indonesia. The campus will be scaled to 360 megawatts and up to 170,000 NVIDIA GPUs.

“AI-native companies need access to scalable, energy- and cost-efficient computing infrastructure to compete globally,” said Tim Rosenfield, co-CEO of company Technologies. “Firmus AI Cloud is building an NVIDIA DSX-focused AI factory that will enable our cloud to give more customers access to the computing power they need to build and scale AI.”

AI natives like Baseten, Fireworks AI, and Together AI show where computing needs are headed: They need immediate access to AI cloud capacity to perform model training, post-training, fine-tuning, and high-volume agent inference for developers, digital natives, and companies building with AI.

Your customers need reliable access to NVIDIA accelerated computing at scale as usage increases, but also need commercial flexibility as products move from pilot to production.

To secure computing capacity and build and deploy AI models, contact Sharon AI and Firmus.

Find out more about NVIDIA Cloud Partner And AI factories.

https://blogs.nvidia.com/blog/nvidia-unlocks-ai-compute-at-scale-capital-partners-to-power-ai-infrastructure-buildout/

Viral Trends

This website uses cookies to improve your experience. We'll assume you're ok with this, but you can opt-out if you wish. Accept Read More