AMSTERDAM--(BUSINESS WIRE)--Nebius Group N.V. (NASDAQ:NBIS), a leading AI infrastructure company, today announced the launch of its first GPU cluster in the United States with a deployment in Kansas City, MO, bringing its AI-native cloud closer to American customers.
Scheduled to go live in Q1 2025, the Kansas City cluster will house thousands of state-of-the-art NVIDIA GPUs, primarily H200 Tensor Core GPUs in the initial phase, with the energy-efficient NVIDIA Blackwell platform expected to arrive in 2025. The colocation can be expanded from an initial 5 MW to as much as 40 MW, or about 35,000 GPUs, at full potential capacity.
Nebius is actively ramping up its presence in the US as part of its strategy to become a leading provider of AI infrastructure to AI builders globally, and is in advanced discussions for a second, larger-scale GPU cluster in the US, also slated to come online in 2025. The Company has also opened two new customer-facing hubs in San Francisco and Dallas, with a third office set to open in New York later this year.
Arkady Volozh, founder and CEO of Nebius, said:
“Our first GPU cluster in the US and new offices represent a pivotal step in our expansion in the US market. Serving American customers from American facilities means lower latency and maximizes the advantages of our AI-native cloud. We will be building out more GPU clusters across the US to meet exploding demand for high-quality AI infrastructure from US AI developers and enterprises.”
Built on the latest NVIDIA GPUs, with a fleet of H100s already installed and H200s coming onstream this month, Nebius’s full-stack AI infrastructure is purpose-built to meet the demands of the global AI industry and draws on deep technical expertise across hardware and software, cloud engineering and machine learning (“ML”).
Publicly announced in October, the AI-native Nebius cloud is designed to manage the full ML lifecycle – from data processing and training through to fine-tuning and inference – all in one place. The recently launched Nebius AI Studio inference service expands the Company’s offering to app builders, with access to a range of state-of-the-art open-source models in a flexible, user-friendly environment, at a price per token that is among the lowest on the market.
Nebius has a team of around 400 engineers with decades of experience building world-class tech infrastructure, as well as an in-house large language model (“LLM”) R&D team. Listed on Nasdaq, the Company recently announced investments of more than USD 1 billion in AI infrastructure by mid-2025, enabling Nebius to deploy tens of thousands of NVIDIA GPUs to bring its highly differentiated, energy-efficient, AI-native cloud offering to customers worldwide.
About Nebius
Nebius is a technology company building full-stack infrastructure to service the explosive growth of the global AI industry, including large-scale GPU clusters, cloud platforms, and tools and services for developers. Headquartered in Amsterdam and listed on Nasdaq, the Company has a global footprint with R&D hubs across Europe, North America and Israel.
Nebius’s core business is an AI-centric cloud platform built for intensive AI workloads. With proprietary cloud software architecture and hardware designed in-house (including servers, racks and data center design), Nebius gives AI builders the compute, storage, managed services and tools they need to build, tune and run their models.
A Preferred cloud service provider in the NVIDIA Partner Network, Nebius offers high-end infrastructure optimized for AI training and inference. The Company boasts a team of around 400 skilled engineers, delivering a true hyperscale cloud experience tailored for AI builders.
To learn more, please visit www.nebius.com