NVIDIA Vera Rubin Enters Full Production, Set to Slash AI Inference Costs by 10-Fold

By: CFM 2026-01-06 02:49 (UTC+0)

NVIDIA CEO Jensen Huang announced that the company's next-generation artificial intelligence platform, "Vera Rubin," has entered full-scale production and is scheduled to begin shipping to partners in the second half of 2026.

Compared with the previous-generation architecture, the Rubin platform has only 1.6 times as many transistors, yet it delivers a fivefold improvement in inference performance while cutting the cost per token by 90%, to one-tenth of its previous level.
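The headline's "10-fold" figure and the 90% cost reduction describe the same change, and the two ratios quoted above also imply a gain per transistor. The short Python sketch below works through that arithmetic. The function names are illustrative only, and the inputs are the figures reported in this article, not independent measurements.

```python
def cost_reduction_fraction(fold: float) -> float:
    """Fraction of cost removed when cost falls to 1/fold of its old value."""
    return 1.0 - 1.0 / fold


def per_transistor_gain(perf_gain: float, transistor_ratio: float) -> float:
    """Performance improvement normalized by the growth in transistor count."""
    return perf_gain / transistor_ratio


if __name__ == "__main__":
    # Figures as reported in the article (assumed, not measured here).
    fold = 10.0              # "10-fold" lower cost per token
    perf_gain = 5.0          # fivefold inference performance
    transistor_ratio = 1.6   # 1.6 times as many transistors

    print(f"Cost reduction: {cost_reduction_fraction(fold):.0%}")
    print(f"Inference gain per transistor: "
          f"{per_transistor_gain(perf_gain, transistor_ratio):.3f}x")
```

Under these reported figures, a 10-fold cost cut corresponds to a 90% reduction, and inference performance per transistor rises by roughly 3.1 times, which suggests that most of the claimed gain comes from design changes and not from transistor count alone.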

Huang pointed out that NVIDIA broke from its traditional approach of updating only one or two chips per generation, instead completely redesigning all six chips from the ground up. These chips include the Vera central processing unit (CPU), the Rubin GPU, the NVLink switch, the ConnectX-9 SuperNIC, the BlueField-4 DPU, and the Spectrum-6 Ethernet switch.

The Vera CPU and Rubin GPU were not developed in isolation but were co-designed to enable high-speed, low-latency bidirectional data sharing. NVIDIA also designed the ConnectX-9 network card specifically for the Vera processor, and the technologies were announced together only after their integration was complete.

Huang emphasized that when training a model with 10 trillion parameters, a Vera Rubin system can complete the task in the same timeframe as a Blackwell system while occupying only a quarter of the physical space. Performance per watt has also improved by approximately 10 times over Blackwell, a metric directly linked to data center profitability. The Vera Rubin platform cuts AI inference costs by a factor of 10 and supports "confidential computing," meaning all data is encrypted in transit, at rest, and during computation.
