Workstation GPU

NVIDIA RTX PRO 6000 Blackwell Workstation Edition

96GB of GDDR7 ECC and 4,000 AI TOPS in a tower — the single-card king for local LLMs and 3D.

92/ 100
Outstanding
Android Hire score
Enterprise
Professional pricing · ~$8.5k+
96GB GDDR7 ECC
Memory
125 TFLOPS
FP32
4,000 TOPS
AI
600 W
Power
NVIDIA RTX PRO 6000 Blackwell Workstation Edition

Image: NVIDIA

Best for

Local LLMs, 3D rendering, 8K video and simulation on one card.

Standout

96GB + ECC has no consumer equal, plus certified pro drivers.

Watch out

Overkill and poor value if your workload fits inside 32GB.

By Aditya Singh

The NVIDIA RTX PRO 6000 Blackwell Workstation Edition is the desk-side flagship of NVIDIA's professional Blackwell lineup. It drops the full GB202 GPU and an enormous 96 GB of GDDR7 ECC into a tower-friendly, actively cooled card with certified pro drivers. For creators and engineers, the headline isn't raw frame rate — it's the ability to hold a 70-billion-parameter model, a billion-polygon scene, or hours of 8K footage entirely in VRAM on one card.

Quick verdict: The single-card king for memory-hungry professional work — local LLMs, large-scene rendering, 8K video and simulation. 96GB + ECC has no consumer equal, and it cleanly outclasses the RTX 6000 Ada. The catch is price (≈$8,500, often $10k–13k) and a 600W / 12″ footprint. If your workload fits in 32GB, an RTX 5090 is far better value. Score: 92/100.

RTX PRO 6000 Workstation Edition — Full Specifications

Core specs are from NVIDIA's official datasheet; third-party figures (clocks, node, PSU guidance) are flagged.

SpecificationDetail
GPU & Architecture
GPU chipGB202 (Blackwell) — NVIDIA does not name the die
Process nodeTSMC 4N-class (third-party; not on datasheet)
CUDA cores24,064
Streaming Multiprocessors188 SMs (enabled)
RT cores188 (4th gen) — 380 RT TFLOPS
Tensor cores752 (5th gen, FP4)
Base / boost clock~1,590 MHz / ~2,617 MHz (third-party; NVIDIA doesn't publish base clock)
Memory
Memory size96 GB GDDR7 with ECC
Memory bus512-bit
Memory bandwidth1,792 GB/s (~1.8 TB/s)
Compute & AI
FP32 (single precision)125 TFLOPS
RT core performance380 TFLOPS
AI performance4,000 AI TOPS (FP4 with sparsity)
Power & Cooling
Total board power600 W (fixed)
Power connector1× 16-pin (12V-2x6 / CEM5)
Recommended PSU~1,000 W system (third-party guidance)
CoolingActive double flow-through (dual axial)
Form Factor & Display
Form factorDual-slot, 5.4″ H × 12.0″ L (extended height)
InterfacePCIe 5.0 x16
Display outputs4× DisplayPort 2.1b (active)
Max resolutionUp to 8K @ 240 Hz, 16K @ 60 Hz
Media engine4× NVENC (9th gen) + 4× NVDEC (6th gen), 4:2:2 support, AV1
Enterprise Features
MIGUp to 4× 24 GB, 2× 48 GB, or 1× 96 GB
AI Management ProcessorYes
APIsDX12 (SM 6.6), OpenGL 4.6, Vulkan 1.3, CUDA, OpenCL 3.0
Availability & Price
LaunchAnnounced GTC, March 18, 2025; shipping later 2025
PriceMSRP ≈ $8,500 → NVIDIA later listed ≈ $13,250; street ≈ $8,000–$13,000+

Real-World Performance

Independent testing (StorageReview, Puget Systems) shows what the 96GB buys you:

  • Local LLMs: ran a 120B-parameter model in LM Studio at ~163 tokens/sec — a model the 48GB RTX 6000 Ada simply cannot load. Roughly 2× the Ada in MLPerf-style Llama2 text generation (8,008 vs 3,957).
  • 3D rendering: Blender "Monster" scene at 7,870 samples/min vs 5,632 on the RTX 6000 Ada (~40% faster), with strong multi-GPU scaling in OctaneRender.
  • AI image: Stable Diffusion 1.5 FP16 at 0.705 s/image vs 1.477 s on Ada (~2×).
  • Video / 8K: four 9th-gen NVENC engines with 4:2:2 hardware accelerate DaVinci Resolve and camera-native footage; ~23% faster than Ada in Topaz Video AI.

RTX PRO 6000 Workstation vs RTX 5090 — Which Should a Pro Buy?

Both are GB202. For gaming-rate FP32 the RTX 5090 is in the same ballpark and far cheaper — but the pro card wins on everything production-critical:

AttributeRTX PRO 6000 WSGeForce RTX 5090
VRAM96 GB GDDR732 GB GDDR7
ECC memoryYesNo
AI performance~4,000 AI TOPS~3,352 AI TOPS
Cooling exhaustDouble flow-through (vents out of case)Open-air
DriversCertified enterprise / ISVGeForce Game Ready
Data-center usePermittedRestricted by EULA
Price≈ $8,500+≈ $1,999 MSRP

The decision is simple: if your workload exceeds 32GB of VRAM (large models, huge scenes, long-context inference) the PRO 6000 is the only single-card answer. If it fits in 32GB, the RTX 5090 delivers most of the throughput for a quarter of the price.

What Creators & AI Devs Ask

  • "Is 96GB worth 4–6× the price of a 5090?" Only if you actually exceed 32GB — then it's not a luxury, it's the requirement.
  • "How big a model fits in 96GB?" A 70B model in FP8, or very large quantized models with long context, entirely in VRAM.
  • "vs the RTX 6000 Ada?" Roughly 2× the AI performance, ~40% faster in Blender, and double the VRAM (96 vs 48GB).
  • "Will it fit my workstation?" It's 12″ long, dual-slot, extended-height, draws 600W and uses a 16-pin connector — you need a full-size tower and a ~1,000W PSU.
  • "Certified drivers?" Yes — ISV-certified enterprise driver branch for CAD/DCC apps, unlike the GeForce line.

Pros and Cons

✅ Pros

  • 96GB GDDR7 with ECC — class-leading single-card capacity
  • ~1.8 TB/s bandwidth and full GB202 (24,064 CUDA / 752 Tensor / 188 RT)
  • 4,000 AI TOPS — ~2× the RTX 6000 Ada in AI workloads
  • Active double flow-through cooling vents heat out of the case
  • Certified enterprise drivers, ISV certifications, MIG, AI Management Processor
  • 4× DP 2.1b (8K@240 / 16K@60) and 4× NVENC with 4:2:2 for pro video
  • Drops into a standard tower (unlike the headless Server Edition)

❌ Cons

  • Very expensive — ~$8,500 MSRP, frequently $10k–13k+ at retail
  • 600W draw demands a high-wattage PSU and good airflow
  • 12″ length / extended height won't fit smaller cases
  • Single 16-pin connector carrying 600W — seat the cable carefully
  • Overkill (and poor value) if your work fits in 32GB — get an RTX 5090

The Bottom Line

For professionals who genuinely need 96GB on a single card, the RTX PRO 6000 Blackwell Workstation Edition is essentially the only choice — and a very good one. It's close to category-defining: unmatched capacity, ECC reliability, production-grade drivers and cooling, and a clean generational leap over the RTX 6000 Ada. The only real knocks are steep, volatile pricing and 600W power/cooling demands. Our score: 92/100.

Related reading

Frequently Asked Questions

How much VRAM does the RTX PRO 6000 Blackwell Workstation Edition have?

96 GB of GDDR7 with ECC on a 512-bit bus, delivering 1,792 GB/s (about 1.8 TB/s) of bandwidth. That's enough to hold a 70-billion-parameter model in FP8, billion-polygon scenes, or large quantized models with long context entirely in VRAM.

RTX PRO 6000 Workstation Edition vs RTX 5090 — what's the difference?

Both use the GB202 chip, but the PRO 6000 has 96GB of ECC memory (vs 32GB non-ECC), ~4,000 AI TOPS, certified enterprise drivers, double flow-through cooling, and is permitted for data-center use. The RTX 5090 is far cheaper and great if your work fits in 32GB; the PRO 6000 is for workloads that don't.

What power supply does the RTX PRO 6000 Workstation need?

The card has a fixed 600W board power and a single 16-pin (12V-2x6) connector, so a roughly 1,000W system PSU is recommended, along with a full-size tower that can fit the 12-inch, dual-slot, extended-height card.

Is the RTX PRO 6000 Workstation Edition good for local LLMs?

Yes — it's one of the best single-card options for local AI. In testing it ran a 120B-parameter model in LM Studio at around 163 tokens/sec and delivered roughly 2× the AI text-generation performance of the RTX 6000 Ada.

How much does the RTX PRO 6000 Workstation Edition cost?

It launched around $8,500 MSRP in 2025. Pricing has been volatile since — NVIDIA later listed it near $13,250, and street prices commonly range from about $8,000 to $13,000+ depending on supply and reseller.

NVIDIARTX PRO 6000BlackwellGraphics Cards