Discussion about this post

User's avatar
Max's avatar

The GB10 is exciting initially but evaluating its details changes first impressions. No ECC memory risks silent data corruption: a bit-flip could taint model weights, yielding unreliable outputs that devs blame on their code, not hardware. For deployment, retrain the final prototype on an ECC system to ensure integrity. Your estimates confirm our theoretical tuning estimates: 14B params for 128GB, 26B for 256GB. GB10's CPU-GPU link is amazing ~600 GB/s, but the ConnectX-7 between two Sparks crawls at ~25 GB/s. Spotted the LMSYS article last night—meant to read today, but you already nailed the dissection. Love your stats take over mine!

Expand full comment

No posts