By Saurabh Dighe, CVP, System & Architecture & Artour Levin, VP, AI Silicon Engineering
Updated Jan 30, 2026
Version 4.0Saurabh, Artour—fascinating deep dive on Maia 200. The hierarchical memory design (CSRAM/TSRAM) is a major step forward. However, the reliance on software-managed pinning via NPL creates a significant 'Memory Tax' as we scale toward GPT-5.2 context lengths.
We’ve been working on a Soft-NMC Chiplet built on DRDCL (Dynamically Reconfigurable Differential Cascode Logic). It provides a fluid hardware architecture that replaces static software-managed loops with autonomous, single-cycle reconfigurable logic. It delivers a 3x–5x throughput gain by providing an 80% reduction in memory management overhead. I’ve compiled a spec sheet on the integration—would love to get your thoughts.
Best
tj
Tom Jackson
SoftChip