Hardware & Computing
Chip architecture, GPUs, quantum computing, custom silicon, memory-centric computing, semiconductor supply chains
Hardware & Computing
AI compute infrastructure in early 2026 is defined by two simultaneous paradigm shifts. On the classical side, the unit of compute is no longer the GPU chip but the rack — NVIDIA's Vera Rubin (50 PFLOPS FP4 per GPU, 288 GB HBM4 at 22 TB/s, 72-GPU racks at 260 TB/s aggregate) set the new ceiling. The foundry substrate shifted beneath it: TSMC N2 (2nm) in mass production Q4 2025, A16 (1.6nm) with backside-power Super Power Rail due late 2026 for NVIDIA's Feynman. HBM memory underwent an architectural shakeup — base dies moved from DRAM to TSMC logic processes, enabling HBM4E (2.5× bandwidth) and C-HBM4E (near-memory compute). On the commercial photonic side, 2026 is the year photonic computing became a production substrate: three anchor results — imec's PyTorch-integrated tensor processor (Nature Communications), Nokia Bell Labs' first on-chip photonic backprop (Nature), and Lightmatter's transformer-without-modification demo — prove photonic compute is no longer a research curiosity.
On the quantum side, four anchor results now define the landscape. Google Willow (Aug 2024) crossed below the surface-code threshold — exponential error suppression with Λ=2.14 per 2 distance steps, distance-7 code at 0.143% error/cycle. IBM Nighthawk + qLDPC (Nov 2025) delivered real-time error decoding in <480ns alongside a 120-qubit superconducting chip, with a concrete 2029 fault-tolerance target. Quantinuum's iceberg codes (March 2026) crossed beyond break-even with 94 logical qubits from only 98 physical. Besedin et al. in Nature Physics (Jan 2026) demonstrated the first lattice surgery on superconducting qubits — the compute primitive on surface codes. And in April 2026, Google expanded into neutral-atom quantum via QuEra, making neutral atoms a formal third architecture alongside superconducting and trapped-ion. 100,000-atom chambers are the scaling target.
The custom-silicon insurgency continues (hyperscaler ASICs at 44.6% CAGR vs GPU at 16.1%) but NVIDIA's rack-as-product lock-in is proving more durable than chip-level CUDA lock-in.
Frontier — What's Moving Now
- Quantum: three architectures, same 2029 target — superconducting (IBM + Google), trapped-ion (Quantinuum), neutral atom (QuEra/Atom/Pasqal).
- Below-threshold surface codes proven — Google Willow, Λ=2.14, 0.143% error/cycle at distance-7.
- Lattice surgery on superconducting — Besedin et al. Nature Physics Jan 2026, first on SC hardware.
- Neutral atom as third architecture — Google + QuEra (Apr 2026), Microsoft + Atom Computing, Pasqal.
- Photonic compute crossed to production — 3 anchor results in 2025-2026 (imec, Nokia Bell Labs, Lightmatter).
- TSMC Angstrom era — N2 in volume, A16 late 2026, first Nanosheet GAA node.
- HBM4E / C-HBM4E architectural shakeup — base dies on TSMC N3P, 2.5× bandwidth, near-memory compute.
- Memory-centric computing / PIM — 60–90% of system energy is data movement; C-HBM4E NMC is the commercial wedge; RowHammer/RowPress force memory intelligence; JEDEC + trillion-dollar incumbency is the real bottleneck.
- IBM verified quantum advantage target — end-2026.
Concept Map
Concepts
| Concept | Sources | Evidence | Frontier | Last Updated |
|---|---|---|---|---|
| Rack-Scale AI Compute | 2 | Strong | Active | 2026-04-09 |
| HBM4 Memory Architecture | 3 | Strong | Breakthrough | 2026-04-17 |
| Custom Silicon vs GPU | 2 | Strong | Active | 2026-04-09 |
| Silicon Photonics | 1 | Strong | Active | 2026-04-09 |
| Nanosheet GAA Transistor | 1 | Strong | Breakthrough | 2026-04-17 |
| Logical Qubit Error Correction | 4 | Strong | Breakthrough | 2026-04-17 |
| Quantum Fault Tolerance Roadmap | 5 | Strong | Active | 2026-04-17 |
| Photonic Neural Network | 3 | Strong | Breakthrough | 2026-04-17 |
| Neutral Atom Quantum Computing | 1 | Moderate | Breakthrough | 2026-04-17 |
| Processing-In-Memory & Memory-Centric Computing | 2 | Moderate | Active | 2026-04-21 |
Entities
| Entity | Type | Sources | Key Connection |
|---|---|---|---|
| NVIDIA | Company | 2 | Vera Rubin, rack-as-product strategy |
| Vera Rubin | Product | 2 | 6-chip AI supercomputer |
| TSMC | Company | 3 | N2 / A16 process nodes, HBM base dies |
| Quantinuum | Company | 1 | 94 logical qubits, iceberg codes |
| IBM Quantum | Company | 1 | Nighthawk + qLDPC + 2029 FT roadmap |
| Google Quantum AI | Lab | 3 | Willow below-threshold + QuEra neutral atom |
| QuEra | Company | 1 | Neutral atom, Google-backed |
| Nokia Bell Labs | Lab | 1 | First on-chip photonic backprop |
| Lightmatter | Company | 1 | Photonic transformer inference |
Timeline
See timeline.md for chronological developments.
Research Frontier
See frontier.md for active research directions, breakthroughs, and knowledge gaps.
Sources
| # | Title | Type | Date | Status |
|---|---|---|---|---|
| 1 | Inside the NVIDIA Vera Rubin Platform | tech report | 2026-01-05 | compiled |
| 2 | Custom Silicon Inflection 2026 | analysis | 2026-02-25 | compiled |
| 3 | IBM Nighthawk + Fault Tolerance Roadmap | tech report | 2025-11-12 | compiled |
| 4 | HBM4 / C-HBM4E Architectural Shakeup | analysis | 2025-12-02 | compiled |
| 5 | TSMC N2 / A16 Angstrom Era | analysis | 2026-01-21 | compiled |
| 6 | Quantinuum 94 Logical Qubits | tech report | 2026-03-26 | compiled |
| 7 | Google Willow: Below Surface Code Threshold | preprint | 2024-08-24 | compiled |
| 8 | Besedin: Lattice Surgery on SC Qubits | paper | 2026-01-08 | compiled |
| 9 | Google + QuEra Neutral Atom Expansion | news | 2026-04-03 | compiled |
| 10 | Photonic Tensor Processor (imec, PyTorch) | paper | 2026-01-15 | compiled |
| 11 | Ashtiani: On-Chip Photonic Backprop | paper | 2026-03-20 | compiled |
| 12 | Lightmatter Photonic Processor | tech report | 2025-04-09 | compiled |