At NVIDIA GTC 2026 in San Jose, Akamai Technologies announced Akamai Inference Cloud, the first global-scale implementation of NVIDIA AI Grid — a reference architecture designed to distribute AI inference workloads across centralised data centres and edge infrastructure for better latency and cost efficiency.
Akamai's platform integrates thousands of GPUs and uses intelligent orchestration to route AI workloads across its network of 4,400+ edge locations, regional clouds, and core data centres, enabling low-latency, real-time AI applications such as gaming NPC interactions, fraud detection, and live media processing.
The announcement represents a major milestone in the evolution of AI, moving beyond isolated AI factories and towards a unified and distributed grid for AI inference.
Key Announcements
Expanded GPU capabilities to power a globally distributed AI compute grid for next-generation inference applications
A newly secured four-year, $200M service agreement with a major US technology company to deploy NVIDIA Blackwell GPU clusters
Launch of Inference Cloud platform powered by NVIDIA Blackwell AI architecture