OpenAI taps Cerebras for 750MW of low-latency inference compute
Yesterday (January 14, 2026), OpenAI announced a partnership with Cerebras to add 750MW of ultra low-latency AI compute to its platform. OpenAI says the capacity will roll out in phases, coming online in multiple tranches through 2028.
OpenAI describes Cerebras as a maker of purpose-built AI systems aimed at accelerating long outputs, and attributes the speed to a design that concentrates compute, memory, and bandwidth on a single large chip to reduce inference bottlenecks.
OpenAI says the Cerebras capacity will be integrated into its inference stack in phases, with expansion planned across workloads including code generation, image generation, and AI agent use cases.
OpenAI framed the goal around the interactive loop of request, model processing, and response, saying the added low-latency inference capacity is intended to make that cycle feel faster in real-time use.
OpenAI executive Sachin Katti described the deal as adding a dedicated low-latency inference option within OpenAI’s compute portfolio, and Cerebras CEO Andrew Feldman framed the focus as enabling real-time inference with OpenAI models on Cerebras hardware.