
NVIDIA outlines Hybrid-EP to push MoE all-to-all closer to hardware limits

NVIDIA introduced Hybrid-EP, a MoE expert-parallel communication approach aimed at reducing all-to-all overhead by streaming token dispatch/combine across NVLink and RDMA networks with low SM usage.

NVIDIA described Hybrid-EP, a hybrid expert-parallel communication path designed to reduce all-to-all bottlenecks when training hyperscale mixture-of-experts (MoE) models.

  • The post frames expert parallelism (EP) as an all-to-all communication pattern made harder by sparse routing (each token is sent to its top-k experts) and notes that, in DeepSeek-V3-style MoE training, EP communication can exceed 50% of step time without targeted optimization; a minimal routing sketch follows this list.
  • Hybrid-EP uses hierarchical transport (intra-node NVLink plus inter-node RDMA) and a streaming pipeline that separates token “dispatch” and “combine” work into different warp groups to mask latency; the second sketch after this list schematizes that chunked overlap.
  • It advertises native support for FP8 and BF16 data paths and aims to overlap communication with computation rather than running them as separate phases.
  • NVIDIA reports validation via Megatron Core and benchmarks across DeepSeek-V3, Megatron-FSDP, and Qwen 3 235B, including a claimed 514% throughput uplift over prior approaches.
  • In the same results, NVIDIA states Hybrid-EP can saturate network bandwidth using 416 streaming multiprocessors (SMs), leaving more GPU capacity for model compute.
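
To make the first bullet concrete, here is a minimal sketch (PyTorch assumed, not NVIDIA's code) of why top-k routing turns expert parallelism into an all-to-all exchange: each token picks k experts that may live on other EP ranks, so per-rank dispatch volume is just a histogram of (token, expert) pairs by destination rank. All shapes, the tiny router, and the two-rank layout are illustrative assumptions.

```python
import torch

num_tokens, hidden, num_experts, top_k = 8, 16, 4, 2
ep_world_size = 2                            # assume experts sharded across 2 EP ranks
experts_per_rank = num_experts // ep_world_size

x = torch.randn(num_tokens, hidden)
router = torch.nn.Linear(hidden, num_experts, bias=False)

# Each token picks its top-k experts; those experts may live on other ranks,
# so every rank must exchange token activations with every other rank.
scores = router(x).softmax(dim=-1)
topk_scores, topk_experts = scores.topk(top_k, dim=-1)   # [num_tokens, top_k]
# (topk_scores would later weight the combine step; unused in this sketch)

# Dispatch volume toward each EP rank = histogram of (token, expert) pairs
# whose chosen expert is hosted on that rank.
dest_rank = topk_experts // experts_per_rank
send_counts = torch.bincount(dest_rank.flatten(), minlength=ep_world_size)
print(send_counts)   # num_tokens * top_k routed pairs, split across the EP ranks
```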
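The second and third bullets describe a streaming dispatch/compute/combine pipeline with reduced-precision payloads. The sketch below is only a schematic of that idea, not the Hybrid-EP kernels: the helper names (dispatch, expert_compute, combine), the chunk count, and the bf16 cast are all hypothetical, and the sequential loop merely stands in for the overlap that dedicated warp groups or CUDA streams would provide.

```python
import torch
import torch.nn.functional as F

def dispatch(chunk):
    # stand-in for the NVLink/RDMA all-to-all send; cast to bf16 only to
    # illustrate a reduced-precision communication path
    return chunk.to(torch.bfloat16)

def expert_compute(chunk):
    # stand-in for the expert MLP applied to the tokens this rank received
    return F.gelu(chunk.float())

def combine(chunk):
    # stand-in for the all-to-all that returns expert outputs to token owners
    return chunk

tokens = torch.randn(32, 16)
chunks = tokens.chunk(4)                 # stream the batch through in 4 chunks
in_flight, outputs = None, []
for chunk in chunks:
    sent = dispatch(chunk)               # kick off this chunk's communication
    if in_flight is not None:
        # ...while the previous chunk, whose data has already "arrived",
        # goes through expert compute and combine
        outputs.append(combine(expert_compute(in_flight)))
    in_flight = sent
outputs.append(combine(expert_compute(in_flight)))   # drain the final chunk
result = torch.cat(outputs)
print(result.shape)                      # torch.Size([32, 16])
```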