VIBE
← Back to Leaderboard
AI/InfrastructureAPP
AI/InfrastructureAPP1mo ago9.7k

About

DeepEP DeepEP is a communication library tailored for Mixture-of-Experts (MoE) and expert parallelism (EP). It provides high-throughput and low-latency all-to-all GPU kernels, which are also known as MoE dispatch and combine. The library also supports low-precision operations, including FP8. To align with the group-limited gating algorithm proposed in the DeepSeek-V3 paper, DeepEP offers a set of kernels optimized for asymmetric-domain bandwidth

Tech Stack

Python

Comments

No comments yet.