Index / tool

SepLLM

Category: AI Tools
Pricing: Open Source
Type: TOOL
Builder: hkuds
GitHub: 572 stars
Added: May 26, 2026

About

ICML 2025 paper accelerating large language models by compressing each segment into a single separator token to speed up inference.

Why it made the leaderboard

ICML 2025 technique that speeds up LLM inference by compressing each text segment into a single separator token — a concrete way to cut attention cost if inference latency is your bottleneck.

Tech Stack

CC++CudaMakefilePythonShell

Media

Comments (0)

No comments yet

Indexed by a proprietary survey. Corrections welcome.

About

Why it made the leaderboard

Tags

Tech Stack

Media

Comments (0)