Paper Deep Dive Hugging Face Published: 2026-06-10 HF ↑56

MiniMax Sparse Attention

Authors: Xunhao Lai, Weiqi Xu, Yufeng Yang, Qiaorui Chen, Yang Xu, and 6 others

Summary

Ultra-long-context capability is becoming indispensable for frontier LLMs: agentic workflows, repository-scale code reasoning, and persistent memory all require the model to jointly attend over hundreds of thousands to millions of tokens, yet the quadratic cost of softmax attention makes this untena…

#multimodal#llm#agent#coding#benchmark
