What's Changed
- 🔥🔥🔥[DeepSeek-V3] DeepSeek-V3 Technical Report by @DefTruth in #109
- 🔥🔥[SP: TokenRing] TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication by @DefTruth in #110
- 🔥🔥[FFPA] FFPA: Yet another Faster Flash Prefill Attention with O(1) SRAM complexity for headdim > 256, ~1.5x faster than SDPA EA(@DefTruth) by @DefTruth in #111
Full Changelog: v2.6.9...v2.6.10