Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models Paper • 2511.23319 • Published Nov 28, 2025 • 22
On the Transformations across Reward Model, Parameter Update, and In-Context Prompt Paper • 2406.16377 • Published Jun 24, 2024 • 13
Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models Paper • 2412.16545 • Published Dec 21, 2024
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published Oct 30, 2025 • 116
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published Oct 30, 2025 • 116 • 5
AutoDeco Collection Chat with truly end-to-end LLMs with AutoDeco heads • 8 items • Updated 15 days ago • 6
RoT: Enhancing Large Language Models with Reflection on Search Trees Paper • 2404.05449 • Published Apr 8, 2024
What would Harry say? Building Dialogue Agents for Characters in a Story Paper • 2211.06869 • Published Nov 13, 2022 • 1