Running Featured 127 Open-LLM performances are plateauing, let’s make the leaderboard steep again 🏔 127 Explore and compare advanced language models on a new leaderboard
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Paper • 2407.14057 • Published Jul 19, 2024 • 46
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore Paper • 2407.12854 • Published Jul 9, 2024 • 31
TinyLlama/TinyLlama-1.1B-Chat-v1.0 Text Generation • 1B • Updated Mar 17, 2024 • 3.13M • 1.57k