LLM Paperlist
updated
Mixture-of-Agents Enhances Large Language Model Capabilities
Paper
• 2406.04692
• Published
• 59
CRAG -- Comprehensive RAG Benchmark
Paper
• 2406.04744
• Published
• 46
Boosting Large-scale Parallel Training Efficiency with C4: A
Communication-Driven Approach
Paper
• 2406.04594
• Published
• 6
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language
Models
Paper
• 2406.04271
• Published
• 29
4-bit Shampoo for Memory-Efficient Network Training
Paper
• 2405.18144
• Published
• 12
Self-Exploring Language Models: Active Preference Elicitation for Online
Alignment
Paper
• 2405.19332
• Published
• 22
Paper
• 2405.18407
• Published
• 48
2BP: 2-Stage Backpropagation
Paper
• 2405.18047
• Published
• 26
Yuan 2.0-M32: Mixture of Experts with Attention Router
Paper
• 2405.17976
• Published
• 21
LLaMA-NAS: Efficient Neural Architecture Search for Large Language
Models
Paper
• 2405.18377
• Published
• 21
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
Paper
• 2406.15319
• Published
• 64
ColPali: Efficient Document Retrieval with Vision Language Models
Paper
• 2407.01449
• Published
• 51
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented
Generation
Paper
• 2406.19215
• Published
• 32
Visual Haystacks: Answering Harder Questions About Sets of Images
Paper
• 2407.13766
• Published
• 2