Papers SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28, 2025 • 123
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28, 2025 • 123
Alignment Dataset English and other model alignment datasets. H-D-T/Buzz-8b-Large-v0.5 Text Generation • 8B • Updated May 14, 2024 • 18 • 29 allenai/WildChat-1M Viewer • Updated Oct 17, 2024 • 838k • 11.9k • 405 nvidia/ChatQA-Training-Data Viewer • Updated Jun 4, 2024 • 442k • 703 • 174 nvidia/ChatRAG-Bench Viewer • Updated May 24, 2024 • 34.6k • 1.25k • 115
Personalization LLM User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 21
User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 21
Indic Datasets List of text and voice datasets to train and finetune Indic LLMs ai4bharat/sangraha Viewer • Updated Mar 5, 2025 • 268M • 8.48k • 65 uonlp/CulturaX Viewer • Updated Dec 16, 2024 • 7.18B • 41.9k • 571 pary/hind_encorp Updated Jan 18, 2024 • 38 • 2 PleIAs/YouTube-Commons Updated Jun 26, 2024 • 2.2k • 371
Papers SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28, 2025 • 123
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28, 2025 • 123
Personalization LLM User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 21
User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 21
Indic Datasets List of text and voice datasets to train and finetune Indic LLMs ai4bharat/sangraha Viewer • Updated Mar 5, 2025 • 268M • 8.48k • 65 uonlp/CulturaX Viewer • Updated Dec 16, 2024 • 7.18B • 41.9k • 571 pary/hind_encorp Updated Jan 18, 2024 • 38 • 2 PleIAs/YouTube-Commons Updated Jun 26, 2024 • 2.2k • 371
Alignment Dataset English and other model alignment datasets. H-D-T/Buzz-8b-Large-v0.5 Text Generation • 8B • Updated May 14, 2024 • 18 • 29 allenai/WildChat-1M Viewer • Updated Oct 17, 2024 • 838k • 11.9k • 405 nvidia/ChatQA-Training-Data Viewer • Updated Jun 4, 2024 • 442k • 703 • 174 nvidia/ChatRAG-Bench Viewer • Updated May 24, 2024 • 34.6k • 1.25k • 115