Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 12 items • Updated 2 days ago • 148
latam-gpt/Wayra-Perplexity-Estimator-55M Text Classification • 55.4M • Updated Aug 15, 2025 • 88 • 19
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 897
facebook/wav2vec2-large-960h-lv60-self Automatic Speech Recognition • Updated May 23, 2022 • 95.2k • 161
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 204k • • 2.85k