TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 5 days ago • 96
google/gemma-4-31B-it Image-Text-to-Text • 33B • Updated about 7 hours ago • 1.59M • 1.67k
google/gemma-4-26B-A4B-it Image-Text-to-Text • 27B • Updated about 7 hours ago • 1.27M • 587
SecureCode v2.0: A Production-Grade Dataset for Training Security-Aware Code Generation Models Paper • 2512.18542 • Published Dec 20, 2025 • 5
MegaVul: A C/C++ Vulnerability Dataset with Comprehensive Code Representation Paper • 2406.12415 • Published Jun 18, 2024 • 1
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation • 124B • Updated 34 minutes ago • 1.1M • 228