28 8

cqiujin

Ronronne

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a model 19 days ago

FutureMa/Qwen3-8B-Drama-Thinking

Text Generation • 308k • Updated 4 days ago • 2.16k • 88

upvoted 3 papers 3 months ago

StableToken: A Noise-Robust Semantic Speech Tokenizer for Resilient SpeechLLMs

Paper • 2509.22220 • Published Sep 26 • 65

SWE-QA: Can Language Models Answer Repository-level Code Questions?

Paper • 2509.14635 • Published Sep 18 • 35

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Paper • 2509.16198 • Published Sep 19 • 126

upvoted a paper 4 months ago

MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML

Paper • 2509.06806 • Published Sep 8 • 63

liked a model 4 months ago

MachineLearningLM/MachineLearningLM-7B-v1

Text Generation • 8B • Updated Oct 1 • 68 • 34

upvoted a paper 7 months ago

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published May 29 • 93

upvoted a paper 8 months ago

Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published Apr 21 • 156

upvoted 2 papers 10 months ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published Mar 10 • 67

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13 • 191

liked 6 models 10 months ago

ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4

Reinforcement Learning • 8B • Updated Mar 26 • 1.65k • 227

NousResearch/DeepHermes-3-Llama-3-8B-Preview

Text Generation • 8B • Updated Apr 10 • 295 • 354

unsloth/DeepSeek-R1-GGUF

Text Generation • 671B • Updated May 30 • 23k • 1.1k

nomic-ai/nomic-embed-text-v2-moe

Sentence Similarity • 0.5B • Updated Apr 1 • 665k • 446

microsoft/OmniParser-v2.0

Updated Mar 28 • 815 • 1.31k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 555k • 12.9k

upvoted 4 papers 10 months ago

Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning

Paper • 2410.14208 • Published Oct 18, 2024 • 3

Teaching Models to Balance Resisting and Accepting Persuasion

Paper • 2410.14596 • Published Oct 18, 2024 • 3

How Do Training Methods Influence the Utilization of Vision Models?

Paper • 2410.14470 • Published Oct 18, 2024 • 5

Context is Key(NMF): Modelling Topical Information Dynamics in Chinese Diaspora Media

Paper • 2410.12791 • Published Oct 16, 2024 • 5