Arslan
cowgoesmoo
AI & ML interests
None yet
Recent Activity
updated a collection 4 days ago
vision
updated a collection 4 days ago
vision
liked a model 4 days ago
llava-hf/llava-v1.6-mistral-7b-hf
Organizations
llm
- Small Language Models are the Future of Agentic AI
  Paper • 2506.02153 • Published • 24
- DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
  Paper • 2512.02556 • Published • 265
- DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning
  Paper • 2511.22570 • Published • 93
- DeepSeek-OCR: Contexts Optical Compression
  Paper • 2510.18234 • Published • 93
vision
- Visual Instruction Tuning
  Paper • 2304.08485 • Published • 21
- Improved Baselines with Visual Instruction Tuning
  Paper • 2310.03744 • Published • 39
- Flamingo: a Visual Language Model for Few-Shot Learning
  Paper • 2204.14198 • Published • 16
- BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
  Paper • 2301.12597 • Published • 2
models 14
cowgoesmoo/huggingface_rl_unit7_SoccerTwos
Reinforcement Learning • Updated • 13
cowgoesmoo/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning • Updated
cowgoesmoo/huggingface_rl_unit8_ppo-LunarLander-v2
Reinforcement Learning • Updated
cowgoesmoo/huggingface_rl_unit8_ppo-LunarLander-v3
Reinforcement Learning • Updated
cowgoesmoo/huggingface_rl_unit8_ppo-CartPole-v1
Reinforcement Learning • Updated
cowgoesmoo/huggingface_unit6_a2c-PandaReachDense-v3
Reinforcement Learning • Updated • 11
cowgoesmoo/huggingface_rl_unit4-pyramids
Reinforcement Learning • Updated • 16
cowgoesmoo/huggingface_rl_unit5_ppo-SnowballTarget
Reinforcement Learning • Updated • 15
cowgoesmoo/huggingface_rl_unit4_pixelcopter
Reinforcement Learning • Updated
cowgoesmoo/huggingface_rl_unit4
Reinforcement Learning • Updated
datasets 0
None public yet