Raushan Turganbay

RaushanTurganbay · zucchini-nlp

AI & ML interests

Generation and Multimodality

Recent Activity

updated a model 2 days ago

RaushanTurganbay/kimi-two-layers

19B • Updated 2 days ago • 389

published a model 2 days ago

RaushanTurganbay/kimi-two-layers

19B • Updated 2 days ago • 389

upvoted an article 5 days ago

EMO: Pretraining mixture of experts for emergent modularity

Article • allenai • 6 days ago • 30

upvoted a paper 23 days ago

EXAONE 4.5 Technical Report

Paper • 2604.08644 • Published Apr 9 • 69

upvoted an article 24 days ago

Building a Fast Multilingual OCR Model with Synthetic Data

Article • nvidia • 26 days ago • 33

updated a model 27 days ago

RaushanTurganbay/audio-flamingo-3-hf-lora-finetuned

Text Generation • Updated 27 days ago • 70

upvoted an article 28 days ago

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Article • tomaarsen • 28 days ago • 70

upvoted a collection 28 days ago

EXAONE 4.5

LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated 21 days ago • 42

upvoted a paper about 1 month ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 146

upvoted a paper about 2 months ago

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Paper • 2603.12254 • Published Mar 12 • 22

upvoted an article about 2 months ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Article • aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego +7 • Mar 10 • 152

updated 2 models about 2 months ago

deepseek-community/Janus-Pro-7B

Any-to-Any • 7B • Updated Mar 18 • 476 • 3

deepseek-community/Janus-Pro-1B

Any-to-Any • 2B • Updated Mar 18 • 96.8k • 14

New activity in OpenGVLab/InternVL2-8B about 2 months ago

Compatibility with v5

#23 opened about 2 months ago by RaushanTurganbay

New activity in OpenGVLab/InternVL2-1B about 2 months ago

Compatibility with v5

#10 opened about 2 months ago by RaushanTurganbay

New activity in OpenGVLab/InternVL2-2B about 2 months ago

Compatibility with v5

#7 opened about 2 months ago by RaushanTurganbay

New activity in OpenGVLab/InternViT-300M-448px-V2_5 about 2 months ago

Compatibility with v5

#5 opened about 2 months ago by RaushanTurganbay

New activity in OpenGVLab/InternViT-300M-448px about 2 months ago

Compatibility with v5

#6 opened about 2 months ago by RaushanTurganbay

New activity in PerceptronAI/Isaac-0.1 about 2 months ago

Compatibility with v5

#6 opened about 2 months ago by RaushanTurganbay

New activity in PerceptronAI/Isaac-0.2-1B about 2 months ago

Compatibility with v5

#4 opened about 2 months ago by RaushanTurganbay