Pratyay Banerjee's picture

In a Training Loop 🔄

Pratyay Banerjee

Neilblaze

·

https://neilblaze.live

AI & ML interests

IR, NLP, Pattern Recognition, xAI, Interpretability, Evals

Recent Activity

upvoted a paper about 6 hours ago

Measuring Epistemic Resilience of LLMs Under Misleading Medical Context

upvoted a paper about 6 hours ago

Learning from the Self-future: On-policy Self-distillation for dLLMs

upvoted a paper about 6 hours ago

Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding?

View all activity

Organizations

upvoted 6 papers about 6 hours ago

Measuring Epistemic Resilience of LLMs Under Misleading Medical Context

Paper • 2606.12291 • Published 9 days ago • 42

Learning from the Self-future: On-policy Self-distillation for dLLMs

Paper • 2606.18195 • Published 3 days ago • 70

Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding?

Paper • 2606.08063 • Published 13 days ago • 78

Redesign Mixture-of-Experts Routers with Manifold Power Iteration

Paper • 2606.12397 • Published 9 days ago • 86

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 4 days ago • 97

Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings

Paper • 2606.07502 • Published 14 days ago • 95

liked a Space about 7 hours ago

TinyNarrator

A small-model accessibility screen reader.

liked a model about 17 hours ago

WeiboAI/VibeThinker-3B

Text Generation • 3B • Updated 12 minutes ago • 6.59k • • 410

liked a Space about 17 hours ago

QUEST

Answer complex questions with web‑sourced research

liked 2 models 3 days ago

Paulescu/LFM2.5-Audio-1.5B-OHF-Voice-GGUF

1B • Updated May 12 • 443 • 4

LiquidAI/LFM2.5-Audio-1.5B

Audio-to-Audio • 1B • Updated Mar 30 • 1.25k • 425

liked a model 5 days ago

Jackrong/Qwopus3.6-27B-Coder-MTP-GGUF

Image-Text-to-Text • 0.5B • Updated 4 days ago • 122k • 251

liked a Space 7 days ago

FineWeb: decanting the web for the finest text data at scale

Explore and download the FineWeb web‑scale text dataset

upvoted an article 7 days ago

Article

Build Small Hackathon With Cohere Models

CohereLabs

•

15 days ago

• 5

liked a model 7 days ago

google/diffusiongemma-26B-A4B-it

Image-Text-to-Text • 26B • Updated 9 days ago • 527k • 1k

upvoted 5 papers 8 days ago

OpenSkill: Open-World Self-Evolution for LLM Agents

Paper • 2606.06741 • Published 15 days ago • 27

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 11 days ago • 33

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Paper • 2606.09730 • Published 11 days ago • 50

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Paper • 2606.05922 • Published 15 days ago • 52

LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

Paper • 2606.06087 • Published 15 days ago • 63