Tianshi ZHENG PRO

StoneTZHENG

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 1 day ago

PatchWorld: Gradient-Free Optimization of Executable World Models

Paper • 2605.30880 • Published 30 days ago • 12

upvoted a paper 22 days ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published 24 days ago • 44

upvoted a paper 28 days ago

SciResearcher: Scaling Deep Research Agents for Frontier Scientific Reasoning

Paper • 2605.01489 • Published May 26 • 1

upvoted 2 papers about 1 month ago

MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models

Paper • 2605.14906 • Published May 14 • 79

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published May 13 • 89

upvoted a paper about 2 months ago

Evaluation-driven Scaling for Scientific Discovery

Paper • 2604.19341 • Published Apr 21 • 3

upvoted 2 papers 5 months ago

Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

Paper • 2601.15808 • Published Jan 22 • 20

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

Paper • 2601.11004 • Published Jan 16 • 31

upvoted a collection 8 months ago

AutoGraph-R1

Collection

Directly Optimizing Knowledge Graph Construction for RAG using Reinforcement Learning • 11 items • Updated Oct 24, 2025 • 2

upvoted a paper 9 months ago

NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents

Paper • 2510.07172 • Published Oct 8, 2025 • 28

upvoted a paper 11 months ago

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Paper • 2508.00414 • Published Aug 1, 2025 • 96

upvoted a paper about 1 year ago

From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery

Paper • 2505.13259 • Published May 19, 2025 • 1
