University of Texas at Austin

university

Verified

https://www.utexas.edu

AI & ML interests

None defined yet.

Recent Activity

MarioBarbeque authored a paper 6 days ago

Mitigating Catastrophic Forgetting in Mathematical Reasoning Finetuning through Mixed Training

JonathanBLi authored a paper 9 days ago

Cautious Weight Decay

ChristinaW authored a paper 20 days ago

Mitigating Intra- and Inter-modal Forgetting in Continual Learning of Unified Multimodal Models

View all activity

hychiang

authored a paper 24 days ago

UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs

Paper • 2512.03383 • Published 26 days ago • 4

SP2001

authored 3 papers 2 months ago

Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms

Paper • 2510.13913 • Published Oct 15 • 3

EgoVLM: Policy Optimization for Egocentric Video Understanding

Paper • 2506.03097 • Published Jun 3

Hard2Verify: A Step-Level Verification Benchmark for Open-Ended Frontier Math

Paper • 2510.13744 • Published Oct 15 • 5

JianYu03

authored a paper 3 months ago

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

Paper • 2411.06469 • Published Nov 10, 2024 • 17

SP2001

authored a paper 4 months ago

SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents

Paper • 2509.06283 • Published Sep 8 • 17

cotran2

authored a paper 4 months ago

Arch-Router: Aligning LLM Routing with Human Preferences

Paper • 2506.16655 • Published Jun 19 • 17

evanking

authored 2 papers 4 months ago

Sasha: Creative Goal-Oriented Reasoning in Smart Homes with Large Language Models

Paper • 2305.09802 • Published May 16, 2023

Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices

Paper • 2509.02523 • Published Sep 2 • 7

abao

authored a paper 4 months ago

Concentration of Measure for Distributions Generated via Diffusion Models

Paper • 2501.07741 • Published Jan 13

fcyin

authored a paper 6 months ago

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation

Paper • 2501.05414 • Published Jan 9 • 2

fcyin

authored a paper 7 months ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19 • 17

hychiang

authored 4 papers 9 months ago

Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models

Paper • 2503.22879 • Published Mar 28 • 9

Quamba: A Post-Training Quantization Recipe for Selective State Space Models

Paper • 2410.13229 • Published Oct 17, 2024 • 1

Efficient Low-rank Backpropagation for Vision Transformer Adaptation

Paper • 2309.15275 • Published Sep 26, 2023 • 1

MobileTL: On-device Transfer Learning with Inverted Residual Blocks

Paper • 2212.03246 • Published Dec 5, 2022 • 1

SP2001

authored 3 papers 10 months ago

CodeUpdateArena: Benchmarking Knowledge Editing on API Updates

Paper • 2407.06249 • Published Jul 8, 2024

SFR-RAG: Towards Contextually Faithful LLMs

Paper • 2409.09916 • Published Sep 16, 2024 • 1

FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"

Paper • 2410.03727 • Published Sep 30, 2024 • 2

singhanj13

authored a paper 10 months ago

Automating Human Tutor-Style Programming Feedback: Leveraging GPT-4 Tutor Model for Hint Generation and GPT-3.5 Student Model for Hint Validation

Paper • 2310.03780 • Published Oct 5, 2023