Argonne National Laboratory

company

Verified

https://www.anl.gov/

authored 4 papers 5 months ago

MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models

Paper • 2508.17467 • Published Aug 24, 2025

PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference

Paper • 2509.04377 • Published Sep 4, 2025 • 1

LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference

Paper • 2509.02753 • Published Sep 2, 2025

ImageNet-Think-250K: A Large-Scale Synthetic Dataset for Multimodal Reasoning for Vision Language Models

Paper • 2510.01582 • Published Oct 2, 2025

authored 5 papers 5 months ago

AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons

Paper • 2503.05731 • Published Feb 19, 2025 • 3

LM4HPC: Towards Effective Language Model Application in High-Performance Computing

Paper • 2306.14979 • Published Jun 26, 2023

AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions

Paper • 2509.13523 • Published Sep 16, 2025 • 7

MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models

Paper • 2508.17467 • Published Aug 24, 2025

PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference

Paper • 2509.04377 • Published Sep 4, 2025 • 1

authored a paper 5 months ago

Swift: An Autoregressive Consistency Model for Efficient Weather Forecasting

Paper • 2509.25631 • Published Sep 30, 2025 • 2

authored a paper 6 months ago

AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions

Paper • 2509.13523 • Published Sep 16, 2025 • 7

authored 2 papers about 1 year ago

DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

Paper • 2310.04610 • Published Oct 6, 2023 • 1

Making Machine Learning Datasets and Models FAIR for HPC: A Methodology and Case Study

Paper • 2211.02092 • Published Nov 3, 2022

authored 2 papers about 1 year ago

LSHBloom: Memory-efficient, Extreme-scale Document Deduplication

Paper • 2411.04257 • Published Nov 6, 2024

Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets

Paper • 2403.15953 • Published Mar 23, 2024

authored a paper over 1 year ago

Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry

Paper • 2411.15221 • Published Nov 20, 2024 • 30

authored a paper over 1 year ago

A Survey of Techniques for Optimizing Transformer Inference

Paper • 2307.07982 • Published Jul 16, 2023

authored a paper about 2 years ago

CIDAR: Culturally Relevant Instruction Dataset For Arabic

Paper • 2402.03177 • Published Feb 5, 2024 • 8

authored a paper about 2 years ago

A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators

Paper • 2310.04607 • Published Oct 6, 2023