Collection of LLM Evaluation Frameworks
Aditi Khare
AditiShashiKhare
AI & ML interests
Enterprise AI Product, Platform & Applied Research Leader building production-grade AI systems from 0→1 to global adoption at enterprise scale. Focused on translating cutting-edge AI into deployable systems across agentic AI, generative AI, and decision intelligence.
Recent Activity
updated a collection 2 days ago
LLM Evaluation Frameworks
upvoted an article 2 days ago
Let's talk about LLM evaluation
liked a model 4 days ago
deepseek-ai/DeepSeek-V4-Pro
Organizations
None yet
Diagnostic & Evaluation Datasets (Curated)
Curated datasets designed to diagnose, evaluate, and reason about AI system behavior.
Foundational & Modern AI Research (Curated)
A curated selection of foundational and modern AI research papers that meaningfully influence how real-world AI systems are designed, evaluated, and g
- Attention Is All You Need
  Paper • 1706.03762 • Published • 122
- Scaling Laws for Neural Language Models
  Paper • 2001.08361 • Published • 10
- Training Compute-Optimal Large Language Models
  Paper • 2203.15556 • Published • 11
- Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT
  Paper • 2210.04186 • Published
Open-Source Foundations for Modern AI Systems
Open-source libraries that form the infrastructure layer of modern AI systems, spanning model development, retrieval, orchestration, evaluation, and MLOps.
- A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale
  Paper • 2309.06497 • Published • 7
- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
  Paper • 2402.17764 • Published • 629
- Llama 2: Open Foundation and Fine-Tuned Chat Models
  Paper • 2307.09288 • Published • 251
Foundational & Applied AI Models
A curated set of influential AI models across research and production, including open- and closed-source systems, with a focus on agentic AI and generative AI.
LLM Evaluation Frameworks
Collection of LLM Evaluation Frameworks