Daniel Byrne

realdanielbyrne

AI & ML interests

Deep Learning, LLMs, Time Series Analysis

Recent Activity

liked a Space about 1 month ago

open-llm-leaderboard/blog

liked a model 6 months ago

WeiboAI/VibeThinker-1.5B

liked a dataset 7 months ago

1-800-LLMs/physics

Organizations

None yet

liked a Space about 1 month ago

Open-LLM performances are plateauing, let’s make the leaderboard steep again

🏔

127

Explore and compare advanced language models on a new leaderboard

liked a model 6 months ago

WeiboAI/VibeThinker-1.5B

Text Generation • 2B • Updated Nov 24, 2025 • 1.95k • 519

liked a dataset 7 months ago

1-800-LLMs/physics

Viewer • Updated Apr 19, 2025 • 20k • 11 • 2

replied to Kseniase's post 7 months ago

Thank you for this resource.

liked a dataset over 1 year ago

Tiiny/QWQ-LONGCOT-500K

Viewer • Updated Dec 26, 2024 • 286k • 85 • 124

updated a model over 1 year ago

realdanielbyrne/opt-350m

0.3B • Updated Dec 12, 2024 • 3

liked a model over 1 year ago

Sao10K/I_am_alive_yay

Updated Nov 30, 2024 • 64

liked a dataset over 1 year ago

realdanielbyrne/AgathaChristieText

Viewer • Updated Dec 6, 2024 • 14.7k • 14 • 2

updated a dataset over 1 year ago

realdanielbyrne/AgathaChristieText

Viewer • Updated Dec 6, 2024 • 14.7k • 14 • 2

liked a dataset over 1 year ago

rombodawg/Everything_Instruct

Viewer • Updated Oct 8, 2024 • 4.05M • 49 • 54

upvoted a paper over 1 year ago

Selective Attention Improves Transformer

Paper • 2410.02703 • Published Oct 3, 2024 • 25

liked a model over 1 year ago

ValiantLabs/Llama3.1-8B-Enigma

Text Generation • Updated Mar 12, 2025 • 30 • 11

liked a dataset over 1 year ago

sequelbox/Tachibana

Viewer • Updated Sep 27, 2024 • 104k • 115 • 10

liked a model over 1 year ago

Undi95/Meta-Llama-3.1-8B-Claude

Text Generation • 8B • Updated Jul 31, 2024 • 30 • 57

upvoted 2 papers almost 2 years ago

LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference

Paper • 2407.14057 • Published Jul 19, 2024 • 46

Scaling Retrieval-Based Language Models with a Trillion-Token Datastore

Paper • 2407.12854 • Published Jul 9, 2024 • 31

liked a dataset almost 2 years ago

Weyaxi/sci-datasets

Viewer • Updated Sep 28, 2024 • 971k • 581 • 28

liked a model almost 2 years ago

TinyLlama/TinyLlama-1.1B-Chat-v1.0

Text Generation • 1B • Updated Mar 17, 2024 • 3.13M • 1.57k

New activity in meta-llama/Meta-Llama-3-8B almost 2 years ago

The model just repeats part of the input

#83 opened about 2 years ago by

summerstay

New activity in meta-llama/Meta-Llama-3-8B about 2 years ago

Any constraint on chat template applying instruction-finetuning?

❤️ 2

#7 opened about 2 years ago by

andreaKIM
