Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
BEEspoke Data
community
https://www.bees.org/
Activity Feed
Follow
62
AI & ML interests
'an LLM is only as good as the dataset it was trained on' - Sun Tzu
Recent Activity
pszemraj
Β
updated
a model
18 days ago
BEE-spoke-data/NVIDIA-Nemotron-Parse-v1.2
pszemraj
Β
published
a model
21 days ago
BEE-spoke-data/NVIDIA-Nemotron-Parse-v1.2
kenhktsui
Β
authored
a paper
5 months ago
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources
View all activity
Team members
9
BEE-spoke-data
's datasets
82
Sort:Β Recently updated
BEE-spoke-data/fineweb-cryptid-5k
Viewer
β’
Updated
Dec 29, 2025
β’
5k
β’
14
BEE-spoke-data/MoistWeb-25k
Viewer
β’
Updated
Dec 29, 2025
β’
25k
β’
11
β’
1
BEE-spoke-data/fineweb-synergy-20k
Viewer
β’
Updated
Dec 29, 2025
β’
20k
β’
20
BEE-spoke-data/FineMeme-100k
Viewer
β’
Updated
Dec 29, 2025
β’
100k
β’
37
BEE-spoke-data/beeweb-5k
Viewer
β’
Updated
Dec 29, 2025
β’
5k
β’
15
BEE-spoke-data/SaunaWeb-50k
Viewer
β’
Updated
Dec 29, 2025
β’
50k
β’
11
BEE-spoke-data/napierone-pdf-raw
Viewer
β’
Updated
Dec 29, 2025
β’
18.5k
β’
10
BEE-spoke-data/napierone-epub-raw
Viewer
β’
Updated
Dec 29, 2025
β’
13.8k
β’
47
BEE-spoke-data/UltraTextbooks-2.1-fw_mix
Viewer
β’
Updated
Dec 29, 2025
β’
7.27M
β’
26
β’
4
BEE-spoke-data/fineweb-1000_64k
Viewer
β’
Updated
Dec 29, 2025
β’
2k
β’
11
β’
4
BEE-spoke-data/fineweb-100_128k
Viewer
β’
Updated
Dec 29, 2025
β’
100
β’
6
β’
4
BEE-spoke-data/fineweb-1M_longish
Viewer
β’
Updated
Dec 29, 2025
β’
1M
β’
12
β’
4
BEE-spoke-data/fineweb-1M_en-med
Viewer
β’
Updated
Dec 29, 2025
β’
1M
β’
57
β’
2
BEE-spoke-data/fineweb-100k_en-med
Viewer
β’
Updated
Dec 29, 2025
β’
100k
β’
43
β’
4
BEE-spoke-data/allNLI-sbert
Viewer
β’
Updated
Dec 29, 2025
β’
1.96M
β’
9
β’
1
BEE-spoke-data/gutenberg-en-v1-clean
Viewer
β’
Updated
Dec 29, 2025
β’
33.3k
β’
132
β’
4
BEE-spoke-data/edgar-corpus
Viewer
β’
Updated
Dec 29, 2025
β’
517k
β’
9
BEE-spoke-data/financial-news-articles-filtered
Viewer
β’
Updated
Dec 29, 2025
β’
200k
β’
16
BEE-spoke-data/sp500-edgar-10k-markdown
Viewer
β’
Updated
Dec 29, 2025
β’
12.6k
β’
25
β’
5
BEE-spoke-data/consumer-finance-complaints
Viewer
β’
Updated
Dec 29, 2025
β’
6.4M
β’
84
β’
5
BEE-spoke-data/YIMA
Viewer
β’
Updated
Dec 29, 2025
β’
2.84k
β’
7
BEE-spoke-data/angle-UAE-pairs
Viewer
β’
Updated
Dec 29, 2025
β’
1.76M
β’
7
BEE-spoke-data/jinaai_negation-dataset-v2-hf
Viewer
β’
Updated
Dec 29, 2025
β’
51k
β’
6
BEE-spoke-data/AutoMathText-top-hf
Viewer
β’
Updated
Dec 29, 2025
β’
619k
β’
12
β’
2
BEE-spoke-data/Nvidia-DeepLearningExamples
Viewer
β’
Updated
Dec 29, 2025
β’
4.34k
β’
10
β’
2
BEE-spoke-data/sbert-paraphrase-data
Viewer
β’
Updated
Dec 29, 2025
β’
148M
β’
45
BEE-spoke-data/TACO-hf
Viewer
β’
Updated
Dec 29, 2025
β’
26.4k
β’
9
β’
1
BEE-spoke-data/stackoverflow-questions-long
Viewer
β’
Updated
Dec 29, 2025
β’
752k
β’
18
β’
1
BEE-spoke-data/yahoo_answers_topics-long-text
Viewer
β’
Updated
Dec 29, 2025
β’
3.49k
β’
14
β’
2
BEE-spoke-data/v3-mix-mixtral
Viewer
β’
Updated
Dec 29, 2025
β’
12.7M
β’
9
β’
1
Previous
1
2
3
Next