Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
codelion
/
gpt-2-70m
like
16
Text Generation
Safetensors
codelion/finepdfs-1B
codelion/dclm-baseline-1B
codelion/fineweb-edu-1B
English
gpt2
dataset-mixing
pretraining
Eval Results
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
gpt-2-70m
774 MB
1 contributor
History:
9 commits
codelion
Fix dataset composition percentages and token counts
ea63110
verified
about 1 month ago
.gitattributes
1.52 kB
initial commit
about 1 month ago
README.md
5.18 kB
Fix dataset composition percentages and token counts
about 1 month ago
config.json
750 Bytes
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens)
about 1 month ago
generation_config.json
119 Bytes
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens)
about 1 month ago
merges.txt
456 kB
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens)
about 1 month ago
model.safetensors
256 MB
xet
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens)
about 1 month ago
special_tokens_map.json
99 Bytes
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens)
about 1 month ago
tokenizer.json
3.56 MB
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens)
about 1 month ago
tokenizer_config.json
475 Bytes
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens)
about 1 month ago
training_state.pt
513 MB
xet
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens)
about 1 month ago
vocab.json
798 kB
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens)
about 1 month ago