codelion
/

gpt-2-70m

Text Generation

Eval Results (legacy)

Model card Files Files and versions

774 MB

Ctrl+K

Ctrl+K

1 contributor

History: 9 commits

codelion's picture

Fix dataset composition percentages and token counts

ea63110 verified 5 months ago

.gitattributes

1.52 kB
initial commit 5 months ago
README.md

5.18 kB
Fix dataset composition percentages and token counts 5 months ago
config.json

750 Bytes
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) 5 months ago
generation_config.json

119 Bytes
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) 5 months ago
merges.txt

456 kB
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) 5 months ago
model.safetensors

256 MB
xet

Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) 5 months ago
special_tokens_map.json

99 Bytes
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) 5 months ago
tokenizer.json

3.56 MB
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) 5 months ago
tokenizer_config.json

475 Bytes
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) 5 months ago
training_state.pt

513 MB
xet

Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) 5 months ago
vocab.json

798 kB
Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) 5 months ago