codelion/gpt-2-70m

Text Generation
Safetensors
English
gpt2
dataset-mixing
pretraining
Eval Results
gpt-2-70m
774 MB
  • 1 contributor
History: 9 commits
codelion
Fix dataset composition percentages and token counts
ea63110 verified about 1 month ago
  • .gitattributes
    1.52 kB
    initial commit about 1 month ago
  • README.md
    5.18 kB
    Fix dataset composition percentages and token counts about 1 month ago
  • config.json
    750 Bytes
    Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) about 1 month ago
  • generation_config.json
    119 Bytes
    Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) about 1 month ago
  • merges.txt
    456 kB
    Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) about 1 month ago
  • model.safetensors
    256 MB
    xet
    Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) about 1 month ago
  • special_tokens_map.json
    99 Bytes
    Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) about 1 month ago
  • tokenizer.json
    3.56 MB
    Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) about 1 month ago
  • tokenizer_config.json
    475 Bytes
    Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) about 1 month ago
  • training_state.pt
    513 MB
    xet
    Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) about 1 month ago
  • vocab.json
    798 kB
    Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) about 1 month ago
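As a quick sanity check, the per-file sizes in the listing can be summed and compared with the 774 MB repository total shown near the top. The sketch below is only illustrative: it assumes the page uses decimal units (1 kB = 1000 bytes, 1 MB = 1,000,000 bytes), and the names `UNITS`, `LISTED`, and `to_bytes` are made up for this example rather than taken from any Hugging Face tooling.

```python
# Multipliers for the size labels used in the listing.
# Assumption: decimal units, which is not stated on the page.
UNITS = {"Bytes": 1, "kB": 10**3, "MB": 10**6}

# File sizes copied from the listing above.
LISTED = {
    ".gitattributes": "1.52 kB",
    "README.md": "5.18 kB",
    "config.json": "750 Bytes",
    "generation_config.json": "119 Bytes",
    "merges.txt": "456 kB",
    "model.safetensors": "256 MB",
    "special_tokens_map.json": "99 Bytes",
    "tokenizer.json": "3.56 MB",
    "tokenizer_config.json": "475 Bytes",
    "training_state.pt": "513 MB",
    "vocab.json": "798 kB",
}


def to_bytes(size: str) -> float:
    """Convert a label such as '3.56 MB' into a byte count."""
    value, unit = size.split()
    return float(value) * UNITS[unit]


total_bytes = sum(to_bytes(size) for size in LISTED.values())
total_mb = total_bytes / UNITS["MB"]

# Share of the repository taken up by the two largest files.
large_mb = (to_bytes(LISTED["model.safetensors"])
            + to_bytes(LISTED["training_state.pt"])) / UNITS["MB"]

print(f"total: {total_mb:.2f} MB")
print(f"weights + training state: {large_mb:.0f} MB")
```

Under that unit assumption, the sum rounds to the 774 MB shown for the repository, and nearly all of it comes from `model.safetensors` and `training_state.pt`. The listed sizes are themselves rounded, so the result is approximate.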