batmanLovesAI/HeliumLM
Text Generation
PyTorch
roneneldan/TinyStories
English
slm
transformer
attention
optimization
tinystories
educational
arxiv:2305.07759
arxiv:2505.19529
License: mit
main
HeliumLM/checkpoints
1.1 GB
1 contributor
History: 34 commits
batmanLovesAI
Upload checkpoints/heliumlm-primer-iter-10000.pt with huggingface_hub
5ee2798
verified
about 3 hours ago
helium-distill-1-08-model-iter-14000.pt
pickle
Detected Pickle imports (4): "torch.ComplexFloatStorage", "torch.FloatStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"
106 MB
Removed unnecessary models and renamed models for better understanding
7 days ago
helium-distill-1-08-model-iter-8000.pt
pickle
Detected Pickle imports (4): "collections.OrderedDict", "torch.FloatStorage", "torch.ComplexFloatStorage", "torch._utils._rebuild_tensor_v2"
106 MB
Removed unnecessary models and renamed models for better understanding
7 days ago
helium-distill-5-05-model-iter-8000.pt
pickle
Detected Pickle imports (4): "torch._utils._rebuild_tensor_v2", "torch.FloatStorage", "collections.OrderedDict", "torch.ComplexFloatStorage"
106 MB
Removed unnecessary models and renamed models for better understanding
7 days ago
helium-nano-distill-5-05-model-iter-4000.pt
pickle
Detected Pickle imports (4): "torch._utils._rebuild_tensor_v2", "torch.FloatStorage", "collections.OrderedDict", "torch.ComplexFloatStorage"
250 MB
Removed unnecessary models and renamed models for better understanding
7 days ago
heliumLM-distilled-final-phase-1.pt
pickle
Detected Pickle imports (4): "torch.FloatStorage", "torch.ComplexFloatStorage", "torch._utils._rebuild_tensor_v2", "collections.OrderedDict"
106 MB
Added first model of the final phase
3 days ago
heliumlm-final-phase2-model-iter-2000.pt
pickle
Detected Pickle imports (4): "torch.ComplexFloatStorage", "collections.OrderedDict", "torch.FloatStorage", "torch._utils._rebuild_tensor_v2"
106 MB
Upload checkpoints/heliumlm-final-phase2-model-iter-2000.pt with huggingface_hub
3 days ago
heliumlm-grammar-model.pt
pickle
Detected Pickle imports (4): "torch.FloatStorage", "torch.ComplexFloatStorage", "torch._utils._rebuild_tensor_v2", "collections.OrderedDict"
106 MB
Deleted irrelevant models and added a grammatically correct model trained in phases on the entire TinyStories dataset (using quarterly batch technique)
4 days ago
heliumlm-primer-iter-10000.pt
pickle
Detected Pickle imports (4): "collections.OrderedDict", "torch.ComplexFloatStorage", "torch._utils._rebuild_tensor_v2", "torch.FloatStorage"
107 MB
Upload checkpoints/heliumlm-primer-iter-10000.pt with huggingface_hub
about 3 hours ago
heliumlm-primer-iter-5000.pt
pickle
Detected Pickle imports (4): "collections.OrderedDict", "torch.ComplexFloatStorage", "torch._utils._rebuild_tensor_v2", "torch.FloatStorage"
107 MB
Upload checkpoints/heliumlm-primer-iter-5000.pt with huggingface_hub
about 3 hours ago