1B
•
Updated
•
160
distributed/llama-1b-ws-2
distributed/llama-1b-ws-8
distributed/llama-1b-run-7
1B
•
Updated
•
5
distributed/optimized-gpt2-1b
Text Generation
•
1B
•
Updated
•
39
distributed/gpt2-1b-bs2048-nodt-1_1
1B
•
Updated
•
25
distributed/optimized-gpt2-500m
Text Generation
•
0.5B
•
Updated
•
68
distributed/optimized-gpt2-1b-stable-embeddings
Text Generation
•
1B
•
Updated
•
4
distributed/optimized-gpt2-2b-vtestnet-v1
Text Generation
•
2B
•
Updated
•
4
distributed/optimized-gpt2-2b-without-stable-embeddings
Text Generation
•
2B
•
Updated
•
4
distributed/optimized-gpt2-1b-vtestnet-v2
Text Generation
•
1B
•
Updated
•
8
distributed/optimized-gpt2-1b-vtestnet-v3
distributed/optimized-gpt2-250m-v0.1.2
Text Generation
•
0.3B
•
Updated
•
863
distributed/optimized-gpt2-250m-convergence-test-v1
Text Generation
•
0.3B
•
Updated
•
9
distributed/optimized-gpt2-250m-convergence-test-v2
Text Generation
•
0.3B
•
Updated
•
4
•
1
distributed/gpt2-250m-convergence-test
Text Generation
•
94.5M
•
Updated
•
8
distributed/gpt2-250m-convergence-test-v2
Text Generation
•
94.5M
•
Updated
•
40
distributed/gpt2-124m-convergence-test
Feature Extraction
•
0.1B
•
Updated
•
5