1B • Updated
distributed/llama-1b-ws-2
Updated
distributed/llama-1b-ws-8
Updated
distributed/llama-1b-run-7
1B • Updated
distributed/optimized-gpt2-1b
Text Generation
• 1B • Updated • 11
distributed/gpt2-1b-bs2048-nodt-1_1
1B • Updated • 1
distributed/optimized-gpt2-500m
Text Generation
• 0.5B • Updated • 6
distributed/optimized-gpt2-1b-stable-embeddings
Text Generation
• 1B • Updated • 4
distributed/optimized-gpt2-2b-vtestnet-v1
Text Generation
• 2B • Updated • 6
distributed/optimized-gpt2-2b-without-stable-embeddings
Text Generation
• 2B • Updated • 2
distributed/optimized-gpt2-1b-vtestnet-v2
Text Generation
• 1B • Updated • 6
distributed/optimized-gpt2-1b-vtestnet-v3
Updated
distributed/optimized-gpt2-250m-v0.1.2
Text Generation
• 0.3B • Updated • 49
distributed/optimized-gpt2-250m-convergence-test-v1
Text Generation
• 0.3B • Updated • 2
distributed/optimized-gpt2-250m-convergence-test-v2
Text Generation
• 0.3B • Updated • 3 • 1
distributed/gpt2-250m-convergence-test
Text Generation
• 94.5M • Updated • 4
distributed/gpt2-250m-convergence-test-v2
Text Generation
• 94.5M • Updated • 13
distributed/gpt2-124m-convergence-test
Feature Extraction
• 0.1B • Updated • 1