mozilla-foundation/common_voice_17_0
This F5-TTS model is fine-tuned for Russian and English.
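This model is released under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) license, which permits non-commercial use, modification, and distribution, provided that attribution is given and derivative works are shared under the same license.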
Base Model: SWivid/F5-TTS
Training Duration: 813k steps
Dataset Duration: 100k hours
{
"exp_name": "F5TTS_Base",
"learning_rate": 1e-05,
"batch_size_per_gpu": 5000,
"batch_size_type": "frame",
"max_samples": 64,
"grad_accumulation_steps": 1,
"max_grad_norm": 1,
"epochs": 1,
"num_warmup_updates": 405764,
"save_per_updates": 811528,
"keep_last_n_checkpoints": 5,
"last_per_updates": 10000,
"finetune": true,
"file_checkpoint_train": "",
"tokenizer_type": "char",
"tokenizer_file": "",
"mixed_precision": "fp16",
"logger": "wandb",
"bnb_optimizer": true
}
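To make the numbers in the config above easier to read, here is a minimal Python sketch that derives a few quantities from it. The sample rate (24 kHz) and mel hop length (256) are assumptions taken from the usual F5-TTS defaults and are not stated in this card; `num_gpus` is a hypothetical parameter, since the card does not say how many GPUs were used.

```python
import json

# Subset of the training config shown above (values copied verbatim).
CONFIG_JSON = """
{
  "learning_rate": 1e-05,
  "batch_size_per_gpu": 5000,
  "batch_size_type": "frame",
  "grad_accumulation_steps": 1,
  "num_warmup_updates": 405764,
  "save_per_updates": 811528
}
"""

# ASSUMPTIONS (not stated in this card): typical F5-TTS mel settings.
ASSUMED_SAMPLE_RATE = 24_000  # audio samples per second
ASSUMED_HOP_LENGTH = 256      # audio samples per mel frame


def summarize(config, num_gpus=1):
    """Derive per-update batch size and the warm-up ratio from the config.

    num_gpus is hypothetical: the card does not report the GPU count.
    """
    # With batch_size_type == "frame", the batch size counts mel frames.
    frames_per_update = (
        config["batch_size_per_gpu"]
        * config["grad_accumulation_steps"]
        * num_gpus
    )
    # Convert mel frames to seconds of audio under the assumed settings.
    seconds_per_update = frames_per_update * ASSUMED_HOP_LENGTH / ASSUMED_SAMPLE_RATE
    return {
        "frames_per_update": frames_per_update,
        "audio_seconds_per_update": seconds_per_update,
        "warmup_to_save_ratio": config["num_warmup_updates"] / config["save_per_updates"],
    }


config = json.loads(CONFIG_JSON)
summary = summarize(config)
print(summary)
```

Under these assumptions, one update on a single GPU covers roughly 53 seconds of audio, and the warm-up length is exactly half of the checkpoint interval `save_per_updates`.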