unsloth/gemma-4-E2B-it-GGUF
Image-Text-to-Text • 5B • Updated • 714k • 249
Dub videos in another language with cloned voice
Generate a talking face video from an image and audio
Compare speech-to-text models by WER and speed
Generate and listen to creative stories
Generate multilingual talking-face videos from your text
Generate realistic speech and sounds from typed text
Generate animated face images using a driving video