Text-To-Speech
  myshell-ai/OpenVoice • Text-to-Speech • Updated Dec 24, 2024 • 487
  coqui/XTTS-v2 • Text-to-Speech • Updated Dec 11, 2023 • 5.8M • 3.38k
  suno/bark • Text-to-Speech • Updated Oct 4, 2023 • 16k • 1.51k
  microsoft/speecht5_tts • Text-to-Speech • Updated Nov 8, 2023 • 85.6k • 823
image-generation
  h94/IP-Adapter-FaceID • Text-to-Image • Updated Apr 16, 2024 • 230k • 1.82k
  latent-consistency/lcm-lora-sdv1-5 • Text-to-Image • Updated Nov 16, 2023 • 67.8k • 518
  stabilityai/stable-diffusion-xl-refiner-1.0 • Image-to-Image • Updated Sep 25, 2023 • 325k • 2.02k
  stabilityai/stable-video-diffusion-img2vid • Image-to-Video • Updated Jul 10, 2024 • 49k • 1.01k
image-to-image
  timbrooks/instruct-pix2pix • Image-to-Image • Updated Jul 5, 2023 • 61.5k • 1.17k
  lllyasviel/sd-controlnet-openpose • Image-to-Image • Updated Apr 24, 2023 • 2.1k • 153
  destitech/controlnet-inpaint-dreamer-sdxl • Image-to-Image • Updated Apr 23, 2024 • 4.73k • 120
  lllyasviel/sd-controlnet-canny • Image-to-Image • Updated May 1, 2023 • 16.6k • 238
Audio-To-Audio
  ResembleAI/resemble-enhance • Audio-to-Audio • Updated Dec 21, 2023 • 175
  JorisCos/DPTNet_Libri1Mix_enhsingle_16k • Audio-to-Audio • Updated Sep 23, 2021 • 36 • 3
  speechbrain/mtl-mimic-voicebank • Audio-to-Audio • Updated Feb 19, 2024 • 270 • 35
  speechbrain/metricgan-plus-voicebank • Audio-to-Audio • Updated Feb 28, 2024 • 5.24k • 69
Speech-To-Text
  jonatasgrosman/wav2vec2-large-xlsr-53-english • Automatic Speech Recognition • 0.3B • Updated Mar 25, 2023 • 124k • 475
  nvidia/parakeet-rnnt-1.1b • Automatic Speech Recognition • Updated Nov 27, 2025 • 673 • 163
  facebook/seamless-m4t-v2-large • Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 91.6k • 947
Image-To-Video
  stabilityai/stable-video-diffusion-img2vid-xt • Image-to-Video • Updated Jul 10, 2024 • 124k • 3.23k
  cerspense/zeroscope_v2_576w • Text-to-Video • Updated Jul 1, 2023 • 8.29k • 491
  stabilityai/stable-video-diffusion-img2vid • Image-to-Video • Updated Jul 10, 2024 • 49k • 1.01k
  Vchitect/SEINE • Image-to-Video • Updated Jul 29, 2025 • 15
Image-To-3D
  FrozenBurning/SceneDreamer • Image-to-3D • Updated Nov 30, 2023 • 44
  zxhezexin/openlrm-small-obj-1.0 • Image-to-3D • Updated Jan 9, 2024 • 2 • 6
  Rompo/Rompov • Image-to-3D • Updated Dec 8, 2023 • 2
  zxhezexin/openlrm-large-obj-1.0 • Image-to-3D • Updated Jan 9, 2024 • 15 • 5
Motion-Estimation
  walterzhu/MotionBERT • Updated Apr 8, 2023 • 71
  aliprf/ASMNet • Updated Aug 4, 2022 • 2
  HoyerChou/MultiAugs • Updated Feb 20, 2024
  lllyasviel/sd-controlnet-openpose • Image-to-Image • Updated Apr 24, 2023 • 2.1k • 153
stock-market-prediction
  foduucom/stockmarket-future-prediction • Object Detection • Updated Oct 7, 2023 • 162 • 163
  foduucom/stockmarket-pattern-detection-yolov8 • Object Detection • Updated Apr 2, 2025 • 8.19k • 393
  NabeelShar/tes • Object Detection • Updated Oct 30, 2023 • 4 • 2