Hugging Face
HKUSTAudio/Audio-Omni
Tags: Any-to-Any, text-to-audio, text-to-speech, audio-editing, music, speech, diffusion, multimodal, audio-generation
arxiv: 2604.10708
License: cc-by-nc-4.0
Files and versions
main · Audio-Omni · 23.2 GB
2 contributors · History: 17 commits
Latest commit: Zeyue7, "Update README.md", d613c83 (verified), 27 days ago
.gitattributes (1.52 kB, Safe): initial commit, about 2 months ago
Audio-Omni.json (4 kB, Safe): Update Audio-Omni.json, about 1 month ago
README.md (3.11 kB): Update README.md, 27 days ago
model.ckpt (22.2 GB, xet, Safe, pickle): Restore model.ckpt, about 1 month ago
  Detected Pickle imports (4): "torch.HalfStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "torch.FloatStorage"
synchformer_state_dict.pth (950 MB, xet, Safe, pickle): Upload synchformer_state_dict.pth with huggingface_hub, about 2 months ago
  Detected Pickle imports (3): "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "torch.FloatStorage"
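
The "Detected Pickle imports" entries above list the globals a pickle stream asks the unpickler to import. As a rough illustration of how such a list could be produced locally, the sketch below walks the opcodes of a raw pickle stream with the standard-library pickletools module, without ever unpickling it. This is not the scanner the Hub uses. The function name list_pickle_imports is hypothetical, and the handling of STACK_GLOBAL is a simple heuristic that assumes the module and attribute names are the two most recently pushed strings. A PyTorch checkpoint saved in the zip-based format would first need its inner pickle member extracted (assumed here to be a file named data.pkl inside the archive).

```python
import collections
import io
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals referenced by a raw pickle stream.

    The stream is only disassembled, never unpickled, so no code in it runs.
    Hypothetical helper for illustration; not the Hub's own scanner.
    """
    imports = set()
    strings = []  # string arguments seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(io.BytesIO(data)):
        if opcode.name in ("GLOBAL", "INST"):
            # Older protocols: arg is "module name" joined by a single space.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name were pushed as separate strings.
            # Heuristic: assume they are the two most recent string pushes.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


if __name__ == "__main__":
    # Small self-contained demo: an OrderedDict, as found in state dicts.
    payload = pickle.dumps(collections.OrderedDict(weight=1.0))
    print(sorted(list_pickle_imports(payload)))
```

Because the heuristic does not track the pickle memo, it can miss imports whose module string is reused through a memo lookup; a thorough scanner would have to emulate the stack and memo.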