prithivMLmods/proxima-ocr-d.markdown-post3.0.l
Image-Text-to-Text β’ 9B β’ Updated β’ 61 β’ 5
Generate expressive speech audio from text with custom voice
Long-form Speech Synthesis with Zonos
Long-Form Speech Synthesis with Zonos and DeepFilterNet
Generate audio from text with customizable emotions
mcp_server
Separate vocals and instrumentals from any music track
Translate text from English to Russian or Chinese
Generate virtual tryβon image of a person wearing a garment
Generate music from text descriptions and optional melodies
Track your online presence with reverse face search
Swap faces in images or videos
Find image sources by uploading an image