ZhenYE's picture

ZhenYE

ZhenYe234

·

https://github.com/zhenye234

zhenye234

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

Atotti/Qwen3-Omni-AudioTransformer

upvoted a paper about 2 months ago

SpaceVista: All-Scale Visual Spatial Reasoning from mm to km

updated a model 4 months ago

ZhenYe234/test2

View all activity

Organizations

authored a paper 10 months ago

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published Feb 6 • 27

authored 4 papers about 1 year ago

CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model

Paper • 2305.06908 • Published May 11, 2023 • 6

CoMoSVC: Consistency Model-based Singing Voice Conversion

Paper • 2401.01792 • Published Jan 3, 2024 • 11

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23, 2024 • 32

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Paper • 2408.17175 • Published Aug 30, 2024 • 6