Transcribe and summarize YouTube videos or audio files
Generate vocal covers from audio or text input