Generate a text transcript from audio with speaker diarization
Ask questions about your PDF using a local language model wi