*MacOs and Web App in progress
Welcome to GPTranscribe(for now)! This python application(for now) allows for transcription of system audio(ie a meeting or youtube video), extraction of important terms and GPT driven definitions for the terms you used to google during a meeting or video. Think of this as the equivelant of left clicking text and searching for a definition but for system audio!
GPTranscribe requires 3 main dependencies
- python 3.7-3.9
- An openAI API key
- blackhole 2 Channel audio driver
- ffmpeg
Assuming the user has 1, 2 and a cloned repo follow these steps for setup:
- Install audio driver dependencies
- blackhole2ch
brew install blackhole2-ch
- PortAudio
brew install portaudio
- blackhole2ch
- Create a multi-audio device for blackhole + your current/desired output. Check here for instructions.
- Create a
.env
file withOPENAI_KEY=your_api_key
- Install python requirements
pip install -r requirements.txt
- Ensure you're using the multi-audio device from (2) and run
python smart_transcribe.py
- *optional: For a basic user interface run
python server.py
alongside