A desktop Electron application implementing the Gemini 2.0 Flash Multimodal Live API client. This project provides real-time interaction with Gemini's API through text, audio, video, and screen sharing capabilities.
This started as a simplified version of Google's original React implementation, created in response to this issue, and has evolved into a desktop application written in Electron.
- Real-time chat with Gemini 2.0 Flash Multimodal Live API
- Real-time audio responses from the model
- Real-time audio input from the user, allowing interruptions
- Real-time video streaming from the user's webcam
- Real-time screen sharing from the user's screen
- Function calling
- Native desktop application built with Electron
- Cross-platform support (Windows, macOS, Linux)
- Mobile-friendly web interface
- Node.js 18 or higher
- Yarn package manager
- Google AI Studio API key
-
Get your API key from Google AI Studio
-
Clone the repository
git clone https://github.com/ViaAnthroposBenevolentia/gemini-2-live-electron.git
-
Install dependencies:
cd gemini-2-live-electron yarn install
-
Start the application:
yarn start
-
For production build:
yarn build:win # or build:mac or build:linux
Contributions are welcome! Please feel free to submit issues and pull requests.
This project is licensed under the MIT License.