A Telegram bot that connects your devices and automates actions seamlessly across them 🔥
This Telegram bot agent can:
- Interact with nearly all apps and browsers.
- Integrate with GPT-4o, Gemini Pro Vision, Claude 3, and LLaVa.
- Future plan: Send responses back to the commander.
- Future plan: Connect to multiple mobile phones and tablets remotely at once.
You should have:
- Python 3.11 or higher
- An API key for any one of the following models: GPT-4o, Gemini Pro Vision, Claude 3, or LLaVa.
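The prerequisites above can be checked before installing anything. The following is a minimal sketch, assuming the API keys are exposed as environment variables. The variable names in `ASSUMED_KEY_VARS` and the helper names `python_ok` and `configured_models` are hypothetical and used only for illustration; check `.env.example` for the names the project actually reads.

```python
import os
import sys

# Hypothetical environment variable names, one per supported model.
# These are assumptions for illustration, not names taken from the project.
ASSUMED_KEY_VARS = {
    "GPT-4o": "OPENAI_API_KEY",
    "Gemini Pro Vision": "GOOGLE_API_KEY",
    "Claude 3": "ANTHROPIC_API_KEY",
    "LLaVa": "LLAVA_API_KEY",
}


def python_ok(version_info=sys.version_info, minimum=(3, 11)):
    """Return True if the interpreter version is at least `minimum`."""
    return tuple(version_info[:2]) >= minimum


def configured_models(env=os.environ):
    """Return the models whose (assumed) API key variable is set and non-empty."""
    return [model for model, var in ASSUMED_KEY_VARS.items() if env.get(var)]
```

Calling `python_ok()` and `configured_models()` with no arguments inspects the running interpreter and the current environment. At least one model should appear in the returned list before the server is started.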
- Navigate to the project directory: change to the directory where the project files are located. For example:
  cd path/to/project/directory
- Change to the host directory.
  cd host
- Create a virtual environment.
  python3 -m venv venv
- Activate the virtual environment.
  source ./venv/bin/activate
- Install dependencies from requirements.txt.
  pip install -r requirements.txt
- Configure environment variables.
  - Copy .env.example to .env.
  - Fill in the .env file with the necessary environment variables.
- Start the server.
  python3 main.py
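The environment-variable step above can also be scripted. The following is a minimal sketch, assuming `.env` contains plain `KEY=value` lines. The helpers `ensure_env_file`, `parse_env`, and `missing_values` are hypothetical and not part of the project, which may load `.env` differently.

```python
import shutil
from pathlib import Path


def ensure_env_file(directory="."):
    """Copy .env.example to .env if .env does not exist yet; return the .env path."""
    directory = Path(directory)
    env_path = directory / ".env"
    if not env_path.exists():
        shutil.copy(directory / ".env.example", env_path)
    return env_path


def parse_env(text):
    """Parse simple KEY=value lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_values(env_path):
    """Return the sorted keys in the .env file whose values are still empty."""
    parsed = parse_env(Path(env_path).read_text(encoding="utf-8"))
    return sorted(key for key, value in parsed.items() if not value)
```

Run from the `host` directory, `ensure_env_file()` creates `.env` on first use, and `missing_values(".env")` lists the variables that still need to be filled in before starting the server.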
├── .env.local.example
├── .eslintrc.json
├── .gitignore
├── README.md
├── bun.lockb
├── components.json
├── host
│   ├── .env.example
│   ├── .gitignore
│   ├── agent.py
│   ├── listner.py
│   ├── main.py
│   ├── operate
│   │   ├── __init__.py
│   │   ├── config.py
│   │   ├── exceptions.py
│   │   ├── main.py
│   │   ├── models
│   │   │   ├── __init__.py
│   │   │   ├── apis.py
│   │   │   ├── prompts.py
│   │   │   └── weights
│   │   │       ├── __init__.py
│   │   │       └── best.pt
│   │   ├── operate.py
│   │   └── utils
│   │       ├── __init__.py
│   │       ├── label.py
│   │       ├── misc.py
│   │       ├── ocr.py
│   │       ├── operating_system.py
│   │       ├── screenshot.py
│   │       └── style.py
│   ├── requirements.txt
│   ├── screenshots
│   ├── test.py
│   └── temp.py
├── next-env.d.ts
├── next.config.ts
├── package.json
├── postcss.config.mjs
├── public
│   ├── file.svg
│   ├── globe.svg
│   ├── next.svg
│   ├── vercel.svg
│   └── window.svg
├── src
│   ├── app
│   │   ├── api
│   │   │   ├── prompt
│   │   │   │   └── route.ts
│   │   │   └── verify
│   │   │       └── route.ts
│   │   ├── favicon.ico
│   │   ├── fonts
│   │   │   ├── GeistMonoVF.woff
│   │   │   └── GeistVF.woff
│   │   ├── globals.css
│   │   ├── layout.tsx
│   │   └── page.tsx
│   ├── components
│   │   ├── blocks
│   │   │   ├── expandable-card-demo-grid.jsx
│   │   │   └── expandable-card-demo-standard.jsx
│   │   ├── common
│   │   │   ├── DotBackground.tsx
│   │   │   ├── InputWithButton.tsx
│   │   │   ├── Loader.tsx
│   │   │   └── PromptSuggestionButtton.tsx
│   │   ├── theme-provider.tsx
│   │   └── ui
│   │       ├── alert.tsx
│   │       ├── animated-modal.tsx
│   │       ├── badge.tsx
│   │       ├── button.tsx
│   │       ├── dialog.tsx
│   │       ├── flip-words.tsx
│   │       ├── input.tsx
│   │       ├── select.tsx
│   │       ├── toast.tsx
│   │       └── toaster.tsx
│   ├── context
│   │   ├── PrivyProvider.tsx
│   │   └── UserAuthContext.tsx
│   ├── hooks
│   │   ├── use-outside-click.js
│   │   ├── use-toast.js
│   │   └── use-toast.ts
│   ├── lib
│   │   ├── ably.ts
│   │   └── utils.ts
│   ├── providers
│   │   └── index.tsx
│   └── utils
│       ├── api.ts
│       └── setTokenCookies.ts
├── tailwind.config.ts
└── tsconfig.json
- Fork the repository.
- Create a new branch:
  git checkout -b feature-name
- Make your changes.
- Push your branch:
  git push origin feature-name
- Create a pull request.
This project wouldn't have been possible without CapxAI!
I ran into many issues while building this project, and it was my first time making a Telegram bot, but their team helped me a lot to overcome those issues and gave me suggestions. I am really thankful for it ❤️🩹!
This project is licensed under the MIT License.