A5-Browser-Use Chrome Extension and Server for Agentic AI Workflows

Your commands control the browser - made easy. Supports various large language model providers, including OpenAI, Gemini, and Ollama.

A5 is an open-source project that integrates the powerful Browser Use Python library (along with Gradio-AI support coming soon) with a user-friendly RESTful API and Chrome extension. It aims to simplify agentic AI-powered browser automation tasks by providing an all-in-one solution that requires minimal setup, making it accessible to both developers and non-developers alike.

Chrome Extension Demo:

Important (Experimental) Notice

This project is experimental. You can run it easily on macOS by using the executable generated in the Python_server/dist folder (e.g., ./a5browseruse on macOS). For other platforms like Linux and Windows, you can build or run the server similarly (see the Installation steps for more details).

WARNING: Important before proceeding

Make sure to start Chrome in debug mode or you will get irratic behavior (many browser windows opening and closing). Instructions are below on how to do that based on your operating system. Additionally, you will need to create a .env file in the Python_server folder with your OpenAI, Gemini or other provider credentials (example of the .env format is in Python_server/.env.example).

Quick Start (macOS)

Note: Mac Users should be able to run the executable (after making sure to start Chrome in debug mode and setting your .env with your relevant API key information, per the warning above) located in the Python_server/dist folder by navigating to the Python_server/dist folder and running ./a5browseruse (This executable supports OpenAI). The instructions below will get you started with Ollama and Gemini if you prefer these providers. Again, you will need to create a .env file in the Python_server/dist folder with your OpenAI or other provider credentials if you are taking this route. An example of the format is available in the .env.example file.

However, if this does not work or alternatively, you can follow these steps to set it up manually:

Close all Chrome windows completely.

IMPORTANT : Start Chrome with Remote Debugging Enabled (required by Browser Use):

/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222 --profile-directory="Default" --disable-features=BlockInsecurePrivateNetworkRequests

Make sure you have Python 3.11 or higher installed.
From the Python_server folder, install dependencies:
```
pip install -r requirements.txt
```

Close Chrome, and start Chrome with remote debugging:

/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222 --profile-directory="Default" --disable-features=BlockInsecurePrivateNetworkRequests

You will need to create a .env file in the Python_server/ folder with your OpenAI or other provider credentials. An example of the format is available in the .env.example file.
In a separate terminal window, still in Python_server, start the server: For OpenAI:
```
uvicorn main:app --host 127.0.0.1 --port 8888 --workers 1
```
For Gemini:
```
uvicorn mainGemini:app --host 127.0.0.1 --port 8888 --workers 1
```
For Ollama:
```
uvicorn mainOllama:app --host 127.0.0.1 --port 8888 --workers 1
```
Note: if you would like to use Ollama to save on costs (or just prefer Ollama), you will need to install Ollama on your Desktop and make sure it is running on your local machine. You should be able to call the ollama list command in your terminal and see a list of the models you have installed. You will need qwen2.5:32b-instruct-q4_K_M in order for your automoted browser-usage to work well. You can change which model is used by editing Python_server/mainOllama.py.
Once the server is running, open http://localhost:8888/lastResponses/ to verify it’s active.

Quick Start (Linux and Windows)

Close all Chrome windows completely.

Start Chrome with Remote Debugging Enabled:

Windows (in Command Prompt or PowerShell):

"C:\Program Files\Google\Chrome\Application\chrome.exe" --remote-debugging-port=9222 --profile-directory="Default" --disable-features=BlockInsecurePrivateNetworkRequests

Linux (in Terminal):

google-chrome --remote-debugging-port=9222 --profile-directory="Default" --disable-features=BlockInsecurePrivateNetworkRequests

You will need to create a .env file in the Python_server/ folder with your OpenAI or other provider credentials. An example of the format is available in the .env.example file.

Run the Python Server (similarly as macOS):

For OpenAI:

uvicorn main:app --host 127.0.0.1 --port 8888 --workers 1

For Gemini:

uvicorn mainGemini:app --host 127.0.0.1 --port 8888 --workers 1

For Ollama:

uvicorn mainOllama:app --host 127.0.0.1 --port 8888 --workers 1

Access the API at http://localhost:8888/lastResponses/.

If you need a standalone executable on Windows or Linux, you’ll have to build it on that platform (since PyInstaller doesn’t support cross-compiling). The resulting file will be in the dist folder for that OS.

Features

Seamless Integration: Combines a Chrome extension with a Python backend to execute browser commands effortlessly
AI-Powered Automation: Utilizes OpenAI's language models (and future expansions!) to interpret and perform complex browser tasks
Cross-Platform Support: Compatible with Windows, macOS, and Linux
Open-Source: Community-driven development to continuously enhance functionality and usability
RESTful API: Well-documented API endpoints for easy integration and extensibility

NEW

Context Storage: You can add context to be saved and used for all of your commands (across sessions). Click "Advanced Settings" under the command bar, and save information that will be included with each initial command. This can include methods for overcoming obstacles you have observed or to prevent having to repeat the same information everytime. You can edit this at any time or clear the information.

Prerequisites

Before you begin, ensure you have met the following requirements:

Python 3.11 or higher installed on your machine (Download Python)
Google Chrome browser installed (Download Chrome)
Git installed for cloning the repository (Download Git)

Installation

Follow these steps to set up A5-Browser-Use on your local machine.

Clone the Repository

git clone https://github.com/AgenticA5/A5-Browser-Use.git
cd A5-Browser-Use

Set Up the Python Server

Navigate to the Python_server folder:
```
cd Python_server
```
Install the dependencies:
```
pip install -r requirements.txt
```

Close all instances of Chrome, then start it with remote debugging:

macOS:

/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222 --profile-directory="Default" --disable-features=BlockInsecurePrivateNetworkRequests

Windows:

"C:\Program Files\Google\Chrome\Application\chrome.exe" --remote-debugging-port=9222 --profile-directory="Default" --disable-features=BlockInsecurePrivateNetworkRequests

Linux:

google-chrome --remote-debugging-port=9222 --profile-directory="Default" --disable-features=BlockInsecurePrivateNetworkRequests

You will need to create a .env file in the Python_server/ folder with your OpenAI or other provider credentials. An example of the format is available in the .env.example file.

In a new terminal window (while Chrome is running in remote debugging mode), start the FastAPI server:

For OpenAI:

uvicorn main:app --host 127.0.0.1 --port 8888 --workers 1

For Gemini:

uvicorn mainGemini:app --host 127.0.0.1 --port 8888 --workers 1

For Ollama:

uvicorn mainOllama:app --host 127.0.0.1 --port 8888 --workers 1

Go to http://localhost:8888/lastResponses in your browser to confirm the server is running.

Install the Chrome Extension

Open Google Chrome and go to Settings → Extensions.
Enable Developer Mode in the top right corner.
Click Load unpacked or Load Unpacked Extension.
Select the Chrome_extension folder from this repository.

Once installed, the extension adds a small arrow on the left side of your browser window. Click it to expand and issue commands to the Python server (powered by the AI backend).

Usage

After the setup:

Open Chrome (in remote debugging mode).
Start your Python server via uvicorn or the executable from Python_server/dist (if you built one for your OS).
Click the Arrow in Chrome (added by the extension) to expand the control panel.
Issue Commands: Type your instruction, and the agent will attempt to perform the requested browser actions.

Contributing

We welcome contributions from the community! In particular, if you’d like to add support for additional AI providers beyond OpenAI, Gemini and Ollama, please open a Pull Request or open an Issue.

Fork the Project
Create a new branch (git checkout -b feature/YourFeature)
Commit your changes (git commit -m 'Add some feature')
Push to your branch (git push origin feature/YourFeature)
Open a Pull Request

License

This project is licensed under the MIT License. Feel free to modify and distribute it as per the license terms.

Final Notes

Close Chrome Completely before you re-run the server in remote debugging mode. Otherwise, the agent can’t connect properly.
For Windows and Linux executables, you must build them on their respective OS. PyInstaller won’t cross-compile from macOS.
The extension simply adds an arrow on the left of your browser that, when clicked, opens a sidebar to issue AI commands.
Future Plans: We plan to expand beyond OpenAI for multiple LLM backends—stay tuned, or submit a PR!

If you have any questions or run into issues, feel free to open an issue in the repo or reach out to the maintainers. Enjoy A5-Browser-Use!

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
Chrome_extension		Chrome_extension
Python_server		Python_server
attached_assets		attached_assets
.gitignore		.gitignore
.replit		.replit
README.md		README.md
icon.png		icon.png
pyproject.toml		pyproject.toml
replit.nix		replit.nix
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A5-Browser-Use Chrome Extension and Server for Agentic AI Workflows

Chrome Extension Demo:

Important (Experimental) Notice

WARNING: Important before proceeding

Quick Start (macOS)

Quick Start (Linux and Windows)

Table of Contents

Features

Prerequisites

Installation

Clone the Repository

Set Up the Python Server

Install the Chrome Extension

Usage

Contributing

License

Final Notes

About

Releases 5

Packages

Contributors 4

Languages

AgenticA5/A5-Browser-Use

Folders and files

Latest commit

History

Repository files navigation

A5-Browser-Use Chrome Extension and Server for Agentic AI Workflows

Chrome Extension Demo:

Important (Experimental) Notice

WARNING: Important before proceeding

Quick Start (macOS)

Quick Start (Linux and Windows)

Table of Contents

Features

Prerequisites

Installation

Clone the Repository

Set Up the Python Server

Install the Chrome Extension

Usage

Contributing

License

Final Notes

About

Topics

Resources

Stars

Watchers

Forks

Releases 5

Packages 0

Contributors 4

Languages

Packages