Skip to content

Open-source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

License

Notifications You must be signed in to change notification settings

Ramlinbird/nanobrowser

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

banner

GitHub Twitter Discord

🌐 Nanobrowser

Nanobrowser is an open-source AI web automation tool that runs in your browser. A free alternative to OpenAI Operator with flexible LLM options and multi-agent system.

⬇️ Get Nanobrowser from Chrome Web Store for free

👏 Join the community in Discord | X

❤️ Loving Nanobrowser? Give us a star 🌟 and help spread the word!

Nanobrowser Demo GIF

Nanobrowser's multi-agent system analyzing HuggingFace in real-time, with the Planner intelligently self-correcting when encountering obstacles and dynamically instructing the Navigator to adjust its approach—all running locally in your browser.

🔥Why Nanobrowser?

Looking for a powerful AI web agent without the $200/month price tag of OpenAI Operator? Nanobrowser , as a chrome extension, delivers premium web automation capabilities while keeping you in complete control:

  • 100% Free - No subscription fees or hidden costs. Just install and use your own API keys, and you only pay what you use with your own API keys.
  • Privacy-Focused - Everything runs in your local browser. Your credentials stay with you, never shared with any cloud service.
  • Flexible LLM Options - Connect to your preferred LLM providers with the freedom to choose different models for different agents.
  • Fully Open Source - Complete transparency in how your browser is automated. No black boxes or hidden processes.

Note: We currently support OpenAI, Anthropic, Gemini, Ollama and custom OpenAI-Compatible providers, more providers will be supported.

📊 Key Features

  • Multi-agent System: Specialized AI agents collaborate to accomplish complex web workflows
  • Interactive Side Panel: Intuitive chat interface with real-time status updates
  • Task Automation: Seamlessly automate repetitive web automation tasks across websites
  • Follow-up Questions: Ask contextual follow-up questions about completed tasks
  • Conversation History: Easily access and manage your AI agent interaction history
  • Multiple LLM Support: Connect your preferred LLM providers and assign different models to different agents

🚀 Quick Start

  1. Install from Chrome Web Store (Stable Version):

Important Note: For latest features, install from "Manually Install Latest Version" below, as Chrome Web Store version may be delayed due to review process.

  1. Configure Agent Models:
    • Click the Nanobrowser icon in your toolbar to open the sidebar
    • Click the Settings icon (top right)
    • Add your LLM API keys
    • Choose which model to use for different agents (Navigator, Planner, Validator)

🔧 Manually Install Latest Version

To get the most recent version with all the latest features:

  1. Download

    • Download the latest nanobrowser.zip file from the official Github release page.
  2. Install:

    • Unzip nanobrowser.zip.
    • Open chrome://extensions/ in Chrome
    • Enable Developer mode (top right)
    • Click Load unpacked (top left)
    • Select the unzipped nanobrowser folder.
  3. Configure Agent Models

    • Click the Nanobrowser icon in your toolbar to open the sidebar
    • Click the Settings icon (top right).
    • Add your LLM API keys.
    • Choose which model to use for different agents (Navigator, Planner, Validator)
  4. Upgrading:

    • Download the latest nanobrowser.zip file from the release page.
    • Unzip and replace your existing Nanobrowser files with the new ones.
    • Go to chrome://extensions/ in Chrome and click the refresh icon on the Nanobrowser card.

🛠️ Build from Source

If you prefer to build Nanobrowser yourself, follow these steps:

  1. Prerequisites:

    • Node.js (v22.12.0 or higher)
    • pnpm (v9.15.1 or higher)
  2. Clone the Repository:

    git clone https://github.com/nanobrowser/nanobrowser.git
    cd nanobrowser
  3. Install Dependencies:

    pnpm install
  4. Build the Extension:

    pnpm build
  5. Load the Extension:

    • The built extension will be in the dist directory
    • Follow the installation steps from the Manually Install section to load the extension into your browser
  6. Development Mode (optional):

    pnpm dev

🤖 Choosing Your Models

Nanobrowser allows you to configure different LLM models for each agent to balance performance and cost. Here are recommended configurations:

Better Performance

  • Planner & Validator: Claude 3.7 Sonnet
    • Better reasoning and planning capabilities
    • More reliable task validation
  • Navigator: Claude 3.5 Haiku
    • Efficient for web navigation tasks
    • Good balance of performance and cost

Cost-Effective Configuration

  • Planner & Validator: Claude Haiku or GPT-4o
    • Reasonable performance at lower cost
    • May require more iterations for complex tasks
  • Navigator: Gemini 2.0 Flash or GPT-4o-mini
    • Lightweight and cost-efficient
    • Suitable for basic navigation tasks

Local Models

  • Setup Options:

    • Use Ollama or other custom OpenAI-compatible providers to run models locally
    • Zero API costs and complete privacy with no data leaving your machine
  • Recommended Models:

    • Qwen 2.5 Coder 14B
    • Mistral Small 24B
    • We welcome community experience sharing with other local models in our Discord
  • Prompt Engineering:

    • Local models require more specific and cleaner prompts
    • Avoid high-level, ambiguous commands
    • Break complex tasks into clear, detailed steps
    • Provide explicit context and constraints

Note: The cost-effective configuration may produce less stable outputs and require more iterations for complex tasks.

Tip: Feel free to experiment with your own model configurations! Found a great combination? Share it with the community in our Discord to help others optimize their setup.

💡 See It In Action

Here are some powerful tasks you can accomplish with just a sentence:

  1. News Summary:

    "Go to TechCrunch and extract top 10 headlines from the last 24 hours"

  2. GitHub Research:

    "Look for the trending Python repositories on GitHub with most stars"

  3. Shopping Research:

    "Find a portable Bluetooth speaker on Amazon with a water-resistant design, under $50. It should have a minimum battery life of 10 hours"

🛠️ Roadmap

We're actively developing Nanobrowser with exciting features on the horizon, welcome to join us!

  • Expand LLM Support: Add integration with more large language model providers for AI agents
  • Security Firewall: Implement protective measures to ensure safe browsing operations
  • Memory Optimization: Reduce token usage while maintaining context awareness
  • Session Replay: Efficiently replay historical tasks with minimal token consumption
  • Specialized Agents: Develop purpose-built agents for complex domain-specific tasks

🤝 Contributing

We need your help to make Nanobrowser even better! Contributions of all kinds are welcome:

  • Share Prompts & Use Cases
    • Join our Discord server.
    • share how you're using Nanobrowser. Help us build a library of useful prompts and real-world use cases.
  • Provide Feedback
    • Try Nanobrowser and give us feedback on its performance or suggest improvements in our Discord server.
  • Contribute Code
    • Check out our CONTRIBUTING.md for guidelines on how to contribute code to the project.
    • Submit pull requests for bug fixes, features, or documentation improvements.

We believe in the power of open source and community collaboration. Join us in building the future of web automation!

💬 Community

Join our growing community of developers and users:

👏 Acknowledgments

Nanobrowser builds on top of other awesome open-source projects:

Huge thanks to their creators and contributors!

📄 License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Made with ❤️ by the Nanobrowser Team.

Like Nanobrowser? Give us a star 🌟 and join us in Discord | X

About

Open-source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • TypeScript 91.5%
  • JavaScript 7.3%
  • Other 1.2%