VISION: A Modular AI Assistant for Natural Human-Instrument Interaction at Scientific User Facilities
VISION (Virtual Scientific Companion) is an AI-driven assistant designed to streamline operations at synchrotron beamlines and scientific facilities through natural language interaction. Built on a modular architecture, VISION is an assembly of Cognitive Blocks (Cogs)—specialized AI components tailored for tasks like transcription, classification, code generation, data analysis, and scientific querying. These cogs operate in predefined workflows, enabling seamless communication between users and complex instrumentation.
Key workflows include natural language-controlled (audio, text, or both) beamline operations, where commands are classified, converted to executable code, and deployed for data acquisition or analysis; custom function addition, where custom functions defined by the user in natural language are dynamically integrated into the system; and a domain-specific chatbot, capable of answering scientific queries with precision. This scalable, adaptable system minimizes deployment overhead, accelerates experimentation, and serves as a foundation for AI-augmented scientific discovery.
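Conceptually, a workflow chains cogs in sequence, each transforming the output of the previous one. Below is a minimal sketch of that idea; the class and function names are illustrative, not the actual VISION API:

```python
# Minimal sketch of chaining cogs in a predefined workflow.
# Names (Cog, run_workflow, the stub lambdas) are illustrative,
# not the actual VISION API.
from dataclasses import dataclass
from typing import Callable

@dataclass
class Cog:
    name: str
    run: Callable[[str], str]  # each cog maps a text input to a text output

def run_workflow(cogs: list[Cog], user_input: str) -> str:
    """Pass the user's input through each cog in order."""
    result = user_input
    for cog in cogs:
        result = cog.run(result)
    return result

# Example: voice command -> classified intent -> executable beamline code
workflow = [
    Cog("transcription", lambda audio: "move sample stage to 5 mm"),  # stub
    Cog("classification", lambda text: f"[operation] {text}"),        # stub
    Cog("code_generation", lambda intent: "stage.move(5.0)"),         # stub
]
print(run_workflow(workflow, "<audio bytes>"))
```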
Please see our paper: https://arxiv.org/abs/2412.18161
Highlights of the most important files:
📁 VISION/
├── 📁 backend/
│   ├── 📁 src/
│   │   └── 📁 hal_beam_com/
│   │       └── cog_manager.py  (main entry point for the backend)
│   └── 📁 tests/  (testing framework to evaluate LLMs on our datasets)
└── 📁 frontend/
    └── 📁 UI/
        └── 📁 program/
            └── executable.py  (main entry point for the frontend)
For each of the two folders, create a separate virtual environment with Python 3.12.7 and run `pip install -r ./requirements.txt`.
Then execute `python ./frontend/UI/program/executable.py` for the frontend and `python ./backend/src/hal_beam_com/cog_manager.py` for the backend.
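For reference, a typical setup sequence might look like the following, assuming each folder contains its own `requirements.txt` as described above; the virtual-environment names are illustrative:

```sh
# Backend environment (run from the repository root)
python -m venv .venv-backend && source .venv-backend/bin/activate
pip install -r ./backend/requirements.txt
python ./backend/src/hal_beam_com/cog_manager.py    # start the backend

# Frontend environment, in a separate shell
python -m venv .venv-frontend && source .venv-frontend/bin/activate
pip install -r ./frontend/requirements.txt
python ./frontend/UI/program/executable.py          # start the frontend
```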
If you want to use Anthropic's Claude as the main model for the cogs (LLMs with specialized prompts), set the `ANTHROPIC_API_KEY` environment variable.
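For example, on Linux/macOS (the key value is a placeholder):

```sh
export ANTHROPIC_API_KEY="your-api-key-here"
```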
Alternatively, you can switch the model setup by changing `ACTIVE_CONFIG` in `./backend/src/hal_beam_com/utils.py`. Running most models requires having Ollama installed and being able to run the selected models locally. New models can be added through `base_models_path` and then used by adding them to a configuration in `model_configurations`, roughly as sketched below.
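As a rough illustration of how these pieces might fit together (the actual field names and values in `utils.py` may differ; everything below is hypothetical):

```python
# Hypothetical sketch of the model-selection structures in utils.py.
# The real structure in the repository may differ.

# Map of model identifiers to their backing models (e.g., Ollama model tags).
base_models_path = {
    "llama3": "llama3:8b",          # served locally via Ollama
    "claude": "claude-3-5-sonnet",  # served via the Anthropic API
}

# Named configurations assigning a model to each cog.
model_configurations = {
    "anthropic": {"classifier": "claude", "codegen": "claude"},
    "local":     {"classifier": "llama3", "codegen": "llama3"},
}

# Select which configuration the cog manager uses.
ACTIVE_CONFIG = "anthropic"
```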
More detailed installation instructions are in the READMEs of the respective directories.
Citation:

@misc{mathur2024visionmodularaiassistant,
      title={VISION: A Modular AI Assistant for Natural Human-Instrument Interaction at Scientific User Facilities},
      author={Shray Mathur and Noah van der Vleuten and Kevin Yager and Esther Tsai},
      year={2024},
      eprint={2412.18161},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2412.18161},
}