AI assist photo taking

Abstract

Users with no training in photography are often unable to capture attractive photographs. This project utilizes AI to automatically analyze frame elements and generate instructions that guide users in capturing better photographs. In detail，this project is an application of a visual language model based on RAG and Agent technique. It can address users' photography questions by querying professional photography reference image libraries, and provide professional photography suggestions based on reference photos, including composition, posing, ISO settings, and more.

Pipeline

Pass the query image to the embedding model to semantically represent it as an embedded query vector.
Pass the embedded query vector to reference professional photography DB.
Retrieve the top-k relevant photo – measured by distance between the query embedding and all the embedded photo in the database.
Pass the query question，query image and retrieved photo to our VLM model.
The VLM model will determine which agent should be used，including composition agent，posing agent and ISO setting agent.
The VLM model generate a response with agent using the retrieved referencecontext.

Requirements

transformers 4.37.2

gradio 4.38.0

SQLAlchemy 2.0.31

Run

Firstly download unsplash lite dataset.

Then run main.py in MaterialSerch folder to start RAG backend.The RAG backend used in this project is based on MaterialSearch. The cached CLIP embeding for unsplash lite dataset is in the MaterialSearch folder.

Finaly run demo_gradio_agent.py to start gradio webui.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
MaterialSearch		MaterialSearch
LLM.py		LLM.py
README.md		README.md
arch.png		arch.png
demo_gradio_agent.py		demo_gradio_agent.py
tool.py		tool.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI assist photo taking

Abstract

Pipeline

Requirements

Run

About

Releases

Packages

Languages

ColinWine/AI_assist_photo_taking

Folders and files

Latest commit

History

Repository files navigation

AI assist photo taking

Abstract

Pipeline

Requirements

Run

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages