Skip to content
/ Image-Captioning Public template

AI driven system that describes the Image features using deep neaural networks like RNN/LSTM and using NLP process the text.

Notifications You must be signed in to change notification settings

nikk-16/Image-Captioning

Repository files navigation

Image-Captioning

Dataset

  • Flickr_8K dataset

Open in Jupyter(NOTEBOOK LINK)


The goal of image captioning is to convert a given input image into a natural language description.

The objective of our project is to learn the concepts of a CNN and LSTM model and build a working model of Image caption generator by implementing CNN with LSTM.​ The task of image captioning can be divided into two modules logically .​

  • Image based model ​
  • Language based model ​

The most beautiful part of this project is that it uses all concepts of AI i.e.​

  • Deep learning ​
  • Machine Learning ​
  • Natural Language Processing

​at one place which covers all topics in AI and it uses module Tensorflow which is provided by Google.​ ​

The other best thing is its use, that is so helpful to visually impaired people that it can let them to know what they are seeing.​ ​


Real time usage​

  • Aid to the Blind​ ​
  • Google Image Search​ ​
  • Self Driven cars​ ​
  • CCTV Camera Alarm​ ​

RESULTS

About

AI driven system that describes the Image features using deep neaural networks like RNN/LSTM and using NLP process the text.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published