This is the repository for my final project in the Deep Learning School NLP course, spring 2021. Here I try to solve the image captioning problem.
The goal of image captioning is to generate a textual description of a given picture. In both architectures below, the image is first encoded by the model, and the caption is then generated token by token, conditioned on the image features and the previously decoded state.
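A minimal sketch of this encoder-decoder scheme, assuming a PyTorch setup; the class names and sizes here are illustrative assumptions, not the exact code from this repository:

```python
import torch
import torch.nn as nn
import torchvision.models as models


class Encoder(nn.Module):
    """CNN encoder: turns an image into a single feature vector."""
    def __init__(self, embed_size):
        super().__init__()
        resnet = models.resnet50(pretrained=True)
        # drop the classification head, keep the convolutional backbone + pooling
        self.backbone = nn.Sequential(*list(resnet.children())[:-1])
        self.fc = nn.Linear(resnet.fc.in_features, embed_size)

    def forward(self, images):                       # (B, 3, H, W)
        features = self.backbone(images).flatten(1)  # (B, 2048)
        return self.fc(features)                     # (B, embed_size)


class Decoder(nn.Module):
    """LSTM decoder: generates the caption conditioned on the image
    features and the previously decoded tokens."""
    def __init__(self, embed_size, hidden_size, vocab_size):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_size)
        self.lstm = nn.LSTM(embed_size, hidden_size, batch_first=True)
        self.fc = nn.Linear(hidden_size, vocab_size)

    def forward(self, features, captions):
        # the image features act as the "zeroth" input token of the sequence
        inputs = torch.cat([features.unsqueeze(1), self.embed(captions)], dim=1)
        hidden, _ = self.lstm(inputs)
        return self.fc(hidden)                       # (B, T+1, vocab_size)
```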
While creating this project I've relied on three articles:
- Overview of image captioning models
- Show, Attend and Tell
- Metrics for image captioning
In a few scripts you can find:
- the model, with and without an attention mechanism (a minimal attention sketch follows this list)
- the training pipeline
- metrics calculation (see the BLEU sketch below)
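For the attention variant, the decoder attends over spatial CNN features at every decoding step, in the spirit of Show, Attend and Tell. A minimal sketch of additive (Bahdanau-style) attention; the names and dimensions are assumptions, not this repository's exact implementation:

```python
import torch
import torch.nn as nn


class AdditiveAttention(nn.Module):
    """Scores each spatial image feature against the current decoder state
    and returns their weighted sum (the context vector)."""
    def __init__(self, feature_dim, hidden_dim, attn_dim):
        super().__init__()
        self.feat_proj = nn.Linear(feature_dim, attn_dim)
        self.hidden_proj = nn.Linear(hidden_dim, attn_dim)
        self.score = nn.Linear(attn_dim, 1)

    def forward(self, features, hidden):
        # features: (B, num_pixels, feature_dim), hidden: (B, hidden_dim)
        energy = torch.tanh(self.feat_proj(features)
                            + self.hidden_proj(hidden).unsqueeze(1))
        alpha = torch.softmax(self.score(energy).squeeze(-1), dim=1)   # (B, num_pixels)
        context = (features * alpha.unsqueeze(-1)).sum(dim=1)          # (B, feature_dim)
        return context, alpha
```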
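Caption quality is typically reported with n-gram metrics such as BLEU; a minimal sketch of how it can be computed with nltk (the tokenization and data layout below are assumptions for illustration):

```python
from nltk.translate.bleu_score import corpus_bleu

# each hypothesis is one tokenized generated caption;
# each image may have several tokenized reference captions
references = [[["a", "dog", "runs", "on", "the", "grass"],
               ["a", "dog", "is", "running", "outside"]]]
hypotheses = [["a", "dog", "runs", "on", "grass"]]

bleu4 = corpus_bleu(references, hypotheses)  # BLEU-4 by default
print(f"BLEU-4: {bleu4:.3f}")
```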
All the parts of the models, together with their demonstration, can be found in image_captioning_project.ipynb.
All of the training reports with metrics can be found here: