Image Caption With PyTorch

This repository is still under construction.

This repository shows how to build a neural network in PyTorch that generates a caption from an image. It uses the Flickr8k and Flickr30k image caption datasets. The model is split into an encoder and a decoder to make the code easier to read.

Image Source : Encoder Decoder Image from Udacity Computer Vision Nanodegree Project

The image above illustrates the general encoder-decoder network. It does not exactly match what I do in this project; it only shows what the network does.
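As a rough illustration of the encoder/decoder split, here is a minimal sketch. It is not the code from this repository: the class names (`EncoderCNN`, `DecoderRNN`, `CaptionModel`), the tiny convolutional encoder, the LSTM decoder, and all layer sizes are assumptions chosen only to keep the example small and self-contained.

```python
import torch
import torch.nn as nn


class EncoderCNN(nn.Module):
    """Hypothetical toy encoder: image batch -> one feature vector per image."""

    def __init__(self, embed_size: int):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # -> (batch, 32, 1, 1)
        )
        self.fc = nn.Linear(32, embed_size)

    def forward(self, images: torch.Tensor) -> torch.Tensor:
        x = self.features(images).flatten(1)  # (batch, 32)
        return self.fc(x)  # (batch, embed_size)


class DecoderRNN(nn.Module):
    """Hypothetical LSTM decoder: image feature + caption tokens -> word scores."""

    def __init__(self, embed_size: int, hidden_size: int, vocab_size: int):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_size)
        self.lstm = nn.LSTM(embed_size, hidden_size, batch_first=True)
        self.fc = nn.Linear(hidden_size, vocab_size)

    def forward(self, features: torch.Tensor, captions: torch.Tensor) -> torch.Tensor:
        # Teacher forcing: the image feature is the first input step,
        # followed by every caption token except the last one.
        embeddings = self.embed(captions[:, :-1])  # (batch, T-1, embed)
        inputs = torch.cat([features.unsqueeze(1), embeddings], dim=1)  # (batch, T, embed)
        hidden, _ = self.lstm(inputs)  # (batch, T, hidden)
        return self.fc(hidden)  # (batch, T, vocab)


class CaptionModel(nn.Module):
    """Wires the encoder and decoder together."""

    def __init__(self, embed_size: int = 32, hidden_size: int = 64, vocab_size: int = 100):
        super().__init__()
        self.encoder = EncoderCNN(embed_size)
        self.decoder = DecoderRNN(embed_size, hidden_size, vocab_size)

    def forward(self, images: torch.Tensor, captions: torch.Tensor) -> torch.Tensor:
        return self.decoder(self.encoder(images), captions)


# Random stand-ins for a batch of 2 RGB images and 2 captions of 7 token ids.
images = torch.randn(2, 3, 64, 64)
captions = torch.randint(0, 100, (2, 7))

model = CaptionModel()
logits = model(images, captions)
print(logits.shape)  # torch.Size([2, 7, 100])
```

The output holds one score per vocabulary word for each caption position, so it can be compared against the target caption with a cross-entropy loss during training. A real setup would more likely use a pretrained CNN as the encoder; the small one here only avoids downloading weights.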

Project Structure


Usage

Clone the repository
Download the dataset
Train the model
Test the model

Flickr Dataset Source

The datasets used in this repository can be downloaded from these Kaggle dataset pages:

Flickr8k Dataset : https://www.kaggle.com/nunenuh/flickr8k
Flickr30k Dataset : https://www.kaggle.com/nunenuh/flickr30k

These datasets are hosted in my data repository on Kaggle. I modified them from the original datasets to match what this repository needs.

Pretrained Model

Code Originally from

https://github.com/aladdinpersson/Machine-Learning-Collection/tree/master/ML/Pytorch/more_advanced/image_captioning

https://github.com/yunjey/pytorch-tutorial/tree/master/tutorials/03-advanced/image_captioning
