Image Captioning using Deep Learning

May 10, 2021

Image captioning is the process of generating a textual description of an image. It combines natural language processing and computer vision, and it is an important computer vision problem with many applications. This thesis aims to replicate a well-known paper and improve on it. Attention-based models are used to describe the content of the images, and the model is trained deterministically using standard back-propagation. We also show through visualization how the model uses attention layers to learn automatically to fix its gaze on salient objects while generating the corresponding words in the output sequence. The thesis also explores several different hyper-parameters and networks to improve upon the existing work.

Concepts:

Convolutional Recurrent Neural Network (CRNN)
Attention Networks
Long Short-Term Memory (LSTM)
Image Captioning
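
As a rough illustration of the deterministic ("soft") attention idea described above, here is a minimal sketch of a single decoding step. It is not the thesis code: the function names, the toy feature vectors, and the dot-product scoring are all assumptions made for brevity. The point is only that the attention weights form a softmax distribution over image regions, so the whole step is differentiable and can be trained with ordinary back-propagation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def soft_attention(region_features, hidden_state):
    """Return (weights, context) for one decoding step.

    region_features: list of feature vectors, one per image region.
    hidden_state: the decoder's current state, same length as each feature.
    Dot-product scoring is an assumption chosen to keep the sketch short.
    """
    scores = [sum(f * h for f, h in zip(feat, hidden_state))
              for feat in region_features]
    weights = softmax(scores)
    dim = len(region_features[0])
    # Context vector: attention-weighted average of the region features.
    context = [sum(w * feat[d] for w, feat in zip(weights, region_features))
               for d in range(dim)]
    return weights, context


# Three hypothetical image regions with 2-dimensional features.
regions = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
state = [2.0, 0.0]
weights, context = soft_attention(regions, state)
```

In this toy input the first and third regions align with the decoder state, so they receive equal, larger weights than the second region. Plotting such weights over the image at each generated word is one way to produce the kind of attention visualization mentioned above.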

🔗 Resume

https://drive.google.com/file/d/1ivh9T596JYhb7I9xl5zHd3h-OSLVHUFP/view

December 22, 2023

My Resume

Mastering the Terraform Associate (003) Certification - My Journey to Success

December 21, 2023

A short blog on how I cleared my Terraform Associate (003) certification exam, with the resources I used and some tips.

read more →

Hurray! I cleared AWS DVA (Developer Associate) Exam

February 28, 2023

A short blog on how I cleared my AWS DVA (Developer Associate) exam, with the resources I used and some tips.

read more →

2022

Randomized A/B testing using Cloudflare workers

December 16, 2022

A project to perform randomized A/B experiments using a single link with the help of Cloudflare Workers and its KV (key-value) store.

read more →

DevOps Engineer

September 2, 2022

16Bit

read more →

Teaching Assistant

September 1, 2022

Teaching Assistant (University of Toronto)

read more →

Creating my digital garden using Obsidian and mkdocs

How I created a custom GitHub action to publish my Obsidian vault to the web using MkDocs and Netlify

read more →

Yay! I cleared AWS SAA Exam

A short blog on how I cleared my AWS SAA exam, with the resources I used and some tips.

read more →

Creating a custom GitHub action from scratch

How I created a custom GitHub action to cross-post blogs to different blogging websites

read more →

Avoid rm

February 10, 2022

A small article on why you should avoid using rm on *nix-based operating systems whenever possible.

read more →

2021

RSS is Amazing

November 26, 2021

A small article on how RSS can help you protect your sanity by organizing your information sources

read more →

Automate sending files to an FTP server

November 7, 2021

In this blog post I give a detailed explanation of my project FTP-Automation.

read more →

Member Technical Staff

Oracle

read more →

Using bookmarklets

This post shows how you can use bookmarklets to toggle the visibility of the YouTube control panel.

read more →

Full stack developer intern

Buyer assist

read more →

Kindle2Notion

February 7, 2021

A tool to organize Kindle highlights into Notion pages

read more →

2020

Inventory Management Application

December 10, 2020

Built an inventory management tool using Electron, React, and NeDB

read more →

MERN Tracker

November 15, 2020

Built an exercise tracker app using the MERN stack

read more →

Coding a discord bot

October 14, 2020

A Discord bot for muting and unmuting everyone in a voice channel

read more →

Deep Learning Intern

Origin Health

read more →

Research Assistant (SPARK Fellowship)

Indian Institute of Technology Roorkee

read more →

OpenQuad

To build an open-source quadcopter platform for research work.

read more →

2019

Gaze Tracking

August 21, 2019

To build goggles that could determine where the user was looking, how long their attention stayed on an object, and the type of that object.

read more →

Research Assistant

Indian Institute of Technology Bombay

read more →

Search and Reconnaissance Robot (SRR)

To design and fabricate an all-terrain robot for finding survivors after an earthquake.

read more →

2018

Parallel Manipulator

September 15, 2018

2-DOF robotic manipulator that writes as you draw in mid-air in front of a laptop’s webcam

read more →

Object Flow Pattern Recognition

An OpenCV and Deep Learning project aimed at tracking objects.

read more →