This repository contains the projects that I've developed during Udacity's Computer Vision Nanodegree.
Using image processing and deep learning techniques to create a facial keypoint detection system that takes in any image with faces, and predicts the location of 68 distinguishing keypoints on each face, such as the position of the eyes, nose, and mouth.
Designed and trained a CNN-RNN model that automatically generates image captions. The network is trained on the Microsoft Common Objects in COntext (MS COCO) dataset.
Implementation of SLAM (Simultaneous Localization and Mapping) for a 2-dimensional world. Sensor and motion data gathered by a simulated robot is used to create a map of an environment. SLAM gives us a way to track the location of a robot in the world in real-time and identify the locations of landmarks such as buildings, trees, rocks, etc.