Deep neural network: Sign language recognition
Recognizing multiple classes of objects from images is a common computer vision task. Here we have a 6-class problem where we want to recognize 6 different digits (0, 1, 2, 3, 4, 5) of American Sign Language given an image of a hand. We build a neural network s for this multi-class classification task using TensorFlow. I did this project in the Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization course as part of the Deep Learning Specialization.
Datasets
We have 1080 training examples and 120 test examples, where each example is of shape (64, 64, 3) with each of RGB channel image is of size 64x64. The examples are labeled as one of 0, 1, 2, 3, 4, and 5 for the corresponding digits. These labels are one-hot encoded to be used as the target output of the recognizer, as shown below.
Neural network architecture
We used only TensorFlow and Keras API to build a neural network model and train it with GradientTape
. The model we build has 3 layers: INPUT -> LINEAR -> RELU -> LINEAR -> RELU -> LINEAR -> SOFTMAX -> OUTPUT.