top of page

Face Recognition: An Intermediate approach

Updated: Dec 30, 2020

Face Recognition has always been on the top when it comes to the domain of computer vision or image analysis. If you are an aspiring computer/data scientist or you have been working or learning about this domain for some time, you might have some idea about facial recognition systems or you might be familiar with facial recognition attendance systems, but those systems have limited capabilities and most of them match images not facial features. So, face recognition has been around for quite some time and in the last ten years, it has been progressing quickly and have matured so much that we have many sophisticated systems. You might have noticed, recently Facebook conducted an experiment on its users by starting a trend called ‘Ten Year Challenge’ where every user posted a side-by-side picture of themselves from ten years before and now. What this challenge was for the purpose of data collection and training their models so they can identify faces more efficiently and to enhance the functionality of people tagging other people in photos on Facebook. Many other companies such as Google, IBM, Apple etc. have been working in closely in this domain.



What is Face Recognition?


We can say that face recognition come from the idea of converting analog features of the face such as eyes, nose, chin etc. to digital or computer readable format or in number. We see it everywhere in the current era of technology. Apple has integrated it in their mobile devices for Biometric Facial Recognition to unlock the phone. Many mobile applications use it for personal verifications. Also, it has been most adopted method for attendance systems during this whole COVID pandemic due to its accessibility of without touching the device physically. We can say that facial recognition comprises of two main applications. The first one is verification where it verifies the person who is currently there with the previous records. Here the mobile phone unlocking example is applicable as it is the same mechanism. The second one is recognition, now this can be further divided into many use cases as it can be used for surveillance, traffic maintenance, road safety, law enforcement, criminal identification and so on. So, we can say that it is more of the security purpose.

What goes on in a Facial Recognition system is in simple terms is that an input image goes inside the system, the system creates facial embeddings of the image or converts the facial features to numbers and then matches those embedding to the embedding previously known to it.



Going into more details


As we jump to more details in face recognition, we say more possibilities as there are many. To train a face recognition system, we need a good number of facial images, which we can further prepare them to feed them to our model either a Convolutional Neural Network(CNN) or Deep Neural Network(DNN). I would suggest you learn Deep Learning at this point if you want to go to an advanced level in this domain. So, we know Face Recognition is being used in many domains such as Health, Automobiles, Mobile Phone, Surveillance, Policing etc. With so many applications come more work to do. It not only about identifying a face, human face has a range of facial expressions which are also to be considered, we cannot just train a neural network on simple images and expect it to identify the same person with different facial expression. It might be able to do that, but the probability is low. So, facial emotion recognition systems play an important role, what it does is that we feed four sample images for each person in our data which is used to be trained for such system, the four basic emotions such as Happy, Sad, Neutral and Angry. So, after that we can say that it might identify emotions as well which has helped many teachers in their online classes during the pandemic to see how their students are feeling during the classes.


You can also see many other use cases for that, as much work is being done on live face recognition and in videos also. China has in fact implemented such system for their public surveillance systems where they see each person walking on the side walks or on public places. Google has trained their system called FaceNet in 2015 which was trained on a dataset called Labelled Faces in the Wild which achieved the record-breaking accuracy of 99.63%. They use this system in Google Photos to sort photos based on face recognition automatically and they have an unofficial opensource version of it known as OpenFace. Do check that out.


So, if you want to learn and practice more about it, I suggest you pick problems from Kaggle and try to solve them. Also here are some of the important resources you might find useful as these are some research papers and courses:

bottom of page