Automatic Attendance System using Face Recognition ( OpenCV 3.1.0 & Raspberry Pi )

Project Phase

A Face Recognition system to be used for marking attendance in an organisation for a streamlined and centralized record of Employees or Members.



Phase includes the following stages:
  • A C++ program to detect and store faces. (Detection)
  • A Python Script to maintain and link available faces. (Linking)
  • A C++ program to fetch faces from a camera and compare them with available database. (Recognition)
  • A Python Script to update the record on Google Spreadsheets over a secure wireless connection. (Uploading)
All of this runs on a Raspberry Pi 3.

Phase is and has been my most ambitious project because of the way it works. And also because it is composed of three of my most relished domains: Embedded Linux, Machine Learning and Internet of Things.

Each Functional component took considerable time worth mentioning in this catalog however I also do acknowledge that a lot of hows and whys will still be skipped because they are really large in number.
Last but most important, Kudos to Stack Overflow and every developer that asked relevant questions for this project of mine to be completed even after so many dead ends I have encountered.

Project Phase:

My initial setup was Ubuntu 15.10 on an Intel core i5 laptop. Linux was a choice because I already planned to deploy the final project on a Raspberry Pi. This maintained familiarity with the development platform.

The entire project is too long to discuss every working bit in a single blog post so an extended summary is what this post is.

What Phase Does:

It Sees You, Remembers You, Recognizes You and Keeps a note of it on Google Drive !
Pretty Cool when you think about it.
Each task stated above uses a separate program linked together to work seamlessly.

It Sees You:

The Video is captured via the integrated webcam (when developing on ubuntu) and via a USB webcam (when run on Raspbian OS [Raspberry Pi]).This is made easy by always fetching the video from the default connected device.As Pi doesn't come with an inbuilt camera,default device is the USB webcam.Voila!

A C++ program linked with the OpenCV (build from source:make,make install) running a cascade Haar's Frontal Face classifier detects the faces in an image.The task of detection is the following two things:
  • Number of faces in the image
  • Segmenting ,Cropping and Resizing Faces
Detecting One Face and Saving to Database

Detecting One Face and Saving to Database

A Video Documenting the database creation is as shown:






It Remembers You:

The database of a face is created only when only a single face is detected by the classifier.This ensures that the database of a single individual contains images only of that individual.Before saving the faces are gray scaled and resized to 300 x 300 pixels.

The structure of the folder is as :

.
|-- s1
|   |-- 1.pgm
|   |-- ...
|   |-- 29.pgm
|-- s2
|   |-- 1.pgm
|   |-- ...
|   |-- 29.pgm
...
|-- s40
|   |-- 1.pgm
|   |-- ...
|   |-- 29.pgm

A lot of guidance was received from OpenCV documentation.This includes the above folder structures to store faces.

Root Folder
Database of an Individual

Although the database of images is created successfully,For a program to actually "See" them,It is crucial that every image is properly documented along with the ID of the person they represent.This path creation and Labeling is done by a Python Script.

The Python script creates a record in .csv format which contains :
  • Full Path of the image
  • Label Corresponding to the Image
Since images are stored in a folder named after the id of the person, The name of the folder is infact the Label for our task.

The .csv file created looks like: 



The .CSV file created is used by the next segment to fetch,load and train the Face Recognizer algorithm.

It Recognizes You

The Task of Face Recognition is done by C++ Program written using OpenCV library.
The Face Recognition module is not native to the official source yet so the additional libraries are built using a new method I came up with as documented here.This method is more reliable than the conventional route.

The program fetches live feed from the default imaging device and processes it frame by frame.

The first task that the program performs is to train its Two classifiers on the training database and labels of images.The Two algorithms used are:


Eigenface is single class specific i.e. It finds the similarities between multiple images of same individual whereas FisherFace finds the differences between different individuals.The Collective and commonly agreed result of both these algorithms trained on the same set of images is used as a confirmation of a prediction.

The Haar's cascade is run to segment the faces which are the evaluated by the two algorithms and predictions are returned by both.The value of prediction is accurate 90% of the trials however it depends on the quality of images in the database.

Video Documenting Face Recognition:





Keeping a note on Google Drive:

The task of connecting securely to google cloud is done by a python script. It uses the following package to do the task of accessing and updating attendance on google spreadsheet.

  • Oauth2client  (Google Cloud Authentication Client)
  • Gspread   (Google Spreadsheet API client)
  • PyOpenSSL (Python Open SSL package)
The Result of prediction (Roll No. or Unique ID) is given to the Cloud Connect Script as a command line argument. The script fetches the date of current day from the system.These two data elements are enough to mark a student as present.

The Logic here is always a tautology, 
i.e. if a student 'A' arrives before the system ,he is marked as present for the current day.
      if a student 'A' is absent, he never arrives for attendance before the system, hence he is not marked for that day thus stating him absent.

The Python Script connects to a google spreadsheet via valid security credentials and update the attendance onto it.The programming is done in such a way that it handles all the possible scenarios that can arise on the spreadsheet section. Few of the problem -> solution are:
  • Date Row not found -> Create row for Current Date. (When taking attendance on a new day)
  • Roll No not found -> Create column for Roll No. (When database is updated)
  • Date Row found, Roll No column not found -> Add Roll No column and write "Present" in current date row   (Database updated during current day)
  • Date Row not found, Roll No. Column Found -> Add Date Row and write "Present" in current Roll No. column  (Database Intact, Day changed )
The Data for the recognized individual is successfully updated in 3-4 seconds. This is slow compared to execution time of our Recognizer program however keeping in mind all the authorizations and Credential check every time, it for sure is a lead over other unsecured connections.

The Google spreadsheet is edited to give write access to our API token so that there is no conflict of permissions during write task. 

Here is the video of Phase updating the attendance of a detected individual in real time:




The Pi Setup:

The Setup is done with a Dell VGA monitor using an HDMI to VGA converter to connect to Raspberry Pi. Additionally USB Webcam,Keyboard & Mouse are connected via USB port.The webcam lights are kept off because of high current surge of 6 LEDs. They barely make any improvements in lighting conditions anyway.
  • An 8GB Sandisk MicroSD card is loaded with NOOBS and Raspbian OS is installed.
  • OpenCV is built from source using my method for extra modules building as stated here.
  • CodeBlocks is installed from apt-get and code is copied to from the ubuntu system to Pi using a thumb drive.
  • Static path for database storage, database linking,fetching and cloud uploading are set to get around using command line arguments every time.The Detection stage still employs CL arguments to denote the person being databased.
The entire system is enclosed in a box as follows :


The LCD and the glowing Leds are part of a temperature monitoring system. It measures the temperature of the box internals to warn or ward off any heat damage. And that is an entirely different story for a later time.

-----------------------------------------------------------------------------------------------------------------
[ Update 17 November,2018 ] :

The code for a dlib variant of the face detection and recognition project is available for access on my github here : https://github.com/sanjeev309/face-recognition-dlib-tensorflow-knn
You will need to modify the core code to suit your requirement for an attendance system.
Pull requests are welcome.
-----------------------------------------------------------------------------------------------------------------


Success is not final, failure is not fatal: it is the courage to continue that counts

Used :
Code::Blocks IDE
PyCharm IDE
Raspbian OS
Atmel Studio 7.0