A Single-Stage Face Detection and Face Recognition Deep Neural Network

Based on Feature Pyramid and Triplet Loss

Abstract:

A practical deep learning face recognition system can be divided into several tasks. These tasks can be time-consuming if executed each task with the original image as the input data. And the feature extractors using by different task may duplicate its function. In this paper, we propose a multi-task training method with the optimization of deep neural network to train a single-stage deep neural network for face detection and recognition. We use the feature pyramid and triplet loss combined with anchor boxes to localize the face location, and the regression layer of the neural network to align the face position. This structure combines the multiple faces detection task and recognition task in a single network without a significant precision lose where it was usually implemented in multi-stage network. Alternatively, the feature extraction task and feature matching task can be combined to extract face features by using convolution and match the person identification by a full connection layer with softmax function. Also the proposed loss function is used to establish the feature extractor and finally match the features through a simple mathematical function. On an Nvidia 2080Ti GPU accelerator, this system can achieve 212 FPS for a 640x640 resolution input.

 

Model Architecture:

 

A comparison between typical two-stage model and proposed single-stage model

The proposed model architecture

 

 

 

 

Made by  ¬ö¬f§Ê