Implementation of a cloud based embedded platform for object detection and recognition
Abstract
With the recent advancements in deep learning-based computer vision models, object detection and recognition applications such as video surveillance, Bio-Imaging, autonomous cars are increasing in number. Object detection techniques require some large image datasets, memory, a machine with GPU to train the algorithm and have high power consumption. Embedded platforms are characterized by low power consumption, space, and energy resources making the deployment of the algorithms on them difficult. In order to overcome these drawbacks, the detection algorithm (Faster R-CNN) is trained and tested with an image dataset obtained from ImageNet. This algorithm is implemented on a computer with MATLAB. An image acquisition device is set up using the Raspberry pi and pi camera to capture, process and send images to the detector via Dropbox cloud platform with Python. The Dropbox platform serves as an interface between the Raspberry pi and the remote detector. The detector was trained to locate five classes of objects which namely Broom, Fan, Keyboard, Mouse, and Television. The multi-class object detector was trained on 2500 images with each class having 500 still images and tested on 500 still images. The system was tested in real-time by capturing images on the Raspberry pi and transmitting it to and from the detector using internet access in order to determine the process duration. The detector accuracy is measured using the average precision (AP) metric for each class and calculating the mean average precision (mAP) metric for all classes. The multi-class object detector achieved a mean Average Precision (mAP) of 0.67 and the entire system procedure from image capturing to the final display was executed in an average of 45 seconds.