Application of Convolutional Neural Networks for Image Detection and Recognition Based on a Self-written Generator

Object recognition is a branch of artificial vision and one of the pillars of machine vision. It consists in identifying the forms described in advance in a digital image and, in general, in a digital video stream. Although, as a rule, it is possible to perform recognition from video clips, the learning process is usually performed on images. In this paper, an algorithm for classifying and recognizing objects using convolutional neural networks is considered. The purpose of the work is to implement an algorithm for detecting and classifying various graphic objects fed from a webcam. The task is to first classify and recognize an object with high accuracy according to a given data set, and then demonstrate a way to generate images to increase the volume of the training data set by using a self-written generator. The classification and recognition algorithm used is invariant to transfer, shift and rotation. A significant novelty of this work is the creation of a self-written generator that allows using various types of augmentation (artificial increase in the volume of the training sample by modifying the training data) to form new groups of modified images each time. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Authors
Bienvenue M.M.N. , Kozyrev D.
Publisher
Springer Science and Business Media Deutschland GmbH
Language
English
Pages
29-41
Status
Published
Volume
1748 CCIS
Year
2023
Organizations
  • 1 V. A. Trapeznikov Institute of Control Sciences of Russian Academy of Sciences, 65 Profsoyuznaya Street, Moscow, 117997, Russian Federation
  • 2 Peoples’ Friendship University of Russia (RUDN University), 6 Miklukho-Maklaya Street, Moscow, 117198, Russian Federation
Keywords
augmentation; convolutional neural networks; image detection; image recognition; R-CNN (Regional Convolutional Neural Networks) model; subdiscretization

Other records