Innovative Region Convolutional Neural Network Algorithm for Object Identification

Permanasari, Yurika; Ruchjana, Budi Nurani; Hadi, Setiawan; Rejito, Juli

doi:10.3390/joitmc8040182

Open AccessSystematic Review

Innovative Region Convolutional Neural Network Algorithm for Object Identification

by

Yurika Permanasari

^1,*,

Budi Nurani Ruchjana

²

,

Setiawan Hadi

³

and

Juli Rejito

³

¹

Doctor Program of Mathematics, Faculty of Mathematics and Natural Sciences, Universitas Padjadjaran, Bandung 40132, Indonesia

²

Department of Mathematics, Faculty of Mathematics and Natural Sciences, Universitas Padjadjaran, Bandung 40132, Indonesia

³

Department of Computer Science, Faculty of Mathematics and Natural Sciences, Universitas Padjadjaran, Bandung 40132, Indonesia

^*

Author to whom correspondence should be addressed.

J. Open Innov. Technol. Mark. Complex. 2022, 8(4), 182; https://doi.org/10.3390/joitmc8040182

Submission received: 15 September 2022 / Revised: 6 October 2022 / Accepted: 8 October 2022 / Published: 11 October 2022

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Object identification is a part of the field of computer science, namely, image processing, whose research continues to innovate. Object identification describes an object based on the main characteristics of the object. Many research innovations related to object identification have been carried out to obtain optimal identification results. The convolutional neural network (CNN) is one of the algorithms that is widely used by researchers in the field of object identification or object recognition in digital images. The purpose of this study was to analyze the development of object identification in the search for the best algorithm in terms of the speed and efficiency of identification. The article data used were obtained from several sources, namely, Dimensions AI, Science Direct, and Google Scholar. The database search results obtained 1041 articles in the form of publications from 2010–2021. Through a systematic literature review based on the articles obtained, 32 articles were selected. The evaluation of the articles was carried out in the form of article data visualization, object identification algorithm development, and the research objects used. CNN’s research innovation is growing rapidly, with improvements being made to the identification techniques in its algorithmic architecture. The use of the CNN algorithm in the identification of image objects, starting with the region CNN technique, is improved with Fast R-CNN, Faster-CNN, and Mask R-CNN. The object of research has developed from facial recognition and the identification of moving images to the introduction of ancient manuscripts that are useful for the development of history and tourism. The successful identification of ancient scripted texts will greatly assist the availability of such manuscripts in a digital format, which allows for further multidisciplinary research. The availability of ancient manuscripts in a digital format also helps the government to preserve culture and increase people’s understanding of the culture they have.

Keywords:

object identification; ancient manuscripts; R-CNN; Fast R-CNN; Faster R-CNN; Mask R-CNN

1. Introduction

Image processing technology has become an important technology for machine learning and artificial intelligence (AI) tasks [1]. The continuous development and innovation of image processing technology helps AI algorithms to obtain relevant image information. To achieve machine visualization, object detection algorithms attempt to recognize all target items in the image and obtain category and object location information. The purpose of object detection is to identify and locate one or more entities that have meaning in still or moving images [2].

The rapid development of science and technology in the field of image processing in recent years has led to the potential to identify objects not limited to face recognition or certain objects. Further research has led to the ability to identify historical objects that are very important to the wider community. Object identification or recognition involves the automatic extraction and classification of an object [3], and the automation of the object identification process requires proper machine learning. One of the machine learning approaches that is often used for object recognition in images is the convolutional neural network (CNN) [4].

The identification process goes through several stages, namely, preprocessing, main process, and postprocessing. To produce an algorithm that can identify manuscripts well, it is necessary to find the right method for each stage of the process. The study of object identification is currently developing and attracting the attention of many researchers. The identification of objects in the image requires matching the characteristics of the objects in the image [5]. The CNN algorithm is widely used for object recognition as it can identify and recognize digital objects [6]. The obstacles in researching object identification are the detection of small objects among larger ones and against complex backgrounds. This poses a challenge to continue to develop the best innovations that obtain accurate results.

2. Materials and Methods

2.1. Scientific Article Data

In this study, literature focusing on object identification using CNN was identified and selected for review. The main focus was object identification using the region CNN algorithm on several different objects. In accordance with scientific developments, the region neural networks reviewed are Fast R-CNN, Faster R-CNN, and Mask R-CNN techniques.

The data used were articles obtained from several indexed sources, namely Dimensions AI, Science Direct, and Google Scholar. The articles were published from 2011 to 2021, so ten years of the database were used. The representation of the data was achieved using Vos Viewer from the searches performed using the Publish or Perish software by selecting the Google Scholar data source. In selecting the data using a PRISMA procedure, the data sources were taken from the selected database, the keywords entered were “object identification” and “ancient manuscripts” and “R-CNN” or “region convolutional neural network” or “fast R-CNN” or “faster R-CNN” or “mask R-CNN” with a maximum yield of 1041 articles as shown in Table 1 below.

Table 2 presents the number of literature search results with the keywords shown in Table 1.

2.2. Selection of Literature

The literature data obtained from the Publish or Perish software were selected by deleting literature in the form of books or relating to topics that were considered irrelevant to the research. The selection was carried out to obtain literature related to the research objectives in the form of articles discussing object identification using the R-CNN algorithm with various research objects including ancient manuscripts. The obtained publications were checked one by one to ensure an appropriate range of results from journals, proceedings, and doctoral thesis were obtained, all of which were written in English as shown in Table 3. After making the selection, any duplicates obtained from the three databases were removed (filter 1). The selection continued by removing articles with titles that were less relevant to the identification of ancient manuscript objects (filter 2). The next stage of selection was to select articles with relevant abstracts (filter 3) and those with very relevant content to serve as references for the research to be carried out (filter 4).

2.3. Methods and Systematic Data Analysis

This study carried out a systematic literature review of the published articles. The article data were assessed, identified, and interpreted based on the findings obtained after reading each article according to the object identification research topic using R-CNN. In the review process, a systematic evaluation of the literature is also needed so that there is no duplication or plagiarism with previous research.

In this study, the PRISMA procedure in Figure 1 used for the systematic analysis of article data was utilized according to the following stages:

(1): Visualization of article data related to the relationship between the article and the appropriate word topic.
(2): Mapping of the number of articles in each year (from 2011–2021) and providing general information. Intervention studies involving animals or humans, and other studies that require ethical approval, must list the authority that provided approval and the corresponding ethical approval code.

3. Results

The following section describes the results of the data analysis of 1041 articles. The explanation includes article data visualization, object identification research innovation, and research information using the R-CNN algorithm with different techniques.

3.1. Article Data Visualization

CNN research itself is closely related to the problems of object identification, image processing, object detection, classification, and network [7,8], as shown in Figure 2.

CNN, as an algorithm architecture, is used as the basis for the development of R-CNN to obtain an optimal object identification algorithm. Research using Faster R-CNN and Mask R-CNN is still rarely carried out because it is still relatively new. The development of object identification studies using the latest R-CNN algorithm proposed by Girshik [9] is shown in Figure 3.

As the latest generation of R-CNN, Faster R-CNN and Mask R-CNN appeared around 2017–2018, and are marked in yellow in Figure 3 according to the color bar shown in the picture. Some papers use Mask R-CNN as an extension of Faster R-CNN, as it is a faster algorithm compared to other CNN algorithms [10].

3.2. Object Identification with R-CNN, Fast R-CNN, Faster R-CNN, and Mask R-CNN

CNN is one of the deep learning methods that currently has the most significant results in image recognition. This is because CNN imitates the image recognition system in the human visual cortex so that it can to process image information. However, CNN also has a weakness, namely, the need for model training, which takes a long time. Therefore, Girshick proposed R-CNN to shorten the processing time [11]. The research object identification model develops using the fastest search algorithm. A comparison of the speed of the test-time algorithm is presented in Figure 4 (source: https://towardsdatascience.com/ (accessed on 10 August 2022).

Thus, Fast R-CNN was developed as an improvement of the R-CNN algorithm, Faster R-CNN was developed as an improvement of the Fast R-CNN algorithm, and the latest Mask R-CNN model was developed from Faster R-CNN and its algorithmic model architecture.

Table 4 provides a summary of the four CNN models in the study in terms of the architecture of the algorithm model.

The identification of image objects is an interesting topic in the field of computer vision. Object identification applications can be used in search engines, biometric security, attendance engines, or traffic security [15,16]. With the development of automatic machines and computers, the development of object identification algorithms such as R-CNN is also advancing rapidly. The R-CNN object classification approach uses a deep learning system to recognize objects. The proposed region becomes very important for the performance of R-CNN to identify individual objects accurately. Research is developing not only in terms of the speed of the algorithm, but also in terms of the object of research that is adapted to the desired application [17,18,19].

Object identification research has also begun to be used for applications in the field of culture, for example, to identify ancient objects both in two dimensions (manuscripts) and three dimensions (reliefs/statues) [20]. Ancient objects are very vulnerable and not very informative due to their age, so having digital data that can provide more information about these ancient objects will be very useful. Several researchers have found different methods to facilitate the reading and copying of damaged manuscripts [21,22]. The following Table 5. is a resume of the research related to ancient manuscripts.

4. Discussion

4.1. Development of Object Identification Research

The research on the identification of manuscript objects in the last ten years has grown quite significantly. Research results provided in the form of articles have increased based on the number of publications from 2011 to 2021, reaching 1041 articles after filtering redundancies from three databases. The identification of historical objects is more related to 3D objects [29], while research on identifying manuscript objects with non-Latin letters is still limited [30,31]. Several studies of ancient Sundanese manuscripts have been carried out [32,33], but the medium of this manuscript is palm leaves, therefore research on ancient Sundanese manuscripts using paper media is still lacking.

In image processing, the role of media objects is very influential in determining the right algorithm. Image processing goes through several stages, and if one stage uses another method, it can be said that the process uses a different algorithm.

4.2. CNN Algorithm Development for Object Identification

Articles published on the topic of object identification using CNN are generally related to deep learning [34,35], feature extraction [36], and classification [19]. After Girshick proposed a region-based convolution network for accurate object detection and segmentation [11], many researchers used R-CNN for more accurate object identification. Research using CNN develops the search for the fastest algorithm with high accuracy and precision, starting with the Fast R-CNN algorithm [13], Faster R-CNN [14], and Mask R-CNN [10].

The CNN algorithm is an algorithm that uses a convolution technique to obtain a sufficient amount of training data for object classification [37,38]. However, the amount of training data will affect the time-consuming training process and overfitting. Overfitting can occur due to too much training data, so the algorithm loses the ability to generalize [39]. R-CNN is proposed to reduce this overfitting problem [11]. The R-CNN algorithm uses the SVM classification technique, which is an additional module of the CNN algorithm circuit, thus increasing the processing time. Therefore, Fast R-CNN is utilized to correct deficiencies in R-CNN, namely, not using the SVM module, but using the Softmax probability as in CNN. Research continues on the search for the fastest algorithm, such as Faster R-CNN, which uses the region of interest (RoI) as the input for classification [14] so that it is ensured that only inputs with the considered characteristics will be classified. The next algorithm development is Mask R-CNN, which is based on the Faster R-CNN algorithm [10] and adds one input channel, namely, RoI Align, so that in addition to this algorithm’s rapid detection capability, it is also possible to perform other tasks; for example, in addition to detecting faces, it can also detect poses on the screen in the same picture.

4.3. Convolutional Neural Network Algorithm and Open Innovation Engineering

In the development of science and technology, artificial intelligence has been used in various fields of life, and in-depth research on artificial intelligence has increased significantly. Research on the problem of target detection has achieved a high level of accuracy, but there are still many obstacles, especially for the detection of small targets against complex backgrounds that are often found in the fields of medicine, agriculture, or transportation [40,41]. This means that research in the field of target detection continues to increase, with various innovations being devised [42,43,44].

Research innovation using CNN utilizes the latest Faster R-CNN and Mask R-CNN algorithms, which have been tested and shown to have the best processing speed [45,46]. Research in the field of history and culture is very helpful for local governments to preserve cultural heritage and provide public knowledge of its cultural history. The CNN algorithm has been used in several studies with text objects [47], especially ancient text manuscripts in the fields of history and culture, which are prone to the loss of information, as shown in Table 2. The CNN algorithm is powerful enough to detect objects more quickly and accurately [48], although the challenge of detecting objects among larger ones and against more complex backgrounds [49,50] has led many researchers to use this algorithm as a basis for development and innovation to achieve better research results [51,52].

5. Conclusions

Identification applications are widely used in everyday life, such as in biometric detection for the benefit of employee attendance within a company or vehicle number plate detection for the benefit of road users’ safety. Thus, object identification research is of particular importance for further applications in human life.

The development of various object identification algorithms with CNN architecture has been discussed in this article. The discussion began with region CNN, which implements a bounding box, and then Fast R-CNN, which has a better processing speed than R-CNN, and finally Faster R-CNN, with the best processing speed in identifying objects. Mask R-CNN was then introduced as an object identification algorithm that is more accurate and has the best processing speed because it was developed based on the Faster R-CNN algorithm. The topic of new algorithms as a development of the existing R-CNN algorithms is still open for further study.

The R-CNN algorithm can also be developed to assist the government in preserving culture and increase public understanding of historical heritage by developing an ancient manuscript object identification algorithm. There are very few papers concerning the object identification of ancient manuscripts using CNN compared to the number of ancient manuscripts in Indonesia. Therefore, the research of ancient manuscripts is a critical challenge, and is quite an interesting approach to use to create a more effective object identification algorithm.

Author Contributions

Conceptualization, Y.P.; methodology, Y.P.; writing—original draft preparation, Y.P.; writing—review and editing, Y.P. and B.N.R.; supervision, B.N.R., S.H. and J.R.; funding acquisition, B.N.R. All authors have read and agreed to the published version of the manuscript.

Funding

The APC was funded by Universitas Padjadjaran through Academic Leadership Grant with contract number: 2203/UN6.3.1/PT00/2022.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank the Rector of the Universitas Padjadjaran, Indonesia, which financed this research through collaboration with MDPI and the Academic Leadership Grant.

Conflicts of Interest

The authors declare no conflict of interest.

References

Xu, C.; Shi, C.; Bi, H.; Liu, C.; Yuan, Y.; Guo, H.; Chen, Y. A Page Object Detection Method Based on Mask R-CNN. IEEE Access 2021, 9, 143448–143457. [Google Scholar] [CrossRef]
Reddy, S.P.K.; Kandasamy, G. Cusp Pixel Labelling Model for Objects Outline Using R-CNN. IEEE Access 2022, 10, 8883–8890. [Google Scholar] [CrossRef]
Khayyat, M.M.; Elrefaei, L.A. Manuscripts Image Retrieval Using Deep Learning Incorporating a Variety of Fusion Levels. IEEE Access 2020, 8, 136460–136486. [Google Scholar] [CrossRef]
Qin, Z.; Yu, F.; Liu, C.; Chen, X. How convolutional neural network see the world—A survey of convolutional neural network visualization methods. Am. Inst. Math. Sci. 2018, 1, 149–180. [Google Scholar] [CrossRef] [Green Version]
Kesiman, M.W.A.; Valy, D.; Burie, J.-C.; Paulus, E.; Suryani, M.; Hadi, S.; Verleysen, M.; Chhun, S.; Ogier, J.-M. Benchmarking of document image analysis tasks for palm leaf manuscripts from southeast Asia. J. Imaging 2018, 4, 43. [Google Scholar] [CrossRef]
Mao, H.; Yao, S.; Tang, T.; Li, B.; Yao, J.; Wang, Y. Towards Real-Time Object Detection on Embedded Systems. IEEE Trans. Emerg. Top. Comput. 2018, 6, 417–431. [Google Scholar] [CrossRef]
Albawi, S.; Mohammed, T.A.; Al-Zawi, S. Understanding of a convolutional neural network. In Proceedings of the 2017 international conference on engineering and technology, Antalya, Turkey, 21–23 August 2017; pp. 1–6. [Google Scholar]
Gaus, Y.F.A.; Bhowmik, N.; Akcay, S.; Breckon, T. Evaluating the Transferability and Adversarial Discrimination of Convolutional Neural Networks for Threat Object Detection and Classification within X-Ray Security. In Proceedings of the 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA), Boca Raton, FL, USA, 16–19 December 2019. [Google Scholar]
Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 580–587. [Google Scholar]
He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask R-CNN. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 2980–2988. [Google Scholar]
Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. 2015, 38, 142–158. [Google Scholar] [CrossRef]
Espejo, L.R.; Vázquez, M.S.G.; Acosta, A.A.R. Optimization of the keypoint density-based region proposal for R-CNN. In Proceedings of the Optics and Photonics for Information Processing XII, San Diego, CA, USA, 19–20 August 2018; Volume 10751, p. 107510S. [Google Scholar]
Girshick, R. Fast R-CNN. In Proceedings of the IEEE international conference on computer vision, Santiago, Chile, 7–13 December 2015; pp. 1440–1448. [Google Scholar]
Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intellegence 2017, 39, 1137–1149. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Gamero, E.D.T. Object Detection in Videos Using Principal Component Pursuit and Convolutional Neural Networks. Available online: Tesis.pucp.edu.pe (accessed on 1 December 2021).
Lv, X.; Wang, A.; Liu, Q.; Sun, J.; Zhang, S. Proposal-Refined Weakly Supervised Object Detection in Underwater Images. In Proceedings of the IEEE International Conference on Image Processing, Taipei, Taiwan, 22–25 September 2019. [Google Scholar]
Liu, T.; Wan, J.; Yu, T.; Lei, Z.; Li, S.Z. Age Estimation Based on Multi-Region Convolutional Neural Network. In Proceedings of the Chinese Conference on Biometric Recognition (CCBR), Chengdu, China, 14–16 October 2016; pp. 186–194. [Google Scholar]
Nagi, J.; Ducatelle, F.; Di Caro, G.A.; Cireçsan, D.; Meier, U.; Giusti, A.; Nagi, F.; Schmidhuber, J.; Gambardella, L.M. Max-pooling convolutional neural networks for vision-based hand gesture recognition. In Proceedings of the IEEE International Conference on Signal and Image Processing Applications(ICSIPA), Orlando, FL, USA, 30 September–3 October 2012; pp. 342–347. [Google Scholar]
Dalferth, J.; Winkelmann, S.; Schwenker, F. Using Mask R-CNN for Image-Based Wear Classification of Solid Carbide Milling and Drilling Tools. In Proceedings of the IAPR Workshop on Artificial Neural Networks in Pattern Recognition, Winterthur, Switzerland, 2–4 September 2020. [Google Scholar]
Fitri, A.; Widartono, B.S. Visualisasi 3 Dimensi Kawasan Cagar Budaya Menggunakan Cityengine dengan Wahana Quadkopter “Kompleks Candi Ijo, Kec. Prambanan, Yogyakarta”. J. Bumi Indones. 2017, 6, 1–8. [Google Scholar]
Rahmaningtyas, D.; Akmal, A.; Hadi, S. Analisis Perbandingan Kinerja Metode Binerisasi terhadap Citra Lontar Sunda Kuno. J. Inform. 2017, 1, 27. [Google Scholar] [CrossRef] [Green Version]
Sh Hagaggi, N.A.; Ayman Salah, T. Microbial deterioration of a 13 AH-century manuscript housed in Al-Azhar library in Egypt: A case study. J. Basic Environ. Sci. 2016, 3, 65–73. [Google Scholar]
Shima, T.; Terasawa, K.; Kawashima, T. Image Processing for Historical Newspaper Archives. In Proceedings of the 2011 Workshop on Historical Document Imaging and Processing, Beijing, China, 16–17 September 2011; pp. 127–132. [Google Scholar]
Widiarti, A.R.; Marsono; Harjoko, A.; Hartati, S. Combination of statistic and structural approach to scripts segmentation from line segmentation of Javanese manuscript image. In Proceedings of the 2013 Digital Heritage International Congress, Marseille, France, 28 October–1 November 2013; p. 775. [Google Scholar]
Sabeenian, R.S.; Paramasivam, M.E.; Dinesh, P.M. Appraisal of localized binarization methods on Tamil palm-leaf manuscripts. In Proceedings of the International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET), Chennai, India, 23–25 March 2016; pp. 793–797. [Google Scholar]
Paulus, E.; Hadi, S.; Suryani, M.; Suryana, I.; Simanjuntak, Y. Evaluating Ancient Sundanese Glyph Recognition Using Convolutional Neural Network. J. Phys. Conf. Ser. 2019, 1235, 012063. [Google Scholar] [CrossRef]
Chamchong, R.; Gao, W.; McDonnell, M.D. Thai Handwritten Recognition on Text Block-Based from Thai Archive Manuscripts. In Proceedings of the International Conference on Document Analysis and Recognition (ICDAR), Sydney, Australia, 20–25 September 2019; pp. 1346–1351. [Google Scholar]
Alaasam, R.; Kurar, B.; Kassis, M.; El-Sana, J. Experiment study on utilizing convolutional neural networks to recognize historical Arabic handwritten text. In Proceedings of the 1st International Workshop on Arabic Script Analysis and Recognition (ASAR), Nancy, France, 3–5 April 2017; pp. 124–128. [Google Scholar]
Permata, E. Identifikasi Obyek Benda Tajam Menggunakan Pengolahan Citra Digital Pada Citra X-Ray. Volt 2016, 1, 1–14. [Google Scholar]
Adam, K.; Al-Maadeed, S.; ABouridane, A. Letter-based classification of Arabic scripts style in ancient Arabic manuscripts. In Proceedings of the 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR), Nancy, France, 3–5 April 2017; pp. 95–98. [Google Scholar]
Yahya, S.R.; Abdullah, S.N.H.S.; Omar, K.; Zakaria, M.S.; Liong, C.-Y. Review on image enhancement methods of old manuscript with the damaged background. In Proceedings of the Proc. 2009 International Conference on Electrical Engineering and Informatics, ICEEI, Selangor, Malaysia, 5–7 August 2009; Volume 1, pp. 62–67. [Google Scholar]
Suryani, M.; Hadi, S.; Paulus, E.; Yulita, I.N.; Supriatna, A.K.; Supriatna, A.K. Sundanese ancient manuscripts search engine using probability approach. J. Phys. Conf. Ser. 2017, 893, 012064. [Google Scholar] [CrossRef] [Green Version]
Kesiman, M.W.A.; Valy, D.; Burie, J.-C.; Paulus, E.; Sunarya, I.M.G.; Hadi, S.; Sok, K.H.; Ogier, J.-M. Southeast Asian palm leaf manuscript images: A review of handwritten text line segmentation methods and new challenges. J. Electron. Imaging 2016, 26, 011011. [Google Scholar] [CrossRef]
Altini, N.; Cascarano, G.D.; Brunetti, A.; De Feudis, I.; Buongiorno, D.; Rossini, M.; Pesce, F.; Gesualdo, L.; Bevilacqua, V. A Deep Learning Instance Segmentation Approach for Global Glomerulosclerosis Assessment in Donor Kidney Biopsies. Electronics 2020, 9, 1768. [Google Scholar] [CrossRef]
Li, H.; Zhang, C.; Song, N.; Li, H. Deep Learning-Based Interference Fringes Detection Using Convolutional Neural Network. IEEE Photonics J. 2019, 11, 1–14. [Google Scholar] [CrossRef]
Souza, L.F.; Holanda, G.B.; Silva, F.H.; Alves, S.S.; Filho, P.P. Automatic Lung Segmentation in CT Images Using Mask R-CNN for Mapping the Feature Extraction in Supervised Methods of Machine Learning. Intell. Syst. 2019, 140–149. [Google Scholar] [CrossRef]
Mikhaylov, A.; Tarakanov, S. Development of levenberg-marquardt theoretical approach for electric networks. J. Phys. Conf. Ser. 2020, 1515, 052006. [Google Scholar] [CrossRef]
An, J.; Mikhaylov, A.; Kim, K. Machine learning approach in heterogeneous group of algorithms for transport safety-critical system. Appl. Sci. 2020, 10, 2670. [Google Scholar] [CrossRef]
Abbas, S.M.; Singh, S.N. Region-based object detection and classification using faster R-CNN. In Proceedings of the 2018 4th International Conference on Computational Intelligence & Communication Technology (CICT), Ghaziabad, India, 9–10 February 2018. [Google Scholar]
Zhao, Z.; Li, X.; Liu, H.; Xu, C. Improved Target Detection Algorithm Based on Libra R-CNN. IEEE Access 2020, 8, 114044–114056. [Google Scholar] [CrossRef]
Ullo, S.L.; Mohan, A.; Sebastianelli, A.; Ahamed, S.E.; Kumar, B.; Dwivedi, R.; Sinha, G.R. A New Mask R-CNN-Based Method for Improved Landslide Detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 3799–3810. [Google Scholar] [CrossRef]
Guo, Q.; Liu, L.; Xu, W.; Gong, Y.; Zhang, X.; Jing, W. An Improved Faster R-CNN for High-Speed Railway Dropper Detection. IEEE Access 2020, 8, 105622–105633. [Google Scholar] [CrossRef]
Cao, C.; Wang, B.; Zhang, W.; Zeng, X.; Yan, X.; Feng, Z.; Liu, Y.; Wu, Z. An Improved Faster R-CNN for Small Object Detection. IEEE Access 2019, 7, 106838–106846. [Google Scholar] [CrossRef]
Zhai, S.; Dong, S.; Shang, D.; Wang, S. An Improved Faster R-CNN Pedestrian Detection Algorithm Based on Feature Fusion and Context Analysis. IEEE Access 2020, 8, 138117–138128. [Google Scholar] [CrossRef]
Lin, S.; Jiang, Y.; Chen, X.; Biswas, A.; Li, S.; Yuan, Z.; Wang, H.; Qi, L. Automatic Detection of Plant Rows for a Transplanter in Paddy Field Using Faster R-CNN. IEEE Access 2020, 8, 147231–147240. [Google Scholar] [CrossRef]
Xu, X.; Zhao, M.; Shi, P.; Ren, R.; He, X.; Wei, X.; Yang, H. Crack Detection and Comparison Study Based on Faster R-CNN and Mask R-CNN. Sensors 2022, 22, 1215. [Google Scholar] [CrossRef]
Li, Y.; Yin, C. Application of Dual-Channel Convolutional Neural Network Algorithm in Semantic Feature Analysis of English Text Big Data. Comput. Intell. Neurosci. 2021, 2021, 1–15. [Google Scholar] [CrossRef]
Valdez-Rodríguez, J.E.; Calvo, H.; Felipe-Riverón, E.; Moreno-Armendáriz, M.A. Improving Depth Estimation by Embedding Semantic Segmentation: A Hybrid CNN Model. Sensors 2022, 22, 1669. [Google Scholar] [CrossRef] [PubMed]
Gawande, U.; Hajari, K.; Golhar, Y. SIRA: Scale illumination rotation affine invariant mask R-CNN for pedestrian detection. Appl. Intell. 2022, 52, 10398–10416. [Google Scholar] [CrossRef]
Zhang, J.; Cosma, G.; Watkins, J. Image Enhanced Mask R-CNN: A Deep Learning Pipeline with New Evaluation Measures for Wind Turbine Blade Defect Detection and Classification. J. Imaging 2021, 7, 46. [Google Scholar] [CrossRef]
Zhou, G.; Zhang, W.; Chen, A.; He, M.; Ma, X. Rapid Detection of Rice Disease Based on FCM-KM and Faster R-CNN Fusion. IEEE Access 2019, 7, 143190–143206. [Google Scholar] [CrossRef]
Han, C.; Li, G.; Ding, Y.; Yan, F.; Bai, L. Chimney detection based on faster r-cnn and spatial analysis methods in high resolution remote sensing images. Sensors 2020, 20, 4353. [Google Scholar] [CrossRef]

Figure 1. PRISMA systematic literature review procedure.

Figure 2. CNN’s relationship with other fields of science.

Figure 3. CNN algorithm innovation.

Figure 4. R-CNN test-time speed.

Table 1. Four types of keywords.

Type	Keywords
I	Object Identification AND Ancient Manuscript AND R-CNN OR Region Convolutional Neural Network
II	Keyword I OR Fast R-CNN
III	Keyword II OR Faster R-CNN
IV	Keyword III OR Mask R-CNN

Table 2. Number of publications from three databases with four types of keywords.

Keyword	I	II	III	IV
Science Direct	24,739	21,163	942	943
Dimensions AI	185,986	170,753	1516	18
Google Scholar	34,500	23,500	2570	80

Table 3. Results of the semi-automatic and manual selection.

Filter	1	2	3	4
Science Direct	242	143	21	3
Dimensions AI	18	12	8	9
Google Scholar	75	71	31	20
N	335	226	60	32

Table 4. Summary of the four CNN algorithm models.

CNN Model	Author	Approach Used	Objective	Result
R-CNN	[11,12]	Labelling data regions Feature point density (SIFT)	Region selection	Effectively speeds up the processing time
Fast R-CNN	[13]	RoI pooling Probability Softmax	Process acceleration	Faster than R-CNN
Faster R-CNN	[14]	Identifying regional proposals is done with a separate network (RPN)	Process acceleration	Faster than Fast R-CNN
Mask R-CNN	[10]	Prediction of object masking for bounding box recognition Developed from faster R-CNN RoI align	Facilitate training Task generalization	Can perform other tasks in the same framework

Table 5. Summary of ancient manuscript research.

No	Author	Titles	Research Object
1	[23]	Image Processing for Historical Newspaper Archives	Improve ancient image processing methods and degraded document images Image processing method with character segmentation (Hough Transform)
2	[24]	Combination of statistic and structural approach to scripts segmentation from line segmentation of Javanese manuscript image	Combines statistical and structural analysis to generate Java scripts from line segmentation of Java script drawings Characters in the script are identified using the interconnect operation to identify the components of the script
3	[25]	Appraisal of localized binarization methods on Tamil palm-leaf manuscripts	Localized binary method for storing text information from digital Tamil manuscript images
4	[26]	Evaluating Ancient Sundanese Glyph Recognition using Convolutional Neural Network	CNN algorithm for pattern recognition of ancient Sundanese manuscript in lontar media
5	[5]	Benchmarking of document image analysis tasks for palm leaf manuscripts from Southeast Asia	Palm-leaf manuscript image analysis
6	[27]	Thai Handwritten Recognition on Text Block-Based from Thai Archive Manuscripts	Thai handwriting recognition using CNN
7	[28]	Historical Arabic Manuscripts Text Recognition Using Convolutional Neural Network	Arabic text recognition using CNN

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Permanasari, Y.; Ruchjana, B.N.; Hadi, S.; Rejito, J. Innovative Region Convolutional Neural Network Algorithm for Object Identification. J. Open Innov. Technol. Mark. Complex. 2022, 8, 182. https://doi.org/10.3390/joitmc8040182

AMA Style

Permanasari Y, Ruchjana BN, Hadi S, Rejito J. Innovative Region Convolutional Neural Network Algorithm for Object Identification. Journal of Open Innovation: Technology, Market, and Complexity. 2022; 8(4):182. https://doi.org/10.3390/joitmc8040182

Chicago/Turabian Style

Permanasari, Yurika, Budi Nurani Ruchjana, Setiawan Hadi, and Juli Rejito. 2022. "Innovative Region Convolutional Neural Network Algorithm for Object Identification" Journal of Open Innovation: Technology, Market, and Complexity 8, no. 4: 182. https://doi.org/10.3390/joitmc8040182

Article Menu

Innovative Region Convolutional Neural Network Algorithm for Object Identification

Abstract

1. Introduction

2. Materials and Methods

2.1. Scientific Article Data

2.2. Selection of Literature

2.3. Methods and Systematic Data Analysis

3. Results

3.1. Article Data Visualization

3.2. Object Identification with R-CNN, Fast R-CNN, Faster R-CNN, and Mask R-CNN

4. Discussion

4.1. Development of Object Identification Research

4.2. CNN Algorithm Development for Object Identification

4.3. Convolutional Neural Network Algorithm and Open Innovation Engineering

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI