Writer Identification Using Handwritten Cursive Texts and Single Character Words
Abstract
:1. Introduction
2. Materials
2.1. Secure Password Dataset
2.2. IAM Online Handwriting Dataset
2.3. Data Adaptation
3. Methods for Feature Extraction
3.1. Geometrical Features
- REGRESSION_LOWER_HORANGLE (Table 1 #1): Sum of points under horizontal axis (Equation (1));
- REGRESSION_ON_HORANGLE (Table 1 #2): Sum of points at horizontal axis (Equation (2));
- CENTRAL_POINT (Table 1 #3): Number of the central point of the Signature in x direction (Equation (3));
- REGRESSION_UPPER_HORANGLE (Table 1 #4): Sum of points above horizontal axis (Equation (4));
- EUCLID (Table 1 #5): Considers the Euclidean distance between the single points of the segment.
- HORIZONTAL_ POINT_ANGLE (Table 1 #7): This angle is calculated from the horizontal corner between the terminal points of the segments of the whole signature. To calculate this angle, a point-displaced eight units to the left from the center of rectangle has to be selected in this implementation. (see Figure 5, Equation (6)).
- SPHI (Segment length distance relation; Table 1 #8): The length of the segment (Euclidean distance of all points) divided by the distance between the starting point and endpoint of the segment, (Equation (7));
3.2. Statictical Features
- POINTS (Table 1 #19): Is the number of the pixels n occupied for the signature on the display.
- SEGMENTS (Table 1 #20): consider the number of segments of the signature. A new segment begins when the display is touched by pencil, to begin the writing process, and ends when the pencil is removed from the display.
- The number of pen-ups NUM_STROKES (Table 1 #21); actually equivalent to the number of segments, hence it can be directly calculated according to the index in raw data.
- MEAN_NUMBER_POINTS/SEGMENT (Table 1 #24): Average of all points per segment (see Equation (11));
- DTW(x,y,t) (Table 1 #25,#26,#27): (Dynamic Time Warping Distance feature): The Dynamic Time Warping Distance is calculated by m and n, where the dimensional vectors are s[0 … m], t[0 … n]. The DTW Matrix has n rows, m columns and the first column and row are initialized by infinity. For every pair si and ti, the Euclidean distance is calculated, and the result is called cost. The minimum of DTWi,j, DTWi,j−1 and DTWi−1,j−1 are added for the cost. This is done for every index in the DTW Matrix. The Result of the DTW is stored at DTWn,m. (see Equations (12) and (13));
3.3. Temporal Features
- TIME (Table 1 #52): Whole time user needs for one secure password sample or cursive text sample.
- The Relative Duration of Writing RELATIVE_WRITING_DURATION (Table 1 #53) defined as below (see Equation (20));It can be calculated according to the timestamps and index in raw data.
- TIME_V_MAX (Table 1 #54): Point in time of overall maximum speed.
- RELATION_TIME_GAP_ALL (Table 1 #55): Relation Time to the number of gaps.
4. Methods for Feature Reduction and Classification System
4.1. Methods of Feature Reduction
- Fisher Score: The Generalized Fisher Score [10] is a joint feature selection criterion, which aims at finding a subset of features and maximizes the lower bound of the traditional Fisher Score. It also resolves redundant problems in the feature selection process. The mathematical description of the Fisher Score is shown as below in Equation (29).
- j—the j-th feature.
- i—the i-th class, which could be interpreted as the i-th subject in our test.
- n—the size of the instances for a certain class
- μ—the mean value for a certain class
- σ—the standard deviation for a certain class
- Correlation analysis: Although there are many attributes which are correlated, this proposal removes attributes which have a correlation coefficient above ±0.9. For this purpose, all attributes will be compared with each other. However, before an attribute gets discarded, we compare the Fisher Score of both attributes. If the score of the actual attributing better than the other score of the compared attribute, then the attribute gets marked for removal. When all other attributes are tested and no other attribute is better than the actual, all marked attributes get removed. However, if there is one Attribute which is better than the actual attribute, then it gets removed and the marked ones get unmarked.
- Info Gain Attribute Evaluation: At second, Information Gain Attribute Evaluation (IG) [44] is used for ranking. This ranker evaluates the worth of an attribute by measuring the information gain with respect to the class. The mathematical description of Information Gain Attribute Evaluation is shown as below in Equation (30).
- H(Y) is the entropy of Y.
- H(Y/X) is the entropy of Y after observing X.
- p(y) is the marginal probability density function for the random variable Y.
- p(y/x) is the conditional probability of y given x.
4.2. Methods of Classification Systems
- KNN: For KNN (value of k = 1 for all experiments), we use the distance between an instance and the centroid of a class as the decision threshold. To calculate the ROC-Curve, we determine the greatest distance over all test instances. Then we alter the threshold from 0 to this determined greatest value. When the distance of a test instance is greater than the current decision threshold, we refuse the user. For every altered threshold, we calculate the TPR and FPR.
- Naïve Bayes and Bayes Net: For every instance, we calculate the class probability. If the probability is higher than the threshold, the user is accepted. As higher class probabilities occur more often than lower probabilities, we use a higher resolution for higher probabilities.
5. Experimental Methodology
6. Experiments and Results
6.1. Results for Different Feature Sets in Secure Password DB 150
6.2. Results for Different Feature Sets in IAM Online Handwriting DB
7. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Bouadjenek, N.; Nemmour, H.; Chibani, Y. Fuzzy Integral for Combining SVM-based Handwritten Soft-biometrics Prediction. In Proceedings of the 2016 12th IAPR Workshop on Document Analysis Systems (DAS), Santorini, Greece, 11–14 April 2016; pp. 311–316. [Google Scholar] [CrossRef]
- Gattal, A.; Djeddi, C.; Siddiqi, I.; Chibani, Y. Gender classification from offline multi-script handwriting images using oriented Basic Image Features (oBIFs). Expert Syst. Appl. 2018, 99, 155–167. [Google Scholar] [CrossRef]
- He, S.; Schomaker, L. General Pattern Run-Length Transform for Writer Identification. In Proceedings of the 2016 12th IAPR Workshop on Document Analysis Systems (DAS), Santorini, Greece, 11–14 April 2016; pp. 60–65. [Google Scholar] [CrossRef]
- Frery, J.; Largeron, C.; Juganaru-Mathieu, M. Author Identification by Automatic Learning. In Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, France, 23–26 August 2015; pp. 181–185. [Google Scholar] [CrossRef]
- Pal, S.; Alaei, A.; Pal, U.; Blumenstein, M. Performance of an Off-line Signature Verification Method based on Texture Features on a Large Indic-script Signature Dataset. In Proceedings of the 2016 12th IAPR Workshop on Document Analysis Systems (DAS), Santorini, Greece, 11–14 April 2016; pp. 72–77. [Google Scholar] [CrossRef]
- Fischer, A.; Diaz, M.; Plamondon, R.; Ferrer, M.A. Robust Score Normalization for DTW-Based On-Line Signature Verification. In Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, France, 23–26 August 2015; pp. 241–245. [Google Scholar] [CrossRef]
- Impedovo, D.; Pirlo, G.; Plamondon, R. Handwritten signature verification: New advancements and open issues. In Proceedings of the 2012 International Conference on Frontiers in Handwriting Recognition, Bari, Italy, 18–20 September 2012; pp. 367–372. [Google Scholar] [CrossRef]
- Adak, C.; Chaudhuri, B.B. Writer Identification from Offline Isolated Bangia Characters and Numerals. In Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, France, 23–26 August 2015; pp. 486–490. [Google Scholar] [CrossRef]
- Raje, S.; Mehrotra, K.; Belhe, S. Writer Adaptation of Online Handwritten Recognition using Adaptive RBF Network. In Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, France, 23–26 August 2015; pp. 691–695. [Google Scholar] [CrossRef]
- Obaidullah, S.M.; Halder, C.; Das, N.; Roy, K. A new dataset of word-level offline handwritten numeral images from four official Indic scripts and its benchmarking using image transform fusion. Int. J. Intell. Eng. Inform. 2016, 4, 1–20. [Google Scholar] [CrossRef]
- Narwade, P.N.; Sawant, R.R.; Bonde, S.V. Offline Handwritten Signature Verification Using Cylindrical Shape Context. 3D Res. 2018, 9, 48. [Google Scholar] [CrossRef]
- Alpar, O.; Krejcar, O. Online signature verification by spectrogram analysis. Appl. Intell. 2018, 48, 1189–1199. [Google Scholar] [CrossRef]
- Hafemann, L.G.; Oliveira, L.S.; Sabourin, R. Fixed-sized representation learning from offline handwritten signatures of different sizes. Int. J. Doc. Anal. Recognit. 2018, 21, 219–232. [Google Scholar] [CrossRef] [Green Version]
- Taskiran, M.; Cam, Z.G. Offline Signature Identification via HOG Features and Artificial Neural Networks. In Proceedings of the 2017 15th International Symposium on Applied Machine Intelligence and Informatics (SAMI), Herl’any, Slovakia, 26–28 January 2017; pp. 83–86. [Google Scholar]
- Pesch, H.; Hamdani, M.; Forster, J.; Ney, H. Analysis of Preprocessing Techniques for Latin Handwriting Recognition. In Proceedings of the 2012 13th International Conference on Frontiers in Handwriting Recognition (ICFHR), Bari, Italy, 18–20 September 2012; pp. 280–284. [Google Scholar] [CrossRef]
- Balbed, M.A.M.; Ahmad, S.M.S.; Shakil, A. ANOVA-Based Feature Analysis and Selection in HMM-Based Offline Signature Verification System. In Proceedings of the 2009 Innovative Technologies in Intelligent Systems and Industrial Applications CITISIA, Monash, Malaysia, 25–26 July 2009; pp. 66–69. [Google Scholar] [CrossRef]
- Ahmed, K.; El-Henawy, I.M.; Rashad, M.Z.; Nomir, O. On-line signature verification based on PCA feature reduction and statistical analysis. In Proceedings of the 2010 International Conference on Computer Engineering & Systems (ICCES), 30 November–2 December 2010; pp. 3–8. [Google Scholar] [CrossRef]
- Indermühle, E.; Liwicki, M.; Bunke, H. Combining Alignment Results for Historical Handwritten Document Analysis. In Proceedings of the 2009 10th International Conference on Document Analysis and Recognition, Barcelona, Spain, 26–29 July 2009; pp. 1186–1190. [Google Scholar] [CrossRef]
- Fischer, A.; Bunke, H. Character prototype selection for handwriting recognition in historical documents. In Proceedings of the 2011 19th European Signal Processing Conference, Barcelona, Spain, 29 August–2 September 2011; pp. 1435–1439. [Google Scholar]
- Gattal, A.; Djeddi, C.; Chibani, Y.; Siddiqi, I. Isolated Handwritten Digit Recognition Using oBIFs and Background Features. In Proceedings of the 2016 12th IAPR Workshop on Document Analysis Systems (DAS), Santorini, Greece, 11–14 April 2016; pp. 305–310. [Google Scholar] [CrossRef]
- Jain, R.; Doermann, D. Combining Local Features for Offline Writer Identification. In Proceedings of the 2014 14th International Conference on Frontiers in Handwriting Recognition, Heraklion, Greece, 1–4 September 2014; pp. 583–588. [Google Scholar] [CrossRef]
- Simistira, F.; Katsouros, V.; Carayannis, G. Recognition of online handwritten mathematical formulas using probabilistic SVMs and stochastic context free grammars. Pattern Recognit. Lett. 2015, 53, 85–92. [Google Scholar] [CrossRef]
- Shaw, B.; Bhattacharya, U.; Parui, S.K. Combination of Features for Efficient Recognition of Offline Handwritten Devanagari Words. In Proceedings of the 2014 14th International Conference on Frontiers in Handwriting Recognition, Heraklion, Greece, 1–4 September 2014; pp. 240–245. [Google Scholar] [CrossRef]
- Jang, W.; Kim, S.; Kim, Y.; Lee, E.C. Automated Verification Method of Korean Word Handwriting Using Geometric Feature. In Advances in Computer Science and Ubiquitous Computing. CUTE 2017, CSA 2017; Park, J., Loia, V., Yi, G., Sung, Y., Eds.; Lecture Notes in Electrical Engineering; Springer: Singapore, 2018; Volume 474, pp. 1340–1345. [Google Scholar]
- Gupta, J.D.; Samanta, S.; Chanda, B. Ensemble classifier-based off-line handwritten word recognition system in holistic approach. IET Image Process. 2018, 12, 1467–1474. [Google Scholar] [CrossRef]
- Giotis, A.R.; Gerogiannis, D.R.; Nikou, C. Word Spotting in Handwritten Text Using Contour-based Models. In Proceedings of the 2014 14th International Conference on Frontiers in Handwriting Recognition, Heraklion, Greece, 1–4 September 2014; pp. 399–404. [Google Scholar] [CrossRef]
- Griechisch, E.; Malik, M.I.; Liwicki, M. Online Signature Verification based on Kolmogorov-Smirnov Distribution Distance. In Proceedings of the 2014 14th International Conference on Frontiers in Handwriting Recognition, Heraklion, Greece, 1–4 September 2014; pp. 738–742. [Google Scholar] [CrossRef]
- Wibowo, C.P.; Thumwarin, P.; Matsuura, T. On-line Signature Verification Based on Forward and Backward Variances of Signature. In Proceedings of the 2014 The 4th Joint International Conference on Information and Communication Technology, Electronic and Electrical Engineering (JICTEE), Chiang Rai, Thailand, 5–8 March 2014; pp. 1–5. [Google Scholar]
- Slim, M.A.; Abdelkrim, A.; Benrejeb, M. An Efficient Handwriting Velocity Modelling for Electromyographic Signals Reconstruction Using Radial Basis Function Neural Networks. In Proceedings of the 2015 7th International Conference on Modelling, Identification and Control (ICMIC), Sousse, Tunisia, 18–20 December 2015; pp. 71–76. [Google Scholar] [CrossRef]
- Davila, K.; Ludi, S.; Zanibbi, R. Using Off-line Features and Synthetic Data for On-line Handwritten Math Symbol Recognition. In Proceeding of the 2014 14th International Conference on Frontiers in Handwriting Recognition, Heraklion, Greece, 1–4 September 2014; pp. 323–328. [Google Scholar] [CrossRef]
- Yamaji, Y.; Shibata, T.; Tonouchi, Y. Online Handwritten Stroke Type Determination Using Descriptors Based on Spatially and Temporally Neighboring Strokes. In Proceedings of the 2014 14th International Conference on Frontiers in Handwriting Recognition, Heraklion, Greece, 1–4 September 2014; pp. 116–121. [Google Scholar]
- Al-Hmouz, R.; Pedrycz, W.; Daqrouq, K.; Morfeq, A.; Al-Hmouz, A. Quantifying dynamic time warping distance using probabilistic model in verification of dynamic signatures. Soft Comput. 2019, 23, 407–418. [Google Scholar] [CrossRef]
- Parziale, A.; Fuschetto, S.G.; Marcelli, A. Exploiting stability regions for online signature verification. In New Trends in Image Analysis and Processing; Lecture Notes in Computer Science; Springer: Basel, Switzerland, 2013; Volume 8158, pp. 112–121. [Google Scholar]
- Slim, M.A.; Abdelkrim, A.; Benrejeb, M. Handwriting Velocity Modeling by Sigmoid Neural Networks with Bayesian Regularization. In Proceedings of the 2014 International Conference on Electrical Sciences and Technologies in Maghreb (CISTEM), Tunis, Tunisia, 3–6 November 2014; pp. 1–7. [Google Scholar] [CrossRef]
- Faundez-Zanuy, M.; Sesa-Nogueras, E.; Roure-Alcobe, J.; Esposito, A.; Mekyska, J.; Lopez-de-Ipina, K. A Preliminary Study on Aging Examining Online Handwriting. In Proceedings of the 2014 5th IEEE Conference on Cognitive Infocommunications (CogInfoCom), Vietri sul Mare, Italy, 5–7 November 2014; pp. 221–224. [Google Scholar] [CrossRef]
- Nguyen, C.T.; Zhu, B.L.; Nakagawa, M. A semi-incremental recognition method for on-line handwritten English text. In Proceedings of the 2014 14th International Conference on Frontiers in Handwriting Recognition, Heraklion, Greece, 1–4 September 2014; pp. 234–239. [Google Scholar] [CrossRef]
- Aubin, V.; Mora, M.; Santos-Peñas, M. Off-line writer verification based on simple graphemes. Pattern Recognit. 2018, 79, 414–426. [Google Scholar] [CrossRef]
- Vásquez, J.L.; Ravelo-García, A.G.; Alonso, J.B.; Dutta, M.K.; Travieso, C.M. Writer identification approach by holistic graphometric features using off-line handwritten words. Neural Comput. Appl. 2018, 1–14. [Google Scholar] [CrossRef]
- Kalbitz, M.; Scheidat, T.; Vielhauer, C. First investigation of feasibility of contact-less non-destructive optical sensors to detect, acquire and digitally process forensic handwriting based on pressure information. In Proceedings of the 2016 4th International Conference on Biometrics and Forensics (IWBF), Limassol, Cyprus, 3–4 March 2016; pp. 1–6. [Google Scholar] [CrossRef]
- Okawa, M.; Yoshida, K. Off-line writer verification using shape and pen pressure information. In Proceedings of the 2012 International Conference on Frontiers in Handwriting Recognition, Bari, Italy, 18–20 September 2012; pp. 625–630. [Google Scholar] [CrossRef]
- Bhateja, A.K.; Chaudhury, S.; Saxena, P.K. A Robust Online Signature based Cryptosystem. In Proceedings of the 2014 14th International Conference on Frontiers in Handwriting Recognition, Heraklion, Greece, 1–4 September 2014; pp. 79–84. [Google Scholar] [CrossRef]
- Kutzner, T.; Dietze, M.; Bonninger, I.; Travieso, C.M.; Dutta, M.K.; Singh, A. Online Handwriting Verification with Safe Password and Increasing Number of Features. In Proceedings of the 2016 3rd International Conference on Signal Processing and Integrated Networks (SPIN), Noida, India, 11–12 February 2016; pp. 656–661. [Google Scholar] [CrossRef]
- Aubin, V.I.; Doorn, J.H. Exploring New Handwriting Parameters for Writer Identification. Encyclopedia of Information Science and Technology, 4th ed.; IGI-Global: Hershey, Pennsylvania, 2019. [Google Scholar] [CrossRef]
- Lee, P.X.; Ding, J.J.; Wang, T.C.; Lee, Y.-C. Automatic Writer Verification Algorithm for Chinese Characters Semi-Global Features and Adaptive Classifier. In Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), San Diego, CA, USA, 23–27 July 2018; pp. 1–6. [Google Scholar] [CrossRef]
- Kutzner, T.; Bönninger, I.; Travieso, C.M. Nutzer-Authentifizierung mittels handschriftlichen Passworten auf Mobil-Geräten. In 12. Wissenschaftstage der Hochschule Lausitz (FH); University of Applied Sciences: Senftenberg, Germany, 2012. [Google Scholar]
- Venugopal, V.; Sundaram, S. An improved online writer identification framework using codebook descriptors. Pattern Recognit. 2018, 78, 318–330. [Google Scholar] [CrossRef]
- Kutzner, T.; Ye, F.; Bönninger, I.; Travieso, C.M.; Dutta, M.K. User Verification using Safe Handwritten Passwords on Smartphones. In Proceedings of the 2016 9th International Conference on Contemporary Computing (IC3), Noida, India, 11–13 August 2016. [Google Scholar]
- Kutzner, T.; Travieso, C.M.; Bonninger, I.; Alonso, J.B.; Vasquez, J.L. Writer identification on mobile device based on handwritten. In Proceedings of the 2013 47th International Carnahan Conference on Security Technology (ICCST), Medellin, Colombia, 8–11 October 2013; pp. 1–5. [Google Scholar] [CrossRef]
- Liwicki, M.; Bunke, H. IAM-OnDB—An on-line English sentence database acquired from handwritten text on a whiteboard. In Proceedings of the 8th International Conference on Document Analysis and Recognition (ICDAR’05), Seoul, South Korea, 31 August–1 September 2005; pp. 956–961. [Google Scholar] [CrossRef]
- Hafemann, L.G.; Sabourin, R.; Oliveira, L.S. Offline handwritten signature verification—Literature review. In Proceedings of the 2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA), Montreal, QC, Canada, 28 November–1 December 2017; pp. 1–8. [Google Scholar] [CrossRef]
- Sharma, A.; Sundaram, S. On the exploration of information from the DTW cost matrix for online signature verification. IEEE Trans. Cybern. 2017, 48, 611–624. [Google Scholar] [CrossRef] [PubMed]
- Chapran, J. Biometric writer identification: Feature analysis and classification. Int. J. Pattern Recognit. Artif. Intell. 2006, 20, 483–503. [Google Scholar] [CrossRef]
- Schlapbach, A.; Liwicki, M.; Bunke, H. A writer identification system for on-line whiteboard data. Pattern Recognit. 2008, 41, 2381–2397. [Google Scholar] [CrossRef]
- Frank, E.; Hall, M.A.; Witten, I.H. The WEKA Workbench. Online Appendix for “Data Mining: Practical Machine Learning Tools and Techniques”, 4th ed.; Morgan Kaufmann: Auckland, New Zealand, 2016. [Google Scholar]
- Venugopal, V.; Sundaram, S. Online Writer Identification With Sparse Coding-Based Descriptors. IEEE Trans. Inf. Forensics Secur. 2018, 13, 2538–2552. [Google Scholar] [CrossRef]
# | Parameter | Reference | # | Parameter | Reference |
---|---|---|---|---|---|
1 | GEO_REGRESSION_LOWER_HORANGLE | own | 35 | STAT_MIN_VY | own |
2 | GEO_REGRESSION_ON_HORANGLE | own | 36 | STAT_MAX_VX | own |
3 | GEO_CENTRAL_POINT | own | 37 | STAT_MAX_VY | own |
4 | GEO_REGRESSION_UPPER_HORANGLE | own | 38 | STAT_RELATION_VX_MAX | own |
5 | GEO_EUCLID | own | 39 | STAT_RELATION_NVxz | own |
6 | GEO_POINT_ANGLE | own | 40 | STAT_RELATION_NVyz | own |
7 | GEO_HORIZONTAL_POINT_ANGLE | own | 41 | STAT_VERTICAL_SKEWNESS | [53] |
8 | GEO_SPHI | own | 42 | STAT_SPREADNESS | [53] |
9 | GEO_HYP_ANGLE | own | 43 | STAT_INERTIAL_RATIO | [50] |
10 | GEO_REG_ANGLE | own | 44 | STAT_ASPECT_RATIO | [53] |
11 | GEO_WORD_WIDTH | own | 45 | STAT_HORIZONTAL_SKEWNESS | [53] |
12 | GEO_WORD_HEIGHT | own | 46 | STAT_BALANCE_HORIZONTAL_EXTENSION | [53] |
13 | GEO_SURFACE | own | 47 | STAT_BALANCE_VERTICAL_EXTENSION | [53] |
14 | GEO_SLANT_AMPLITUDE | [53] | 48 | STAT_DWH | own |
15 | GEO_SLANT | [53] | 49 | STAT_RELATION_POINTS_SPEED | own |
16 | GEO_ORIENTATION | [53] | 50 | STAT_X_NUMBER_POINTS/SEGMENT | own |
17 | GEO_WIDTH | own | 51 | STAT_Y_NUMBER_POINTS/SEGMENT | own |
18 | GEO_HEIGHT | own | 52 | TEMP_TIME | own |
19 | STAT_POINTS | own | 53 | TEMP_RELATIVE_WRITING_DURATION | own |
20 | STAT_SEGMENTS | own | 54 | TEMP_TIME_V_MAX | own |
21 | STAT_NUM_STROKES | own | 55 | TEMP_RELATION_TIME_GAP_ALL | own |
22 | STAT_STANDARD_DERIVATION_X | own | 56 | TEMP_TIME_MIN_X | own |
23 | STAT_STANDARD_DERIVATION_Y | own | 57 | TEMP_TIME_MIN_Y | own |
24 | STAT_MEAN_NUMBER_POINTS/SEGMENT | own | 58 | TEMP_TIME_MAX_X | own |
25 | STAT_DTW_Y | own | 59 | TEMP_TIME_MAX_Y | own |
26 | STAT_DTW_X | own | 60 | TEMP_TIME_VX_MAX | own |
27 | STAT_DTW_T | own | 61 | TEMP_TIME_VY_MAX | own |
28 | STAT_RELATION_VX_NEGATIVE | own | 62 | TEMP_TIME_VX_MIN | own |
29 | STAT_RELATION_VX_POSITIVE | own | 63 | TEMP_TIME_VY_MIN | own |
30 | STAT_RELATION_VY_NEGATIVE | own | 64 | TEMP_TIME_X_POS | own |
31 | STAT_RELATION_VY_POSITIVE | own | 65 | TEMP_TIME_Y_POS | own |
32 | STAT_MEDIAN_X | own | 66 | TEMP_TIME_X_NEG | own |
33 | STAT_MEDIAN_Y | own | 67 | TEMP_TIME_Y_NEG | own |
34 | STAT_MIN_VX | own |
Classifiers | Bayes Net | Naïve Bayes | KNN |
---|---|---|---|
Cross 10 | 66.55% | 81.23% | 73.25% |
Hold-out 66% | 57.32% | 78.39% | 69.34% |
AVG (Time in s.) | 0.9 | 0.03 | 0.01 |
# of features | 18/67 | 18/67 | 18/67 |
Classifiers | Bayes Net | Naïve Bayes | KNN |
---|---|---|---|
Cross 10 | 80.23% | 83.52% | 91.15% |
Hold-out 66% | 72.29% | 80.19% | 90.41% |
AVG (Time in s.) | 2.45 | 0.05 | <10−2 |
# of features | 33/67 | 33/67 | 33/67 |
Classifiers | Bayes Net | Naïve Bayes | KNN |
---|---|---|---|
Cross 10 | 58.31% | 69.13% | 72.24% |
Hold-out 66% | 36.04% | 65.44% | 69.02% |
AVG (Time in s.) | 0.96 | 0.03 | <10−2 |
# of features | 16/67 | 16/67 | 16/67 |
Classifiers | Bayes Net | Naïve Bayes | KNN |
---|---|---|---|
Cross 10 | 89.83% | 89.15% | 93.62% |
Hold-out 66% | 85.04% | 85.67% | 92.00% |
AVG (Time in s.) | 4.71 | 0.09 | 0.02 |
# of features | 67/67 | 67/67 | 67/67 |
Index | Parameter | Index | Parameter |
---|---|---|---|
7.964 | STAT_STANDARD_DERIVATION_Y | 3.089 | STAT_VERTICAL_SKEWNESS |
7.281 | GEO_HEIGHT | 3.058 | STAT_MAX_VY |
7.044 | GEO_SPHI | 2.788 | STAT_MIN_VY |
6.975 | GEO_WORD_HEIGHT Correlate with STAT_STANDARD_DERIVATION_Y (0.97) | 2.546 | STAT_Y_NUMBER_POINTS/SEGMENT |
6.709 | STAT_ASPECT_RATIO Correlate with STAT_STANDARD_DERIVATION_Y (−0.98) | 2.381 | TEMP_RELATION_TIME_GAP_ALL |
5.997 | STAT_INERTIAL_RATIO Correlate with STAT_STANDARD_DERIVATION_Y (−0.93) | 2.197 | STAT_MIN_VX |
5.894 | TEMP_TIME_Y_NEG | 2.102 | STAT_SEGMENTS Correlate with STAT_NUM_STROKES (1.0) |
5.831 | TEMP_TIME_Y_POS | 2.102 | STAT_NUM_STROKES |
5.590 | TEMP_T’IME_X_NEG | 2.099 | STAT_HORIZONTAL_SKEWNESS |
5.582 | STAT_MEDIAN_Y | 2.029 | STAT_X_NUMBER_POINTS/SEGMENT |
5.510 | GEO_SURFACE | 2.009 | STAT_MAX_VX |
5.447 | STAT_SPREADNESS | 1.717 | STAT_BALANCE_HORIZONTAL_EXTENSION |
5.402 | GEO_WIDTH | 1.579 | STAT_BALANCE_VERTICAL_EXTENSION |
5.254 | GEO_CENTRAL_POINT Correlate with TEMP_TIME_Y_POS (0.92) | 1.529 | STAT_RELATION_VX_NEGATIVE |
5.173 | GEO_EUCLID | 1.511 | STAT_RELATION_VY_POSITIVE Correlate with STAT_RELATION_VX_NEGATIVE (0.93) |
5.005 | STAT_POINTS | 1.458 | TEMP_RELATIVE_WRITING_DURATION |
4.919 | STAT_MEAN_NUMBER_POINTS/SEGMENT Correlate with STAT_POINTS (0.95) | 1.401 | STAT_RELATION_NVxz |
4.801 | TEMP_TIME_X_POS | 1.393 | STAT_STANDARD_DERIVATION_X |
4.743 | GEO_REGRESSION_ON_HORANGLE | 1.366 | STAT_DTW_X |
4.740 | GEO_REGRESSION_LOWER_HORANGLE Correlate with GEO_REGRESSION_ON_HORANGLE (0.99) | 1.350 | GEO_SLANT |
4.607 | TEMP_TIME Correlate with STAT_POINTS (0.95) | 1.328 | STAT_RELATION_VX_POSITIVE |
4.402 | STAT_DTW_Y Correlate with STAT_STANDARD_DERIVATION_Y (0.92) | 1.325 | STAT_RELATION_POINTS_SPEED Correlate with STAT_RELATION_VX_NEGATIVE (0.93) |
4.312 | GEO_HYP_ANGLE | 1.290 | STAT_RELATION_VY_NEGATIVE |
3.262 | GEO_REGRESSION_UPPER_HORANGLE | 1.280 | STAT_RELATION_NVyz |
Classifiers | Bayes Net | Naïve Bayes | KNN |
---|---|---|---|
Cross 10 | 88.47% | 89.68% | 95.38% |
Hold-out 66% | 85.46% | 89.36% | 94.42% |
AVG (Time in s.) | 2.81 | 0.11 | 0.02 |
# of features | 37/67 | 37/67 | 37/67 |
Index | Parameter | Index | Parameter |
---|---|---|---|
2.118 | STAT_STANDARD_DERIVATION_Y | 1.363 | GEO_SURFACE |
2.000 | STAT_ASPECT_RATIO | 1.331 | STAT_POINTS |
1.909 | GEO_WORD_WITH | 1.307 | STAT_MEAN_NUMBERS/SEGMENTS |
1.875 | STAT_INERTIAL_RATIO | 1.274 | GEO_ORIENTATION |
1.568 | GEO_SPHI | 1.274 | GEO_REGRESSION_UPPER_HORANGLE |
1.513 | TEMP_TIME_Y_NEG | 1.273 | GEO_EUCLID |
1.500 | GEO_WIDTH | 1.212 | TEMP_TIME |
1.472 | STAT_NUM_STROKES | 1.211 | STAT_VERTICAL_SKEWNESS |
1.472 | STAT_SEGMENTS | 1.104 | GEO_HYP_ANGLE |
1.461 | STAT_MEDIAN_Y | 1.057 | STAT_MAX_VY |
1.459 | TEMP_TIME_Y_POS | 0.955 | TEMP_TIME_VY_MIN |
1.452 | STAT_DTW_Y | 0.880 | TEMP_TIME_MAX_Y |
1.419 | GEO_REGRESSION_ON_HORANGLE | 0.859 | STAT_DWH |
1.412 | GEO_REGRESSION_LOWER_HORANGLE | 0.829 | STAT_MIN_VY |
1.384 | STAT_SPREADNESS | 0.785 | STAT_Y_NUMBER_POINTS/SEGMENT |
1.384 | TEMP_TIME_X_NEG | 0.760 | STAT_DTW_X |
1.373 | GEO_CENTRAL_POINT | 0.747 | TEMP_RELATION_TIME_GAP_ALL |
1.366 | TEMP_TIME_X_POS | 0.729 | STAT_RELATION_POINTS_SPEED |
Classifiers | Bayes Net | Naïve Bayes | KNN |
---|---|---|---|
Cross 10 | 86.89% | 87.86% | 92.55% |
Hold-out 66% | 82.61% | 86.09% | 91.68% |
AVG (Time in s.) | 2.63 | 0.05 | <10−2 |
# of features | 37/67 | 37/67 | 37/67 |
Classifiers | Bayes Net | Naïve Bayes | KNN |
---|---|---|---|
Cross 10 | 17.8% | 54.62% | 34.10% |
Hold-out 66% | 9.81% | 34.53% | 28.11% |
AVG (Time in s.) | 0.45 | 0.01 | <10−2 |
# of features | 18/67 | 18/67 | 18/67 |
Classifiers | Bayes Net | Naïve Bayes | KNN |
---|---|---|---|
Cross 10 | 16.54% | 35.32% | 10.06% |
Hold-out 66% | 10.75% | 20.38% | 10.38% |
AVG (Time in s.) | 0,44 | 0,02 | <10−2 |
# of features | 33/67 | 33/67 | 33/67 |
Classifiers | Bayes Net | Naïve Bayes | KNN |
---|---|---|---|
Cross 10 | 95.84% | 96.15% | 58.21% |
Hold-out 66% | 89.43% | 90.94% | 48.87% |
AVG (Time in s.) | 0.5 | <10−2 | <10−2 |
# of features | 16/67 | 16/67 | 16/67 |
Classifiers | Bayes Net | Naïve Bayes | KNN |
---|---|---|---|
Cross 10 | 98.65% | 93.53% | 36.03% |
Hold-out 66% | 94.53% | 80.57% | 31.70% |
AVG (Time in s.) | 1,41 | 0.05 | <10−2 |
# of features | 67/67 | 67/67 | 67/67 |
Index | Parameter | Index | Parameter |
---|---|---|---|
344,702,213 | TIME_MAX_Y | 9328 | MEAN_NUMBER_POINTS/SEGMENT |
344,309,419 | TIME_V_MAX | 8913 | RELATION_VX_MAX |
340,892,335 | TIME_VX_MAX | 6362 | WIDTH |
337,972,869 | TIME_VY_MAX | 5661 | REG_ANGLE |
329,873,423 | TIME_VY_MIN | 3451 | X_NUMBER_POINTS/SEGMENT |
328,062,080 | TIME_MAX_X | 3047 | RELATION_NVyz |
328,062,080 | TIME_MIN_X | 2832 | RELATIVE_WRITING_DURATION |
327,360,471 | TIME_MIN_Y | 2726 | RELATION_NVxz |
323,060,762 | TIME_VX_MIN | 2645 | HEIGHT |
18,411 | SLANT | 2597 | Y_NUMBER_POINTS/SEGMENT |
9381 | TIME | 2540 | POINT_ANGLE |
9348 | POINTS |
Classifiers | Bayes Net | Naïve Bayes | KNN |
---|---|---|---|
Cross 10 | 98.65% | 96.92% | 64.42% |
Hold-out 66% | 94.53% | 85.85% | 59.10% |
AVG (Time in s.) | 1.3 | 0.04 | <10−2 |
# of features | 59/67 | 59/67 | 59/67 |
Index | Parameter | Index | Parameter |
---|---|---|---|
7.4759 | TIME_MIN_Y | 7.4719 | TIME_VX_MIN |
7.4754 | TIME_MIN_X | 1.9984 | SLANT |
7.4754 | TIME_MAX_X | 1.9317 | RELATION_VX_MAX |
7.4744 | TIME_VX_MAX | 1.5304 | WIDTH |
7.4737 | TIME_MAX_Y | 1.526 | TIME |
7.4737 | TIME_V_MAX | 1.4895 | POINTS |
7.4732 | TIME_VY_MIN | 1.0784 | MEAN_NUMBER_POINTS/SEGMENT |
7.4727 | TIME_VY_MAX |
Classifiers | Bayes Net | Naïve Bayes | KNN |
---|---|---|---|
Cross 10 | 98.65% | 98.34% | 87.44% |
Hold-out 66% | 94.53% | 93.40% | 82.26% |
AVG (Time in s.) | 0.62 | 0.01 | <10−2 |
# of features | 22/67 | 22/67 | 22/67 |
Reference | Method | Dataset (# User) | Accuracy |
---|---|---|---|
[10] | Handwritten character and numerals using an Adaptive Radial Basis Function Network | Private (10) | 97% |
[42] | 39 statistical parameters classified by Bayes Net | Private (32) | 96.87%–100% |
This work | Fisher Score reduction method of 67 statistical, geometrical and temporal features, classified with KNN for password (set of characters) | Private (150) | 95.38% |
[46] | Codebook descriptors on text line level | IAM | 89.92% |
[52] | Gaussian mixture models on text line level | IAM (200) | 88.96% |
[55] | Sparse work frame using a traditional SVM on text line level | IAM | 90.28% |
This work | Fisher Score reduction method of 67 statistical, geometrical and temporal features, classified with Naïve Bayes for words | IAM (220) | 98.34% |
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Kutzner, T.; Pazmiño-Zapatier, C.F.; Gebhard, M.; Bönninger, I.; Plath, W.-D.; Travieso, C.M. Writer Identification Using Handwritten Cursive Texts and Single Character Words. Electronics 2019, 8, 391. https://doi.org/10.3390/electronics8040391
Kutzner T, Pazmiño-Zapatier CF, Gebhard M, Bönninger I, Plath W-D, Travieso CM. Writer Identification Using Handwritten Cursive Texts and Single Character Words. Electronics. 2019; 8(4):391. https://doi.org/10.3390/electronics8040391
Chicago/Turabian StyleKutzner, Tobias, Carlos F. Pazmiño-Zapatier, Matthias Gebhard, Ingrid Bönninger, Wolf-Dietrich Plath, and Carlos M. Travieso. 2019. "Writer Identification Using Handwritten Cursive Texts and Single Character Words" Electronics 8, no. 4: 391. https://doi.org/10.3390/electronics8040391
APA StyleKutzner, T., Pazmiño-Zapatier, C. F., Gebhard, M., Bönninger, I., Plath, W. -D., & Travieso, C. M. (2019). Writer Identification Using Handwritten Cursive Texts and Single Character Words. Electronics, 8(4), 391. https://doi.org/10.3390/electronics8040391