4.1. Patent Retrieval Results
Before performing technology network analysis, the patent retrieval results were analyzed to gain a preliminary overview of technological development. Machine-learning technologies in optical communications involved 407 four-digit IPCs, which indicated a wide scope of involvement.
Table 1 presents the quantity of the top 10 four-digit IPCs.
In
Table 1, the frequency denotes the number of patents that have appeared in the IPC classification; for example, 170 patents under G06F17 belonged to the IPC classification. The percentage represents the proportion accounted for by an IPC classification out of the total number of IPC classifications; for example, the 824 patents contained 3291 IPC classifications (one patent might have more than two IPC classifications), and G06F17 appeared 170 times, accounting for 5.17%. The results indicated that that the technologies were mostly concentrated under the classifications G06F17, G06F3, G06K9, H04L29, and A61B5 (
Table 1);
Appendix A displays the definition of each IPC code. According to the IPC definitions, G06F17 refers to data-processing methods, and G06F3 is the classification for data input and output devices (e.g., interface arrangements). G06K9 covers methods and devices for recognizing patterns, and H04L29 is communication control. A61B5 is the classification for measuring diagnostic purposes, among which G06F17 was more related to AI [
35] because it appeared the most frequently.
The analysis results of the top ten patentees revealed that Apple Inc., which focuses on AI and communication technology development, holds the greatest number of patents, followed by SAS Institute Inc., Intel Corporation, and International Business Machines Corporation, which are global leaders in intelligent software and services (
Table 2). The next patentee, Volcano Corporation, develops biological diagnostic systems with ultrahigh resolution and holds numerous patents to technologies that can be applied for medical diagnostics.
Furthermore, G06N in three-digit IPCs describes computer systems based on specific computational models, and G06N will be subdivided into four-digit IPCs, which is further explained different computational models. Therefore, the patents collected in this study reveal computational models (i.e., G06N). In this study, four-digit IPCs related to G06N appeared 131 times, and the distribution of the four-digit IPCs, including computer systems based on biological models (G06N3), computer systems using knowledge-based models (G06N5), subject matter not provided for in other groups of this subclass (G06N99), as shown in
Table 3.
4.2. Technology Network Analysis
The results of previous studies have suggested that technology co-classification analysis can be used to analyze the relationship between fields of technology [
30,
31]. Because patents may be subject to multiple patent classification codes, co-classification information can be used to define the relationship between technical fields, as shown in
Figure 1.
The numbers in the matrix in
Figure 1 represent the frequencies of different IPCs appearing in the same patent. A greater number represents a stronger technical connection between IPCs. For example, the IPCs of P
1 belong to IPC
1, IPC
2, IPC
3, and IPC
4. Because IPC
1, IPC
2, IPC
3, and IPC
4 simultaneously appear in P
1, a technical connection exists among IPC
1, IPC
2, IPC
3, and IPC
4. Moreover, regarding the relationship between IPC
1 and other technical fields, because IPC
1, IPC
2, IPC
3, and IPC
4 only appear simultaneously in P
1, the technical connections between IPC
1 and IPC
2, IPC
3, and IPC
4 were consistently “1”, as shown in the first column of the matrix. Because IPC
1 does not concurrently appear with IPC
5 and IPC
6 in patents, the technical connection of IPC
1 to IPC
5 and IPC
6 is “0”. This approach was adopted to gradually develop co-classification matrices of all technical fields. Therefore, the present study can facilitate plotting of the technology network map through the matrix.
Table 4 presents all parameters used in the technology network analysis.
Figure 2 presents the network model of key technologies, and the key IPCs are listed in
Table 5. The centrality performance index of the top 10 IPC codes in the frequency analysis (
Table 1) has been added to
Appendix B.
The overall network, as shown in
Table 4, was composed of 407 nodes and 206,194 links; a total of 407 IPCs were related to machine learning technologies in optical communications. The network density and compactness were 0.159 and 0.377, respectively, indicating that the network was sparsely distributed and that the interaction frequency between nodes was low. The average path length was 2.688, meaning that connecting a node to other nodes required nearly 3 steps, on average. The technology nodes depicted in
Figure 1 are crucial technology nodes linking more than 60 different technology nodes; that is, they represent the more critical technology fields in patents for machine-learning technologies in optical communications. Regarding degree centrality, eigenvector centrality, and structural hole, the IPCs G06F3, G06F17, H04W4, and H04L12 had more than two indices in the top five technology fields (
Table 5). In the overall IPC codes, the mean eigenvector centrality was 0.028, and a larger value indicated greater relative importance of a node. The structural hole effect was measured using the network constraint index. The mean of said index was 0.519, and a higher value indicated a lower structural hole effect. This suggested that key machine-learning technologies in optical communications are mainly concentrated in data input and output devices (e.g., interface arrangements; G06F3), data-processing methods (G06F17), wireless communication networks (H04W4), and the transmission of digital information (H04L12). The IPCs A61B5 and G06F19 only appeared in the structural hole column, which signified that these two technology fields belonged to different technology clusters and played a cross-disciplinary role. Thus, key technologies in the cross-disciplinary uses of machine learning in optical communications were for measurements for diagnostic purposes (A61B5) and information and communication technology (ICT) specially adapted for specific application fields (G06F19).
Additional insights available through network centrality metrics were as follows. Despite the slightly lower frequency of certain technology fields appearing in patents, in terms of the overall technology network, the connected technology nodes were more diverse and had an interdisciplinary nature in terms of applications. For example, although H04W4 and G06F19 are not listed in the top 10 technology fields with the highest frequency in
Table 1, H04W4 was observed to connect to more different nodes in terms of degree centrality. Eigenvector centrality considered whether H04W4 was connected to a node with relatively high centrality, whereas structural holes revealed that G06F19 and H04W4 occupied the main channel of network communication; that is, the degree to which the connection between technology clusters depended on G06F1 and H04W4 was revealed.