1. Introduction
Over the past several decades, economists have examined the critical role that industrial cluster plays in generating innovation performance by asking how and why industrial cluster could accelerate overall innovation performance within the cluster. Extant studies on this question have mainly focused on two aspects. The first stream of literature has focused on the industry cluster as an integrated object, and researchers following this tradition study competitive advantages of industry clustering. Porter has been one of the pioneers in studying the competitive edge of industry cluster at the national level [
1]. It is increasingly popular to combine the theory of new economic geography with the development of industry cluster to study the effects of geographical proximity on regional economic output [
2]. Another stream of research that investigates industry cluster’s impacts on innovation performance is centered on the roles of enterprises within the industrial cluster. An increasing number of studies have paid attention to the agglomeration effect of enterprise’s innovation performance at the micro-level [
3,
4]. Other studies have been conducted to examine the interaction between enterprises or even different actors within and beyond clusters [
5,
6,
7]. The research on agglomeration effects of industry cluster and enterprises’ innovation performance has generated an international literature and academic dialogue. In particular, it becomes highly relevant in China as China intensifies its industrial reconstructing. How to strengthen enterprises’ innovation capability in an increasingly globalized and industrialized network remains one of the main tasks for firms and government in China.
As a representative high-tech industry, the biopharmaceutical industry demonstrates features of agglomeration in technology and capital. Under the market economy with Chinese characteristics, biomedical industrial parks of different sizes have nurtured during the past ten years. Influenced by policy guidance and industry planning by local governments, biopharmaceutical industry has taken off rapidly. Examples include Zhangjiang Park of Shanghai and Optical Valley Park of Wuhan, both of which have begun to generate agglomeration effect. With more frequent joint technology R&D and industry–university–research cooperation, biomedicine enterprises developed close and reciprocal cooperation relationships, through signing technological collaboration contracts between and among biomedicine enterprises, universities, and research institutions. Whether this microcontract cooperative mechanism has promoted innovation performance of biomedicine enterprises has significant practical implications for these enterprises. An equal and reciprocal cooperative contract will likely lead to quality cooperation, and induce more technology sharing and spatial agglomeration of enterprises. Therefore, we analyze whether the agglomeration effects of biomedicine industry have produced impacts on innovation performance. The practical implications of this study are to inform enterprises of the technological development patterns and to facilitate evidence-based policy making on science and technology policy.
This study presents a novel perspective to the study of local industrial cluster in the following ways. First, we collected unique network data on Wuhan Biopharmaceutical Industrial Cluster Network, which was one of the first in the literature to focus on industrial innovation in central China region. Second, we are able to predict corporate innovation performance with network indicators, as informed by social capital theories. This study helps build a bridge between social science theories and innovation research.
The next section focuses on the technological connections among the enterprises in the industry cluster network. Then we discuss inherent logic connections among market transactions, agglomeration effects and innovation performance, based on which we formulate hypotheses that agglomeration effect measured by structure embeddedness of the core enterprises would promote firms’ technological innovation ability and economic performance. We then collect data from various sources, including websites of these enterprises, the Patent Star (China’s official patent website) and CNKI (China National Knowledge Infrastructure). After coding these data, we run social network analysis to measure the scale of cluster’s agglomeration effects. Then two empirical models are estimated to test the hypotheses and examine subnetwork structures. The results showed that betweenness centrality and clustering coefficient are positively related to enterprises’ technological innovation capability, while structural holes produce negative impacts. We conclude with policy implications that professional platforms should be established by core enterprises, and that government should provide a supportive environment to promote close cooperation among enterprises.
2. Literature Review
With interdependent enterprises, science and research institutions, intermediary organizations and consumers, industry cluster takes the shape of a closely connected network, which leads to agglomeration effects [
8]. Agglomeration effects of industry cluster presents a process of knowledge and information disseminating, human resource sharing, cooperative and compatible matchmaking among different enterprises. This leads to an external economy for enterprises to acquire higher productivity [
9]. Enterprises’ connection degree in industry cluster network can be measured by the degree of spillover in network resources and the strength of embeddedness in network structure.
The spillover of resources, including knowledge, information, personnel, capital, and technology in and between clusters, has become an emerging research topic [
10,
11]. Knowledge spillover was particularly emphasized by Powell [
10] and Simona [
12]. Knowledge connections were further divided into component knowledge and architectural knowledge to study their differentiated effects on cluster knowledge spillover and competitive edge [
13]. Financial transaction network among film businesses in Potsdam, Germany was examined [
14]. Formal and informal managerial connections between cluster enterprises were utilized to study the effects of Toronto mutual fund cluster network connections on enterprises’ innovation performance [
15]. It is argued that information advantage enjoyed by core enterprises enhances knowledge spillover in the cluster [
16].
Recently, researchers started to evaluate cluster network’s agglomeration degree through its structural embeddedness [
17,
18,
19]. Qian took structural hole and centrality as two major indicators to measure the adjustment effects, while paying close attention to effects in economic output brought by enterprises’ position in Shenzhen ICT cluster network [
20]. Research on Chengdu furniture manufacturing in China showed that cluster network’s structural embeddedness generated pronounced impacts on the enterprise’s outputs, finding positive effects of degree centrality and betweenness centrality [
19]. Another study on the textile industry cluster with economic performance demonstrated that science and technology research institutions and leading businesses play a key role in cluster network. At the same time, a good network position would enhance the innovation capability of an enterprise, facilitating multiple outputs [
21]. Resource acquisition ability of enterprises in technological knowledge was taken into consideration to build a more comprehensive model in testing network centrality and intermediary effect on innovation improvement [
22,
23,
24,
25]. Further analysis investigated the impact of clustering coefficient on output efficiency of equipment manufacturing in Northeast China [
26].
In this paper, we combine technological spillover with cluster’s network structure embeddedness. The scale of agglomeration effects can be measured by the degree of network structural embeddedness, operationalized with the number of technological cooperation contracts and agreements among enterprises. At the same time, economic attributes and self-learning property are highlighted in our paper, while emphasis is placed on interactions among organizations in the cluster network [
27].
3. Theoretical Framework and Hypotheses
Network structure is determined by enterprises’ relative positions, the functions they possess, and the types of links between these enterprises and other actors. The features of network structure have significant impact on agglomeration, enhancing the firms’ economic outputs.
3.1. Agglomeration Effect of Industry Cluster
Relationships existed in a cluster network can be used as a valuable resource [
28] and innovative activities usually accompany efficient resource collection and accurate processing. Effective absorption and efficient use of external resources may give enterprise more opportunities to survive in intense market competition [
29]. Through mechanisms to enhance trust-building, information exchange, and joint problem-solving, industry cluster regulates enterprises’ market operation and expectations [
30]. Market transactions, agglomeration effects and innovation performance are closely correlated, when firms are clustered in a network.
First of all, market transactions promote agglomeration effects within the industry cluster. After repeated market trading operations, each enterprise in the cluster network would observe the cooperation contracts and respect the interests of other firms. As a result, ad hoc cooperation or speculative trading is replaced by long-term cooperation based on mutual benefits [
31,
32]. The so-called “habitual” trust relationship becomes a credible commitment for each party. Any actions that break the contract will suffer immense loss in capital financing, information access and reputation, impeding further cooperation within the cluster. Over the long run, this will turn an enterprise into an active cooperator who prefers a collaborative strategy. Under the environment with high mutual trust, incidences to break the contract will drop, while transaction costs and fees paid by enterprises will be minimized.
Secondly, a spillover cluster environment is facilitated by agglomeration effects. A higher level of agglomeration effect means more frequent exchange of knowledge, information, capital, and personnel. All these will have great impacts on enterprises’ innovation performance through network structural embeddedness [
33]. General equilibrium model of moral hazard problem confronted by heterogeneous groups in market transaction demonstrated the sustainable growth effect in outputs brought by trustful environment [
34,
35].
3.2. Betweenness Centrality and Innovation Performance
Betweenness centrality for individual enterprise refers to proportion of the shortest paths (or geodesics) that must be passed through, and it accounts for all the shortest paths in the cluster network. Active and efficient involvement in information transmission leads to a higher betweenness centrality score for an enterprise. An enterprise may enjoy a dominant position if it possesses multiple shortest information paths. The formula for betweenness centrality can be shown as [
36]
of which
means the number of the shortest paths passed through enterprise i in all paths between enterprise j and k.
represents the total number of paths between enterprise j and k.
Hypothesis 1: Betweenness centrality of the enterprise is positively related to its level of technological innovation.
Hypothesis 2: Betweenness centrality of the enterprise is positively related to its economic performance.
3.3. Structural Holes and Innovation Performance
Structural holes will be developed if two adjacent market actors (both are indirectly connected through a third enterprise (i) are not directly connected. Heterogeneous connections are possessed by enterprise i who acts as a “bridge” that control diverse information exchange paths. This would provide enterprise i with an advantage in facilitating industrial application of their hi-tech research findings. However, for the two actors who have to connect with enterprise i in the industry cluster, structural holes will likely constrain technological knowledge acquisition and other market activities. We focus on the aggregate constraint of structural hole on enterprise’s innovation outputs. So the constraint degree of enterprise i affected by enterprise j can be demonstrated as [
37]
P
iq presents the proportion of the paths that will get through or is affected by enterprise q, accounting for the number of connections to enterprises within the cluster. m
qj means the marginal strength of the connection from enterprise j to enterprise q or the value of the connection from enterprise j to enterprise q plus the highest values in all the connections that are linked with enterprise j. So the formula of enterprise i in limitation degree can be described as
Huge cost will likely be incurred in maintaining a healthy relationship with enterprises or other market actors for a pharmaceutical enterprise. Under this condition, redundant links must be cut off and resources in management must be put into key connections to achieve efficient output both for the enterprise and the community. A higher structural hole constraint would result in a lower degree of freedom for the enterprise to seek opportunities for cooperation.
Hypothesis 3: Structural holes in the constraint are negatively related to an enterprise’s technological innovation.
Hypothesis 4: Structural holes in the constraint are negatively related to an enterprise’s economic performance.
3.4. Clustering Coefficient and Innovation Performance
Clustering coefficient measures the quality or frequency of one enterprise’s connection with its neighboring actors. The definition is [
38]
C
i is the clustering coefficient of enterprise i, of which k
i represents the degree centrality of enterprise i and e
i means the number of links between its cooperators. Further, CC
t, the clustering coefficient of the cluster network can be defined as
High quality links and frequent cooperation among enterprises located in a cluster network enhance their relationship, trust, and stability and improve efficiency in resource sharing, driving innovative activities.
Hypothesis 5: Clustering coefficient of an enterprise is positively related to its technological innovation.
Hypothesis 6: Clustering coefficient of an enterprise is positively related to its economic performance.
4. Research Design
4.1. Network Boundary
Establishing network boundary is key in applying social network analysis to industrial cluster study, the process of which includes defining the cluster boundary, cluster actor selection, and measurement of network links.
4.1.1. Why Study Biopharmaceutical Industry Cluster in Wuhan
With a comparative advantage in land, labor and transportation costs and favorable industrial policies, Wuhan has become an emerging city in biopharmaceutical industry. More than 500 enterprises have been established within six industry parks in Wuhan East Lake High-tech Zone, with a total corporate income over 500 billion RMB yuan, the third place in China. Since the year of 2008, cooperation between enterprises, universities and research institutions has become more frequent and agglomeration effects have emerged. Biopharmaceutical industry features high degree of agglomeration in capital and high-tech talents, and agglomeration effects of enterprise reflects cluster level environmental change.
4.1.2. Defining Network Actors and Measuring Network Connections
The roles the enterprises play are embedded in their network relationships in the cluster network. In this study, we focus on connections between core biopharmaceutical enterprises and other enterprises which show similar features with core enterprises, universities and research institutions. The network relationships capture both industry–university–research cooperation and interfirm cooperation within the Wuhan East Lake High-Tech Zone.
To measure the cluster network, we focus on technological cooperation that spans new technological innovation, patent application and product development. All these can be clearly measured by inter-enterprise contracts or agreements, which were collected by searching enterprises’ websites, the Patent Star, China’s official patent website and CNKI. Contracts or cooperative texts are available online, and numbers of cooperation and duration of such cooperation are coded. For example, if a contract or a cooperative agreement was signed between enterprises i and j, then, we assume that an equal, reciprocal and undirected link was established between them.
4.1.3. The Process of Identifying Core Enterprise Network
As discussed above, technological cooperation data of a hundred and one biopharmaceutical enterprises within Wuhan East Lake High-Tech Zone were collected and coded based on technical cooperation contracts among these enterprises from 2011 to 2015. Then we conduct two rounds of interviews (one conducted in 2 June 2015 and another in 2 December 2015) with biopharmaceutical industry park leaders of Wuhan East Lake High-Tech Zone and senior managers of nine leading profitmaking enterprises, including chemical drug enterprises; traditional Chinese medicine enterprises; and biological product enterprises. The network data for forty core enterprises are collected from the interviews. With the interview data, we are able to capture the core network of the biopharmaceutical industry cluster. See
Table 1 for descriptions of the number of links for the core enterprises.
A visualization of the core enterprises in the industrial cluster was performed with Pajek4.0. Core enterprises are located in the center of structure graph, while universities, research institutions, and enterprises of the same type are distributed in the periphery, as show in
Figure 1.
4.2. Variables
4.2.1. Research Method
Betweenness centrality, constraint of structural holes and clustering coefficient of structure embeddedness in agglomeration effects are calculated with Pajek4.0. A linear regression model is employed to test the effect of structural embeddedness on its innovation performance. Firstly, only control variables are included in the regression model, then, explanatory variables are included to test their effects on patent outputs. Secondly, explanatory variables are added to test the effects on profits of new products from 2012 to 2015, separately.
4.2.2. Description of Explanatory Variables
Table 2 provides descriptive statistics for the explanatory variables used in this study.
4.2.3. Dependent Variables
Innovation performance outputs can be measured in different ways, with indicators on technological, financial, and management performance. We select patent output to evaluate innovation capability of enterprise based on previous studies. Patents authorized from 2011 to 2015 are archived and coded. R & D investment, average annual profits in new products and net profits can be used to evaluate economic performance at the micro-level. Here we choose average annual profits in new products as our measure. All these profits data are collected through our interview with administrative staff in the biopharmaceutical industry park.
4.2.4. Control Variables
Capital investment in innovation and management has impacts on the outputs of new products. Capital investment represents comprehensive strengths and strategic management in development. Although biopharmaceutical industry possesses high degree of agglomeration in capital and high-tech talents, it takes a long period for technology development and patent application. Thus time capital also plays a key role in innovation. The registered capital assets and establishment of joint stock enterprise time span were taken into consideration as two control variables. See
Table 3 for detailed information for all the variables.
5. Empirical Analysis
5.1. Pearson Correlation Result
Pearson correlation coefficient was calculated between explanatory variables and dependent variables. From the results, Pearson correlation values between betweenness centrality, constraint of structural holes, clustering coefficient and patent outputs are 0.828, −0.556, and 0.795, which are all statistically significant at the 0.01 level. Pearson correlation values between betweenness centrality, structural holes, clustering coefficient and economic performance from 2012 to 2015 are also very strong. The results are presented in
Table 4.
5.2. Testing the Hypotheses
5.2.1. OLS Regression Analysis
We employ ordinary least square (OLS) regressions to test our hypotheses. We use OLS due the fact that our dependent variable is continuous and that our data is cross-sectional. Two empirical models are estimated to test the hypotheses:
Model 1: P = 1 + 2BC + 3SH + 4CC + 5RT + 6RC +
Model 2: Lnyp(t) = 1 + 2 BC + 3 SH + 4 CC + 5RT + 6RC +
Four submodels are estimated with explanatory variables BC, SH, CC and control variables RT and RC based on model 1 with linear regression method.
Explanatory variables BC, BH and CC are included separately in submodels 2 to 4 based on submodel 1, which added additional control variables. Regression results show relatively large and positive coefficients for BC and CC, which are 208.383 and 391.705, F(BC) = 34.02, P(BC) < 0.01; F(CC) = 27.29, P(CC)< 0.01). Regression coefficient is negative and F(SH) = 9.11, P(SH)< 0.01 in S-M3, which are also significant. A higher goodness-of-fit in Model 1(Adj-R
2 = 0.77) showed a better explanation for the effect of agglomeration effects on innovation capacity improvement. From all of the above, we can draw a conclusion that agglomeration effects have accelerated innovation capacity to some degree. Hypotheses 1, 3, and 5 have been supported.
Table 5 reports results for the regression models.
Regression analysis of explanatory variables on innovation capacity tests whether stronger ability in technological knowledge learning and information processing has been developed in the cluster network. We study whether technological knowledge acquisition and processing have been applied in industrialization, and whether they have improved economic performance.
Regression results of S-M 5 to 7 showed BC and CC didn’t promote profits of enterprise and constraint of SH has negative relationship with profit achievement. Negative regression coefficient of constraint of SH is present in 2012 to 2015, of which p(2012) < 0.05, p(2013) < 0.05, p(2014) < 0.01, p(2015) < 0.05, which is also statistically significant. These results are supportive for hypothesis 4, but not for hypotheses 2 and 6.
5.2.2. Multicollinearity Test
After testing our hypotheses, we have conducted multicollinearity test with STATA 13. The results showed that VIF values for all explanatory variables are less than 10 and the average VIF value is 3.64. The VIF value for BC, SH, CC, RT, RC is 7.48, 1.9, 6.09, 1.29, and 1.43. Thus we can conclude that we do not observe serious multicollinearity among the independent variables.
5.2.3. Heteroscedasticity Test and Robust Regression Analysis
We conducted heteroscedasticity tests, both White test and BP test, for estimated model 1 and estimated model 2. In estimated model 1, p-value is 0.24 > 0.05 in White test, and in BP test the p-value is 0.17 > 0.05. In estimated model 2, with Lnyp (2015) as the explained variable, the results showed that p-value is 0.01 < 0.05 in White test and 0.00 < 0.05 in BP test. From the test above we find no evidence of heteroscedasticity in model 1 and that the OLS regression analysis is efficient. For the estimated model 2, we do find heteroscedasticity and that further robust regression analysis is needed.
The robust regression analysis also shows that BC and CC didn’t promote enterprise’ profits in 2015, and also the same to the year of 2012/2013/2014 we have tested. But the constraint of SH has negative relationship with profit achievement in 2015 which presents the same results in
Table 6. Negative regression coefficient of constraint of SH in 2015 is
p(2015) < 0.05, which means that it is statistically significant. So from the results of estimated model 1 and model 2, we can conclude that the agglomeration effects of Wuhan biopharmaceutical industry cluster, measured by network structural embeddedness dimensions of betweenness centrality, constraint of structure holes, and clustering coefficient in our paper, emerged directly and had positive effects on enterprises’ innovation as represented by patents, while the agglomeration effects didn’t facilitate significant effects on profits. In other words, the technological cooperation among the cluster have produced significant innovation achievements, in generating patents or other technological improvements, while not directly promoted enterprise’s profits, which makes practical sense. Because technical cooperation is a long-term process and may take a long time to bring profits for enterprise and more importantly, enterprise’s profits are mainly determined by the operation strategy and management methods of the company. We will analyze the cluster network in the following to better understand the overall cooperation status of the cluster. This is reported in
Table 6.
5.3. Cluster Network Structure Analysis
Betweenness centrality and clustering coefficient would produce positive impacts on innovation outputs based on regression analysis between structure embeddedness and patent outputs. Every one percent increase in strength of betweenness centrality and clustering coefficient would lead to an increase of 95.7% and 215.6%, respectively, in innovation capability. This means paths are controlled by some core enterprises where information and resources are exchanged. However, positive effects are ameliorated due to the constraint of structure holes.
Subnetwork structures of the core industry cluster (
Figure 2,
Figure 3 and
Figure 4) are formulated to make comparative analysis that gives us an overall perspective on the industry cluster. Seventy-five universities, 41 research institutions, and 137 enterprises have established cooperative relationships with core enterprises respectively, which developed 124, 159, and 218 edges, respectively, in three different subnetwork structures.
From
Table 7, we can observe that interactive links among different actors are more close and intimate in subnetwork structure of core enterprises (40) and other enterprises (
Figure 4), thus producing the largest average betweenness centrality and clustering coefficient (
Figure 4) compared to overall subnetwork (
Figure 2 and
Figure 3). E & R, which represents network of enterprise and research institution (
Figure 3), shows the relative low degree centrality of network and weak relationship between core enterprises and research institutions. Overall average clustering coefficient is smaller in both
Figure 2 and
Figure 3, and cooperation between core enterprises and research institutions are loose as they are distributed in small groups. They are more likely to connect with research institutions in small groups or individually instead of being linked in large groups. This would not intensify the connection for the overall network in resource exchange. This has important implications for choosing technical cooperators. Because small group connections may benefit individuals as enterprises can maintain sustainable cooperation with lower cost. In
Figure 4, interactive cooperation between enterprises is highly intensive, which indicates core enterprises enjoy a stronger relationship in the cluster network. These results are is reported in
Table 7,
Table 8 and
Table 9.
Next, we focus on
Figure 2, some enterprises and universities, for example, Humanwell Healthcare Group Co. Ltd. (C1), WuXi App Tec (C3), Ma yinglong Pharm (C20), Wuhan University (D1), and Huazhong University of Science and Technology (D2) possessed larger degree centrality, betweenness centrality and closeness centrality while smaller aggregate constraint of structure holes. This also contributes to a high degree centrality and betweenness centrality of network structure between enterprise and university (
Figure 2).
Further, individual enterprise network structure graph is extracted from cluster network structure graph of core enterprises (40) for a micro-level analysis. In
Figure 5, many SMEs have established relationship with C1 and R28 while they experience a lack of cooperation among themselves. This point has also been demonstrated in
Figure 3 and
Figure 4. Therefore, C1 and R28 both control knowledge flow paths and act as intermediary actors. Financial and time costs have been spent in market transaction for SMEs. Therefore, a better business and policy environment is needed to ensure cooperation between SMEs.
6. Discussion and Conclusions
In this study, we examine whether technological spillover demonstrated by embeddedness have impacts on enterprises’ outputs. We measured the degree of network structure embeddedness through collecting technological cooperation texts and calculating the scale of technological spillover with network analysis. Further, we have analyzed the characteristics of three different subnetworks. The results showed that with the fast development of biopharmaceutical industry in Wuhan East Lake High-Tech Zone, the positive spillover effects will become more prominent if more frequent and efficient cooperation developed not only among core enterprises, but also between SMEs and other types of network actors. The overall network structure can be optimized by strengthening intermediary function of core enterprises and institutions, by tightening exchanges among SMEs, and by removing constraint of core enterprises. The following suggestions may be helpful for decision makers both from the supply-side (government) and demand-side (enterprises) in promoting cluster development, and offer a reference for regions sharing similar conditions in rapid industrialization.
First of all, this article’s findings confirm intermediary function of some core enterprises and research institutions in innovation diffusion. Wuhan, a central city in Central Region of China, with rich educational resources and research institutions, policy-makers (government) should encourage building multistakeholder technical service platforms, particularly resource integration and information transmission platforms, including Instruments Sharing Platform and Key Technology Service Platform. The involvement of enterprises in these platforms are essential.
Secondly, we learned from the analysis that cooperation among SMEs and SMEs with other types of network actors need to be strengthened. Regression analysis between structure embeddedness and economic performance demonstrated agglomeration effects on economic performance are not consistent with what we predicted. This suggests that policies should be provided to SMEs in the areas of capital, knowledge and technology and a better institutional environment is needed.
Finally, regression analysis between structure embeddedness and innovation outputs and economic performance demonstrated that aggregate constraint of structure holes generate negative impacts on core enterprises, and degree centrality play a limited role in improving outputs. This means core enterprises should decrease their redundant links with other actors. Policy-makers should slow down their efforts in supporting the “key controlled enterprise”.
This study provides insights regarding paths for future research. First, future studies can collect data on innovation cluster from other regions in China, so that scholars are able to generate conclusions that are generalizable across geographic locations. Second, this study mainly used patents as the measure for innovation performance, future studies should also focus on more nuanced measurement of innovation performance.