Both first-order and second-order models were used in this study. The first-order model included one endogenous construct—PERF (business performance)—and seven exogenous constructs—TECH (big data analytics technology capability); MAN (big data analytics management capability); HUM (big data analytics human capability); INFR (big data analytics infrastructure capability); SUST (sustainability); DIGIT (digitalisation); and DSR (digital sustainability reporting). BDAC (Big data analytics capability) and DIGITTY (digitainability), were conceptualised as higher-order constructs within the second-order model.
5.1. Model 1
First-order model.
Figure 3 presents the first-order model. The model contained one reflective and eight formative constructs, and the following assessment criteria for validity and reliability were used to evaluate the model: outer weights and loadings (size and significance), composite reliability, average variance extracted (AVE; for convergent validity), and discriminant validity. R-squared and variance inflation factor (VIF) tests were performed, following [
11] guidelines to evaluate validity for the formative constructs. The first-order model was constructed and estimated using SmartPLS 4. The impact of the control variables on the individual level—job position, gender, work area [
13]—and on the organisational level—sector [
9,
75], size [
9,
76], age [
9], iXBRL reporting experience [
77], and year of the first sustainability reporting [
1]—on the dependent variable, business performance, was examined using dummy variables.
Outer weights and loadings. Because the majority of the constructs are formative, outer weights were examined to assess whether the indicators have relative significance in their association with related constructs. For the indicators with the lowest outer weights, outer loadings were also checked. According to [
90], formative constructs contain some indicators with low weights. They suggested that these could be retained if justified by the researcher. Thus, the author examined the statements and wording of the indicators that showed weights and loadings below the 0.5 threshold, and the following items were removed from the model: DIGIT8, 9, 11; SUST5, 6, 7; HUM1, 2, 11, 12; INFR4, 7, 8; MNG2, 3, 8, 10, 13; TECH1, 2, 4, 5, 8; PERF 2, 4, 6. Some were retained because of their critical importance for their distinct contribution to the model. Furthermore, the author explains the possible reasons for poor indicators and the logic behind removing the above-listed indicators (see the questionnaire in
Table S1).
Omitting DIGIT8, 9, and 11 highlights the irrelevance of aspects related to the digitalisation of sustainability accounting processes and the non-appearance of their integration with financial accounting. This may be because the digitalisation policies of large Latvian organisations currently do not incorporate DSR-related accounting processes due to the emerging nature of the phenomenon and a lack of expertise in its underlying methodologies.
SUST 5, 6, and 7 illustrate the corporate sustainability framework from a compliance perspective, aligning it with regulations and best-practice standards. The low statistical significance of these measurements for sustainability construct may indicate that adherence to best-practice standards in environmental and social responsibility is not prioritised in Latvian organisations. This may be attributed to the low demand for such sustainable practices within the Latvian market and the lack of financial incentives to support their implementation.
HUM 1, 2, 11, 12, and MNG 2, 3 measurements are deleted, as they statistically do not contribute to the BDA human capability. Content-wise, they mainly refer to the data analytics unit’s staff, while other measurements that contribute to the construct relate to the analytical competencies of other organisational staff. Such discrepancy may be due to the fact that large Latvian organisations often do not have separate data analytics units within their organisational structures.
The indicators MNG 8, 13, and INFR4 relate to the understanding and control of organisational sustainability. The lack of their statistical contribution to the respective constructs may indicate that executives in Latvia are not yet prepared to implement sustainability frameworks at a level comparable to other areas of business operations. Thus, they are eliminated from the model.
The low significance of INFR 7 and 8, which focus on operational and sustainability risk management systems, may stem from the fact that many Latvian organisations lack well-integrated risk management systems, as it was revealed by the qualitative field study by [
80]. Thus, INFR 7 and 8 do not contribute to BDA infrastructure capability and are omitted from further analysis of the model.
The poor construct fit of TECH 1 suggests a limited understanding among responders of XBRL-based systems. Hence, it is excluded. TECH2, 4, 5, and 8 also show statistical insignificance, indicating that they do not contribute to measuring the unity of the information systems as the key success factor of the BDA technology capability construct.
Notably, the exclusion of the reverse-coded item PERF2—“Management evaluates financial results separately from environmental and social results”—due to its lack of statistical significance in measuring business performance, suggests that executives in Latvia do not perceive environmental and social factors as integral to the business performance construct. This aligns with previous empirical findings from the Latvian market [
80]. One possible explanation is that these factors are not seen as directly contributing to short-term financial outcomes, which are typically prioritised by management in large organisations. As a result, such indicators may not be considered valuable for assessing the dependent variable. PERF 4, 5, and 6, which attributed to the ROI, ROA, and ROE ratios, were eliminated as these measurements of business performance, in addition to poor weights and loadings, exhibited extremely high collinearity. This may be because the respondents were primarily not professionals with financial backgrounds and were, therefore, unable to differentiate between these ratios. However, measurement referencing overall financial performance was retained in the model.
DSR was examined as a reflective construct based on the loading with the same threshold, resulting in the elimination of DSR1, 2, and 7. Indicators that measure readiness for iXBRL reporting in the DSR construct, DSR8 (0.443) and DSR9 (0.464), were rounded to the threshold value of 0.5 and kept in the model. The bootstrapping algorithm was then applied to define statistical significance. The results showed that all loadings were statistically significant (p-value 0.000).
Collinearity statistics. Next, the variance inflation factor (VIF) was examined for formative constructs. A VIF greater than 10 suggests high multicollinearity. However, a more conservative approach to formative constructs involves applying a threshold of 5. All values in the first-order model were below 5, indicating the absence of multicollinearity.
Cronbach’s alpha. As shown in
Table 3, all results for the constructs were above the minimum required threshold of 0.7 for Cronbach’s alpha. Therefore, the scale of measurement reliability in the model was internally consistent.
Convergent validity. The convergent validity of the constructs was evaluated using AVE, considering a threshold of 0.5.
Table 3 shows that the digitalisation (DIGIT) value of 0.456 and DSR value of 0.482 are below the threshold. However, they could be rounded to 0.5 and accepted as valid in terms of convergent validity. The composite reliability values were higher than the minimum threshold of 0.7 Thus, according to [
91], even if the AVE is less than 0.5, but composite reliability is acceptable, the convergent validity of the construct is considered acceptable as well. Therefore, all constructs in the model were treated as valid in terms of convergent validity.
Composite reliability. All constructs are above 0.8, which is above the threshold of 0.7, indicating sufficient composite reliability.
Discriminant validity. The heterotrait–monotrait ratio of correlations (HTMT) was used to assess discriminant validity [
92,
93]. The threshold <0.85 should be applied for conceptually distinct constructs, for example, in
Table 4 for BDA management capability and DSR (0.419) or digitalisation and business performance (0.657). However, for conceptually similar constructs, the threshold should be <0.9. Only MNG-INFR had a value of 0.878 and was considered to be below the limit of 0.9 applied to conceptually similar constructs. The HTMT values of all other constructs range from 0.411 for INFR-DSR to 0.819 for MNG-DIGIT, which were below the lower limit of 0.85. Thus, the validity of the measured constructs’ differentiability was confirmed.
The results support the appropriateness of all indicators for measuring their respective constructs and suggest that the measurements of the first-order model are valid for further analysis.
Validity of the first-order model. Following [
11], the validity of the endogenous variables was evaluated based on the R-squared values. The R-squared criterion for SEM measures the predictive accuracy of the model [
94]. The impact ranges between 0 and 1. The R-squared values of 0.25, 0.50, and 0.75 for endogenous variables indicate low, moderate, and high predictive capacity, respectively. As shown in
Table 5, the adjusted R-squared demonstrate that all indicators provide valid representations of their respective constructs, with the highest predictive accuracy for business performance (0.656).
First-order model results.
Table 6 presents the path coefficient results for the first-order model. Using these effect sizes, users can ascertain whether the effects indicated by path coefficients are weak, medium, or strong. The recommended values are 0.02, 0.15, and 0.35, respectively [
95].
The individual path coefficient data analyses revealed a substantial relationship between MNG and DIGIT (0.352), HUM and SUST (0.315), and TECH and DSR (0.377).
The insignificant values of the path coefficients between TECH and DIGIT (0.007), and DSR and DIGIT (−020) show that these relationships do not affect each other. Thus, BDA technology capability does not affect DSR. DSR has no impact on digitalisation; this is also supported by the poor p-value (0.825), proving the absence of a related effect. Thus, H6.2 is rejected.
The relationships between SUST and PERF and MNG and PERF had negative path coefficients and p-values higher than 0.05, indicating no connection between these constructs. All other constructs showed medium positive cause–effect relationships.
According to the literature, a
p-value of 0.05 and lower indicates that the predictor variable relates to changes in the response variable.
Table 6 shows the strong predictive power of the relationship between HUM, MNG, and DIGIT; TECH with DSR and SUST; and HUM with SUST. None of the constructs in the first-order model were below the 0.05
p-value threshold; thus, none had predictive power for business performance. However, the
p-value for TECH was 0.058 (with a medium path coefficient of 0.225), which is close to the significance level and could potentially have some effect. Therefore, further investigation and discussion are required.
Finally, only sector (p = 0.017) and size (p = 0.044) significantly influenced business performance. Therefore, they were retained for control in the second-order model. Based on evaluating the path coefficients for control in the second-order model, the author also retained the variables with high path coefficients: work area (0.691) and iXBRL (0.359). Considering poor path coefficients and p-values, as well as Table’s 1 examination, other control variables (gender, position, age and CSRD) were excluded from further analysis.
5.2. Model 2
Second-order model. PLS enables the investigation of models with high abstraction levels. After the fit has been performed within analyses of the first-order model, a higher-order model using the hierarchical components approach was designed to test BDAC and Digitainability within the model relationships. The second-order constructs, BDAC and Digitainability, were directly measured using observed variables from the first-order constructs. First, in the second-order model, only BDAC, as a formative–formative general construct represented by the manifest constructs—TECH, HUM, MAN, and INFR—was introduced in the analysis to test the direct relationships of BDAC with sustainability and digitalisation (
Figure 4). Digitainability was then added as a formative–reflective second-order construct in the model, represented by sustainability and digitalisation as its manifest variables (
Figure 5).
The higher-order model results from the PLS analysis are summarised in
Table 7. Based on the model shown in
Figure 4, the author tested the hypotheses related to sustainability and digitalisation. Both SUST (0.871) and DIGIT (0.795) had excellent explained variance (R-squared) with high predictive capacity.
BDAC has a significantly positive effect on sustainability (path coef. 0.664;
p-value 0.000), thus supporting H3.1. These empirical results align with the SLR findings of [
54], who emphasised data openness and accountability in management control systems, staff analytical collaboration, centralised information sharing, and knowledge management as key factors in promoting sustainability. BDAC has also been proven to have a substantial positive impact on digitalisation (path coef. 0.739;
p-value = 0.000), thus supporting H3.2. The hypotheses testing showed that higher BDAC improves organisational sustainability and digitalisation. The key findings of [
54] also indicate that achieving all four BDAC elements for enhanced sustainability goals requires integrating digital tools and technologies, thereby fostering organisational digitalisation.
DSR has a moderately positive effect on sustainability (path. coef. 0.218; p-value 0.016). Thus, H6.1 is supported: organisations that introduce DSR positively influence their sustainability. However, a high p-value (0.643) in the second-order model for DSR-DIGIT supports the conclusion from the first-order model that H6.2 is rejected.
Neither sustainability nor digitalisation were found to influence business performance, leading to the rejection of H5.1. Aligned with the empirical qualitative research findings on the key challenges of preparing digital sustainability reports by large organisations in Latvia by [
80], the rejection of sustainability as an influential aspect of business performance may be attributed to the fact that sustainability is not treated as a core component of the business model but rather as a stand-alone practice within the organisation. H5.2 is also rejected. Since digitalisation does not demonstrate a relation to business performance, the author argues that Industry 4.0 has become an essential standard for modern market players rather than a source of competitive advantage. This suggests that digitalisation alone is insufficient to enhance competitiveness and profitability unless complemented by strong BDAC. This is consistent with the findings reported by [
96], who suggests that to achieve superior business performance results, organisations should move towards the integration of Industry 5.0.
5.3. Model 3
The DIGGITY construct was added to the second-order model, as presented in
Figure 5. Following the recommendations of Sarstedt et al. (2022) [
97], the author chose the final structural model from the PLS analysis with the best fit based on the Bayesian information criterion (BIC) results, with the aim of minimising BIC. In Model 3, the independent (BDAC) and dependent (PERF) BIC values were substantially away from zero (
Table 8). Thus, Model 3 has a better fit than Model 2 for testing the other hypotheses in further analysis.
The Model 3 results revealed that BDAC has a significant positive effect on business performance (path coef. = 0.391;
p-value = 0.003), and the demonstrated validity of the construct was substantially high (R-squared = 0.997). Thus, H2 is supported. This aligns with [
98], whose meta-analysis of 34 selected articles and over 60,000 observations provides strong evidence that enhanced sustainability reporting significantly improves economic performance, highlighting its crucial role in corporate success.
Additionally, the author compares the strong impact of BDAC on business performance with the results of the first-order model (
Table 6), where none of the BDAC formative constructs (TECH, MNG, HUM, and INFR), when presented as independent variables, showed an effect on business performance. This indicates that BDAC positively impacts business performance (PERF) only when all first-order constructs are collectively present and accounted for in the model. TECH, MNG, HUM, and INFR were identified as key success elements of BDAC. Thus, it supports H1.
As shown in
Table 7, DIGITTY is moderately strong (R-squared = 673), reinforcing its theoretical significance as an assemblage of sustainability and digitalisation within this model [
63] in a reflective manner. As shown in
Figure 5, Model 3 demonstrates a strong positive BDAC–DIGITTY relationship (path coef. 0.760;
p-value = 0.000). Accordingly, H3 is supported.
Further, the author tests the relationship between digitainability and business performance. The Model 3 results could not establish this (path coef. 0.0.003; p-value = 0.983); thus, H5 is rejected.
Model 3 also shows a significant effect of BDAC on DSR (path coef. 0.528;
p-value = 0.000). The R-squared for DSR (0.267) had the lowest value in the model but was still statistically significant. Thus, it supports H4. This indicates that the development of BDAC is essential for the successful execution of DSR. Furthermore, the author tested the relationship between DSR and business performance.
Figure 5 shows the medium effect of DSR (path coef. 0.192) on business performance. The
p-value was 0.062, slightly above the threshold of 0.05. Interestingly, in Model 2, the
p-value of the aforementioned relationship was 0.022 (
Table 7), which is below the threshold of 0.05. Additionally, in Model 2, DSR demonstrates a stronger impact on business performance, with a path coefficient of 0.238, compared to Model 3. Consequently, given the borderline
p-value in Model 3, which does not definitively reject the hypothesis, the author attributes a better fit of the DSR and business performance relationships to the second-order model without digitainability (
Figure 4). Therefore, H7 is supported. This phenomenon requires further investigation to gain deeper insight into its underlying mechanisms and implications. Furthermore, a comparison of the first-order and second-order model results, in alignment with the empirical findings of [
33], confirms that DSR has a positive impact on sustainability and business performance only when the organisation possesses all four BDAC factors.
Furthermore, the author tested the impact of DSR on digitainability, and the results (path coef. 0.000; p-value = 0.140) did not reveal any relationship between these constructs. Thus, H6 is rejected. Thus, while DSR positively impacts sustainability, as supported in H6.1, it does not impact digitainability and digitalisation, as rejected in H6 and H6.2. The author suggests that, at this early stage of implementation, the approach to DSR remains primarily centred on sustainability, with little to no emphasis on broader organisational digitalisation aspects related to the new regulatory environment. This may indicate that the Latvian market has yet to fully recognise the necessity of establishing robust sustainability accounting and reporting systems to ensure robust DSR.
Furthermore, mediation tests were performed. The results of the mediation tests for Model 3 (
Figure 5), shown in
Table 9, strongly reject the mediating effect of digitainability on the relationship between BDAC and business performance. Therefore, H2.1 is rejected. However, the
p-value in the relationship between BDAC and DSR and PERF was 0.62, which is close to the threshold of 0.5. Thus, the author analysed this mediation without a digitainability construct based on Model 2 (
Figure 4). The results presented in
Table 9 support the hypothesis, with a
p-value of 0.024 and moderate path coefficient of 0.126, indicating a statistically significant mediation effect. Thus, H2.2 is supported. Interestingly, the total mediation effect between BDAC and business performance shown in
Table 9, had the strongest correlation (path coef. 0.488;
p-value = 0.000). This aligns with the survey results of [
49], who found that BDA and corporate sustainable performance are strongly mediated by data-driven competitive sustainability.
The main results of the hypotheses testing from Models 2 and 3 are shown in
Figure 6. In addition to confirming the strong direct relationship between BDAC and business performance, the results indicate that BDAC has a significant positive influence on DSR, sustainability, digitalisation, and digitainability. This provides compelling market evidence that enhancing BDAC—by strengthening its key factors—is essential for driving successful business development in the evolving DSR environment.
5.5. Challenges to DSR Readiness
The survey included an open-ended question: “Please name the main challenge faced when preparing for digital sustainability (ESRS) reporting?”. All respondents provided an answer to this question. To summarise 75 answers and highlight the main challenges mentioned by the respondents, the author visualised the results using word cloud (
Figure 8) and text blocks (
Figure 9) techniques in the online Voyant text analysis tool [
99]. The aim is to apply content analysis to the qualitative data in order to support and enrich the interpretation of the statistical results.
Figure 8 clearly illustrates the central and challenging role of data in the preparation process for Digital Sustainability Reporting (DSR). The word “data” was mentioned 40 times in respondents’ answers, making it the most frequently used term—significantly more than “report” (14 mentions) and “sustainability” (13 mentions).
Figure 8 and
Figure 9 together highlight a strong contextual emphasis on data-related challenges, particularly the recurring theme of its insufficiency.
Besides the lack of data, the main challenges faced by organisations related to sustainability data are its quality and availability (see
Figure 9). Based on the links provided in
Figure 8, respondents also outline the absence of the systems’ universalism, stating that data lacks consolidation, as it is located in different departments and systems or unavailable from the value chain. The respondents provided the following related answers about the challenges: “Correct data entry, data consolidation”, “Data gaps and quality of existing data in systems”, “Data extraction from the value chain, data interpretation, manual data processing”, “Data aggregation”, “Lack of unified data management in the group (directions, departments work in different systems, we also widely use Excel, but in different quality)”, “Receiving information from third parties; compiling and analysing existing data (different data formats, etc.)”. These responses underscore the critical role of integrated IT systems within organisations in ensuring data availability and quality, key prerequisites for developing BD technology capability that support robust sustainability reporting.
The connections illustrated in
Figure 9 highlight an additional, distinct theme in the responses: the challenge of controlling data processing. This, in turn, influences data quality and reliability in DSR. For instance, the author retrieved the following statement: “Much of the data collection and processing process is manual—with high chances of human error. There is no synchronised single system that automatically reads in the sustainability data currently in the company…”. According to the literature, control over data is crucial for flexibility over sustainability reporting related processes [
100]. That draws BDA infrastructure capability as the second crucial element of BDAC to be developed in the organisation for securing sufficient DSR readiness.
Additionally, respondents stress that environmental uncertainty around CSRD regulation brings challenges to understanding requirements for DSR. In line with that, as shown in
Figure 8 and
Figure 9, ten respondents said that the main challenge for their teams is “understanding the scope of ESRS reporting” or “insufficient staff resources that understand ESRS reporting, data point content…” This point highlights a key challenge related to organisational human analytical competencies within the context of DSR. Additionally, these competencies should be adaptable and capable of evolving in parallel with the rapidly changing regulatory environment. This indicates that the compliant and timely implementation of DSR depends on the proactive development of the BDA human capability.
Finally,
Figure 8 shows the ultimate role of the management. Some respondents identified that “lack of understanding at the top management level, which limits funding to address technical and data collection issues for reporting purposes” or “management buyout” mentioned as the main challenge they have to achieve DSR readiness. In line with that, it is challenging to overcome “management complexity to meet strict regulatory standards and deadlines”. The respondents’ statements position top management at the forefront of the organisational changes compelled by the new SR regulation. Therefore, the dynamic capability for knowledge management must be embedded at the top management level to establish a coherent foundation for developing other elements of BDAC essential for preparing DSR. Considering the points outlined above, the presence of BDA management capability within an organisation is a critical element in establishing the artefacts and routines required for effective sustainability reporting.
The investigation conducted by the author in this section provides content-based evidence that BDA technology, management, human, and infrastructure capabilities are necessary elements for BDAC to have a substantial positive effect on DSR readiness. Thus, the results of the content analyses for the answers to the open-ended question from the survey visualised in
Figure 8 and
Figure 9 validate the approval of H1 and H4 (
Table S2) based on the statistical hypotheses tests. However, results show that it is challenging for organisations to integrate BDAC elements to ensure proper DSR readiness.
Therefore, to address the gap, the author proposes step-by-step guidance to support organisations in identifying challenges and building the robust BDAC necessary for integrating DSR and enhancing business performance. A sustainable business development driven by BDAC and digitainability framework (
Figure 10) was developed, drawing on an extensive literature review, the GST, OIPT and RBT theoretical framework, and empirical analyses. This framework outlines five optimal and coherent key steps comprising the main elements that organisations should adopt to enhance their business performance.
Based on the hypotheses testing, the author stresses that to achieve a positive impact on business performance, steps from the sustainable business development driven by BDAC and digitainability framework (
Figure 10) should not be applied in a chaotic order, and none of the steps should be skipped or adopted partially. Only a consistent implementation of all the outlined elements in the framework will lead to an efficient and effective result.