Clean Customer Master Data for Customer Analytics: A Neglected Element of Data Monetization
Abstract
:1. Introduction
2. Theoretical Background
2.1. Digitization, Digitalization, and Digital Transformation
2.2. Data Monetization
2.3. Data, Data Structuring, and Data Analytics
2.4. Customer Analytics
2.5. Algorithms for Data Integration and Cleansing
3. Research Methodology
3.1. Empirical Context
3.2. Data Collection
3.3. Data Analysis
4. Results—Insights into the Data Cleansing Initiative
4.1. The Three Main Phases of Using Data for Customer Analytics
4.2. Data Cleanliness, Integration, and Harmonization: A Key Challenge
4.3. Challenge Identification—Impeding Progress
4.4. Proposed Solutions for Overcoming These Challenges
4.5. A Framework for a Customer Master Data Cleansing Process
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Danuso, A.; Giones, F.; da Silva, E.R. The digital transformation of industrial players. Bus. Horiz. 2022, 65, 341–349. [Google Scholar] [CrossRef]
- Wixom, B.H.; Piccoli, G.; Rodriguez, J. Fast-track data monetization with strategic data assets. MIT Sloan Manag. Rev. 2021, 62, 1–4. [Google Scholar]
- Zhang, X.; Yue, W.T.; Yu, Y.; Zhang, X. How to monetize data: An economic analysis of data monetization strategies under competition. Decis. Support Syst. 2023, 173, 114012. [Google Scholar] [CrossRef]
- Top Trends in Data and Analytics 2024. Available online: https://www.gartner.com/smarterwithgartner/gartner-top-10-data-and-analytics-trends-for-2021 (accessed on 24 July 2024).
- How to Monetize Your Customer Data. Available online: https://www.gartner.com/smarterwithgartner/how-to-monetize-your-customer-data/ (accessed on 24 July 2024).
- How to Create a Business Case for Data Quality Improvement. Available online: https://www.gartner.com/smarterwithgartner/how-to-create-a-business-case-for-data-quality-improvement/ (accessed on 24 July 2024).
- Ritala, P.; Keränen, J.; Fishburn, J.; Ruokonen, M. Selling and monetizing data in B2B markets: Four data-driven value propositions. Technovation 2024, 130, 102935. [Google Scholar] [CrossRef]
- Najjar, M.S.; Kettinger, W.J. Data monetization: Lessons from a retailer’s journey. MIS Q. Exec. 2013, 12, 213–225. [Google Scholar]
- Saarikko, T.; Westergren, U.H.; Blomquist, T. Digital transformation: Five recommendations for the digitally conscious firm. Bus. Horiz. 2020, 63, 825–839. [Google Scholar] [CrossRef]
- Kokkinou, A.; van Kollenburg, T.; Mandemakers, A.; Hopstaken, H.; van Elderen, J. The data analytic capability wheel: An implementation framework for digitalization. In Proceedings of the 36th Bled eConference: Digital Economy and Society: The Balancing Act for Digital Innovation in Times of Instability, Bled, Slovenia, 25–28 June 2023. [Google Scholar]
- Legner, C.; Eymann, T.; Heß, T.; Matt, C.; Böhmann, T.; Drews, P.; Mädche, A.; Urbach, N.; Ahlemann, F. Digitalization: Opportunity and challenge for the business and information systems engineering community. Bus. Inform. Syst. Eng. 2017, 59, 301–308. [Google Scholar] [CrossRef]
- Tilson, D.; Lyytinen, K.; Sørensen, C. Research commentary—Digital infrastructures: The missing IS research agenda. Inf. Syst. 2010, 21, 748–759. [Google Scholar] [CrossRef]
- Brynjolfsson, E.; McAfee, A. The Second Machine Age: Work, Progress, and Prosperity in a Time of Brilliant Technologies; WW Norton & Company: New York, NY, USA, 2014. [Google Scholar]
- Machado, C.G.; Winroth, M.; Carlsson, D.; Almström, P.; Centerholt, V.; Hallin, M.C. Industry 4.0 readiness in manufacturing companies: Challenges and enablers towards increased digitalization. Procedia CIRP 2019, 81, 1113–1118. [Google Scholar] [CrossRef]
- Ofulue, J.; Benyoucef, M. Data monetization: Insights from a technology-enabled literature review and research agenda. Manag. Rev. Q. 2024, 74, 521–565. [Google Scholar] [CrossRef]
- Wixom, B.H.; Ross, J.W. How to monetize your data. MIT Sloan Manag. Rev. 2017, 58, 10–13. [Google Scholar]
- Faroukhi, A.Z.; El Alaoui, I.; Gahi, Y.; Amine, A. Big data monetization throughout big data value chain: A comprehensive review. J. Big Data 2020, 7, 3. [Google Scholar] [CrossRef]
- Wixom, B.; Yen, B.; Rellich, M. Maximizing value from business analytics. MIT Sloan Manag. Rev. 2013, 12, 111–123. [Google Scholar]
- Kietzmann, J.; Paschen, J.; Treen, E. Artificial intelligence in advertising: How marketers can leverage artificial intelligence along the consumer journey. J. Advert. 2018, 58, 263–267. [Google Scholar] [CrossRef]
- Sivarajah, U.; Kamal, M.M.; Irani, Z.; Weerakkody, V. Critical analysis of big data challenges and analytical methods. J. Bus. Res. 2017, 70, 263–286. [Google Scholar] [CrossRef]
- Sanders, N.R. How to use big data to drive your supply chain. Calif. Manag. Rev. 2016, 58, 26–48. [Google Scholar] [CrossRef]
- Tabesh, P.; Mousavidin, E.; Hasani, S. Implementing big data strategies: A managerial perspective. Bus. Horiz. 2019, 62, 347–358. [Google Scholar] [CrossRef]
- Shah SI, H.; Peristeras, V.; Magnisalis, I. Government big data ecosystem: Definitions, types of data, actors, and roles and the impact in public administrations. ACM J. Data Inform. Qual. 2021, 13, 1–25. [Google Scholar] [CrossRef]
- Meena, P.; Sahu, P. Customer relationship management research from 2000 to 2020: An academic literature review and classification. Vision 2021, 25, 136–158. [Google Scholar] [CrossRef]
- Erevelles, S.; Fukawa, N.; Swayne, L. Big data consumer analytics and the transformation of marketing. J. Bus. Res. 2016, 69, 897–904. [Google Scholar] [CrossRef]
- Hossain, M.A.; Akter, S.; Yanamandram, V.; Wamba, S.F. Data-driven market effectiveness: The role of a sustained customer analytics capability in business operations. Technol. Forecast. Soc. Chang. 2023, 194, 122745. [Google Scholar] [CrossRef]
- Velcu-Laitinen, O.; Yigitbasioglu, O. The use of dashboards in performance management: Evidence from sales managers. Int. J. Digit. Account. Res. 2012, 12, 36–58. [Google Scholar]
- What Is Customer Analytics? Available online: https://www.forbes.com/advisor/business/customer-analytics/ (accessed on 24 July 2024).
- Chen, H.; Chiang, R.H.; Storey, V.C. Business intelligence and analytics: From big data to big impact. MIS Q. 2012, 36, 1165–1188. [Google Scholar] [CrossRef]
- Dover, C. How dashboards can change your culture. Strat. Fin. 2004, 86, 42. [Google Scholar]
- Pappas, L.M.; Whitman, L. Riding the technology wave: Effective dashboard data visualization. In Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2011; pp. 249–258. [Google Scholar]
- Watson, H.J.; Goodhue, D.L.; Wixom, B.H. The benefits of data warehousing: Why some organizations realize exceptional payoffs. Inform. Manag. 2002, 39, 491–502. [Google Scholar] [CrossRef]
- Gudivada, V.N.; Apon, A.; Ding, J. Data quality considerations for big data and machine learning: Going beyond data cleaning and transformations. Intern. J. Adv. Soft. 2017, 10, 1–20. [Google Scholar]
- Renear, A.H.; Sacchi, S.; Wickett, K.M. Definitions of dataset in the scientific and technical literature. Proc. Assoc. Inf. Sci. Technol. 2010, 47, 1–4. [Google Scholar] [CrossRef]
- Lee, G.Y.; Alzamil, L.; Doskenov, B.; Termehchy, A. A survey on data cleaning methods for improved machine learning model performance. arXiv 2021, arXiv:2109.07127. [Google Scholar]
- Pansara, R.R. Maturity Model of Master Data Management at Enterprise Level. Sch. J. Eng. Tech. 2024, 2, 31–39. [Google Scholar] [CrossRef]
- McGilvray, D. Executing Data Quality Projects: Ten Steps to Quality Data and Trusted Information TM; Academic Press: Cambridge, MA, USA, 2008. [Google Scholar]
- Wedekind, H. Bestandsdaten, Bewegungsdaten, Stammdaten. In Lexikon der Wirtschaftsinformatik; Springer: Berlin/Heidelberg, Germany, 1997; p. 61. [Google Scholar]
- Arnold, J.; Hammwöhner, R. Data Integration and Data Cleaning: Solutions for Improving Data Quality; Springer: Berlin/Heidelberg, Germany, 2022. [Google Scholar]
- Mahdavi, M.; Neutatz, F.; Visengeriyeva, L.; Abedjan, Z. Towards automated data cleaning workflows. Mach. Learn. 2019, 15, 16. [Google Scholar]
- Ridzuan, F.; Zainon, W. A review on data cleansing methods for big data. Procedia Comput. Sci. 2019, 161, 731–738. [Google Scholar] [CrossRef]
- Eisenhardt, K.M. Building theories from case study research. Acad. Manag. Rev. 1989, 14, 532–550. [Google Scholar] [CrossRef]
- Yin, R.K. Qualitative Research from Start to Finish, 2nd ed.; The Guilford Press: New York, NY, USA, 2015. [Google Scholar]
- Langley, A. Strategies for theorizing from process data. Acad. Manag. Rev. 1999, 24, 691–710. [Google Scholar] [CrossRef]
- Langley, A.; Smallman, C.; Tsoukas, H.; Van De Ven, A.H. Process studies of change in organization and Management: Unveiling temporality, activity, and flow. Acad. Manag. J. 2013, 56, 1–13. [Google Scholar] [CrossRef]
Key Research Theme | Summary |
---|---|
Digitization, Digitalization, and Digital Transformation | Researchers distinguish the concepts of digitization, digitalization, and digital transformation. Digitization involves converting analog signals into digital ones, thereby separating data from their medium. Digitalization involves integrating digital technologies into organizational processes to create new value opportunities. Digital transformation immerses the entire enterprise in digital methods, extending beyond processes and data to impact operations, business models, and competencies. |
Data Monetization | Data monetization is integral to discussions of digitization, digitalization, and digital transformation. It involves deriving economic benefits from available data sources through direct or indirect methods. Direct methods include data sales, licensing, and participation in data marketplaces. Indirect methods involve utilizing data to optimize internal operations, refine products, or enhance services. |
Data Structuring and Data Analytics | Data cleaning, preparation, and harmonization are crucial for effective data monetization. Big data analytics is a process that comprises four phases: turning data into insights, transforming insights into decisions, translating decisions into actions, and generating data points for future decision making. Companies should progress through four stages of maturity in developing data analytics capabilities: data structuring, data availability, basic analytics, and advanced analytics. |
Customer Analytics | Data cleaning is essential for customer analytics as it enables businesses to make data-driven decisions and enhance customer engagement, marketing effectiveness, and profitability. It involves examining and interpreting customer data to understand and predict behaviors, preferences, and trends. Customer analytics empowers sales representatives and managers to increase revenue and improve customer satisfaction and loyalty. |
Algorithms for Data Integration and Cleaning | Various algorithms facilitate data integration and cleaning, ensuring the quality, consistency, and reliability of data for analytics and decision making. These algorithms include entity resolution, schema matching, data fusion, ontology-based integration, missing value imputation, outlier detection and removal, normalization and standardization, text cleaning, and data transformation. Advanced techniques, such as deep learning, reinforcement learning, and graph-based algorithms, enhance the effectiveness of data cleaning. |
Challenge | Proposed Solution | Description |
---|---|---|
Data Cleansing | Comprehensive data cleansing of existing records | Addressed hierarchy issues, business partner discrepancies, and inconsistencies with golden records by removing inactive accounts and unnecessary records. Started with the GSA team conducting analysis to identify and deactivate records with no recent activity. |
Data Governance | Establishing a data governance process | Recommended a formal governance process to prevent future data issues, ensuring long-term data quality and consistency. |
Consistency in Customer Master Data | Creation of a data handbook | Developed guidelines for structuring customer master data to ensure consistency and accuracy in future data management efforts, supporting clean reporting and reducing pricing and delivery issues. |
Inactive and Unnecessary Accounts | Removal of inactive accounts and reduction of customer records | Removed 126,000 inactive accounts and further reduced records in Germany by categorizing records as “needed” or “unneeded”, significantly reducing unnecessary records and enhancing CRM efficiency. |
Customer Hierarchy and Structure | Cleansing of hierarchical customer setup | Broke down customer trees into manageable parts, identified true TLAs (Top-Level Accounts), resolved orphan records, and created a structured TLA base. Updated each customer record with accurate hierarchy details, ensuring accurate reporting and pricing at all levels of the customer hierarchy. |
Duplicate Records | Identification and consolidation of duplicates | Conducted a validation round to identify and merge duplicate records, creating a streamlined customer database. |
Parent — Child Account Relationships | Validation and correction of linked “child” accounts | Verified and corrected parent account assignments to ensure accurate reporting and consistent pricing and terms for child accounts. |
Data Handbook for Customer Setup | Development of a data handbook for customer setup guidelines | Provided guidelines on setting up customer data in the system, solving pricing and delivery issues caused by incorrect account hierarchies and structures. |
Accurate Customer Tree Levels | Correction of levels within customer trees | Merged duplicate accounts, assigned customer categories, and added data points (e.g., beds for hospitals), enhancing tree structure for strategic planning and market differentiation. |
Business Partner Alignment | Alignment of names, addresses, and customer information across systems | Enhanced readability, reduced duplicate account creation, and ensured consistency across systems to support various functions such as order management. |
Key Activities | (1) Problem Collection Activities | (2) Master Data Cleaning Activities | |||||||
---|---|---|---|---|---|---|---|---|---|
Key Tasks | (1a) Problem Identification Process | (1b) Categorization into High-Level Topics | (1c) Setting Problem Priorities | (1d) Division of Problems into Categories | (2a) Setting Up a Customer Base Overview | (2b) Deactivating Unnecessary Customers | (2c) Reassigning Top Customers | (2d) Assigning Sub- Customers to Top Customers | (2e) Cleaning Up Customer Trees |
Objectives | To achieve the point where issues are repeatedly mentioned during brainstorming sessions. | To categorize all identified problems into high-level topics based on their sources. | To prioritize problems based on the urgency of issues related to uncleanliness. | To separate problems into technical and non- technical categories for targeted resolution. | To create a comprehensive overview of the customer base with all relevant data for a functional working file. | To reduce the customer base by deactivating unnecessary customers. | To ensure the accuracy of top customer assignments by reassigning incorrectly designated top customers. | To accurately assign sub-customers to their corresponding top customers, creating initial customer trees. | To prioritize and clean up each customer tree based on customer importance. |
Steps | Conduct brainstorming sessions by gathering a diverse team for multiple sessions and encouraging open discussion and idea sharing. Document Issues through recording all mentioned issues without filtering and ensuring all participants’ input is captured. Identify repetition through reviewing the documented issues from each session and highlighting issues that are mentioned multiple times. Analyze patterns by looking for common themes and recurring problems and prioritize issues based on their frequency of mention. | Collect all problems identified in the previous problem identification process. Determine high-Level Categories by establishing high-level categories based on the sources of the problems and ensuring categories are comprehensive and cover all problem areas. Categorize Problems by sorting each problem into the appropriate high-level category and verifying that each problem is placed accurately according to its source. | Identify criteria for Prioritization Identify the most urgent problems caused by uncleanliness. Evaluate the impact of each problem, such as its effect on reporting. | Identify problem categories by reviewing the list of identified problems. Classifying problems by dividing problems into non-technical issues and technical issues Assigning responsibility to the appropriate technical teams and non- technical issues to the relevant non-technical teams. | Collect customer data by gather all relevant data pertaining to the customer base. Organize data by structuring the data in a clear and systematic manner. Create a working file by compiling the organized data into a single, functional file. | Identify unnecessary customers by reviewing the customer base and identifying customers that are no longer needed. Deactivate identified customers from the active database. Update the customer base by ensuring the updated customer base to reflect only the necessary and active customers. | Review current top customers by analyzing existing list of top customers and identifying inaccuracies. Identify incorrect assignments by determining which customers have been incorrectly labeled as top customers. Reassign customers correctly to ensuring the top customer list reflects the actual top customers. | Identify sub- customers by listing all sub-customers needing assignment. Determine correct top customers by identifying appropriate top customer for each sub-customer. Assign sub-customers for linking each sub- customer to their correct top customer. Create customer trees by establish initial customer trees by organizing sub- customers under their respective top customers. | Prioritize customer trees by ranking customer trees according to the importance of the top customers. Review each customer tree through examine the structure and details of each customer tree. Perform cleanup by removing inaccuracies, redundancies, and outdated information within each tree. |
Outcome | A clear identification of key problems, as indicated by their repeated mention during brainstorming sessions. | A structured categorization of all problems into a specified number of high-level topics, ensuring clarity and focus on the sources of each issue. | A prioritized list of problems, with the most urgent issues related to uncleanliness and their impacts on key areas like reporting given top priority | A clear division of problems into technical and non-technical categories, ensuring that each type of issue is addressed by the appropriate teams using the right approaches. | A complete and structured overview of the customer base, contained in a working file with all relevant data readily accessible. | A streamlined and reduced customer base, focusing only on active and necessary customers | A correct and updated list of top customers, accurately representing the most important clients. | Accurately assigned sub-customers, forming structured customer trees under the correct top customers. | A set of well-organized and accurate customer trees, prioritized by the importance of each customer |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Singh, J.; Gebauer, H. Clean Customer Master Data for Customer Analytics: A Neglected Element of Data Monetization. Digital 2024, 4, 1020-1039. https://doi.org/10.3390/digital4040051
Singh J, Gebauer H. Clean Customer Master Data for Customer Analytics: A Neglected Element of Data Monetization. Digital. 2024; 4(4):1020-1039. https://doi.org/10.3390/digital4040051
Chicago/Turabian StyleSingh, Jasmin, and Heiko Gebauer. 2024. "Clean Customer Master Data for Customer Analytics: A Neglected Element of Data Monetization" Digital 4, no. 4: 1020-1039. https://doi.org/10.3390/digital4040051
APA StyleSingh, J., & Gebauer, H. (2024). Clean Customer Master Data for Customer Analytics: A Neglected Element of Data Monetization. Digital, 4(4), 1020-1039. https://doi.org/10.3390/digital4040051