*Review* **An Overview of Data Warehouse and Data Lake in Modern Enterprise Data Management**

**Athira Nambiar \* and Divyansh Mundra**

> Department of Computational Intelligence, School of Computing, SRM Institute of Science and Technology, Chennai 603203, India

**\*** Correspondence: athiram@srmist.edu.in

**Abstract:** Data is the lifeblood of any organization. In today's world, organizations recognize the vital role of data in modern business intelligence systems for making meaningful decisions and staying competitive in the field. Efficient and optimal data analytics provides a competitive edge to its performance and services. Major organizations generate, collect and process vast amounts of data, falling under the category of big data. Managing and analyzing the sheer volume and variety of big data is a cumbersome process. At the same time, proper utilization of the vast collection of an organization's information can generate meaningful insights into business tactics. In this regard, two of the popular data managemen<sup>t</sup> systems in the area of big data analytics (i.e., data warehouse and data lake) act as platforms to accumulate the big data generated and used by organizations. Although seemingly similar, both of them differ in terms of their characteristics and applications. This article presents a detailed overview of the roles of data warehouses and data lakes in modern enterprise data management. We detail the definitions, characteristics and related works for the respective data managemen<sup>t</sup> frameworks. Furthermore, we explain the architecture and design considerations of the current state of the art. Finally, we provide a perspective on the challenges and promising research directions for the future.

**Keywords:** big data; data warehousing; data lake; enterprise data management; OLAP; ETL tools; metadata; cloud computing; Internet of Things
