Lakehouse-based data architecture: modern approaches to building a management model for educational organizations based on data analysis
Abstract
Lakehouse-based data architecture: modern approaches to building a management model for educational organizations based on data analysis
Incoming article date: 11.02.2025Nowadays, educational organisations face the need to effectively manage growing volumes of heterogeneous data from academic performance and digital educational resources to administrative processes. The article is dedicated to the study of modern approaches to building an Corporate data warehouse (DWH) using Data Lake technology to manage educational organisations. The article considers the integration of traditional methods of structured data storage with the flexibility and scalability of Data Lake, which allows to work effectively with large volumes of heterogeneous data. The description of DWH architecture adapted for educational institutions is given. The description of Apache Airflow platform is given.
Keywords: Data Lake, corporate data warehouse, Apache Airflow, Greenplum, ETL