This Talend project builds a Data Warehouse (DWH) from the Northwind dataset. It is divided into two phases, a Staging Area and the Data Warehouse itself, and integrates data from the Northwind Access Database and the Transactional Database (Northwind) in SQL Server.
In the Staging Area, data is ingested and prepared for further processing. Two primary sources are utilized:
- Northwind Access Database:
  - This database serves as a key source of data for the project.
  - Talend is used to extract data from the Access database, transforming and cleaning it for compatibility with the data warehouse schema.
- Transactional Database (Northwind) in SQL Server:
  - The SQL Server database provides additional transactional data for a more comprehensive data warehouse.
  - Talend is employed to extract relevant data, ensuring consistency and conformity with the overall project requirements.
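As a rough illustration of the kind of cleaning a staging job performs, the Python sketch below normalizes rows from the two sources into one shape. It is not the Talend job from this repository. The helper names (`clean_value`, `clean_row`), the `source_system` column, and the sample rows are assumptions made for the example.

```python
from typing import Optional


def clean_value(value: Optional[str]) -> Optional[str]:
    """Trim whitespace and map empty strings to None."""
    if value is None:
        return None
    value = value.strip()
    return value or None


def clean_row(row: dict, source: str) -> dict:
    """Return a staging row with trimmed values and a source tag."""
    cleaned = {
        key: clean_value(val) if isinstance(val, str) else val
        for key, val in row.items()
    }
    # Upper-case the business key so the same customer matches across sources.
    if cleaned.get("CustomerID"):
        cleaned["CustomerID"] = cleaned["CustomerID"].upper()
    cleaned["source_system"] = source  # hypothetical lineage column
    return cleaned


# Sample rows (illustrative only), one per source.
access_row = {"CustomerID": " alfki ", "CompanyName": "Alfreds Futterkiste ", "Country": ""}
sqlserver_row = {"CustomerID": "ANATR", "CompanyName": "Ana Trujillo Emparedados y helados", "Country": "Mexico"}

staging = [clean_row(access_row, "access"), clean_row(sqlserver_row, "sqlserver")]
```

Tagging each row with its source keeps lineage visible once both feeds land in the same staging table.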
The second phase of the project builds the Data Warehouse. It covers:
- Schema Design:
  - Designing an effective and scalable data warehouse schema to accommodate the requirements of the project.
  - Ensuring that the schema supports efficient querying and reporting.
- ETL Processes:
  - Developing Extract, Transform, Load (ETL) processes using Talend to populate the Data Warehouse.
  - Transforming data from the Staging Area to fit the warehouse schema.
  - Handling any necessary data cleansing and enrichment.
- Optimization:
  - Implementing optimization techniques to enhance the performance of the warehouse.
  - Indexing, partitioning, and other strategies are considered for efficient data retrieval.
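The three steps above can be sketched end to end in a few lines. The example below uses Python's built-in `sqlite3` module as a stand-in for SQL Server, so it shows the pattern only, not this project's actual schema or Talend jobs. The table and column names (`stg_orders`, `dim_customer`, `fact_sales`) are hypothetical, and partitioning is left out because SQLite does not support it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE stg_orders (        -- staging table (hypothetical layout)
    order_id INTEGER, customer_id TEXT, company_name TEXT,
    quantity INTEGER, unit_price REAL
);
CREATE TABLE dim_customer (      -- dimension with a surrogate key
    customer_key INTEGER PRIMARY KEY AUTOINCREMENT,
    customer_id  TEXT UNIQUE,
    company_name TEXT
);
CREATE TABLE fact_sales (        -- fact table referencing the dimension
    order_id     INTEGER,
    customer_key INTEGER REFERENCES dim_customer(customer_key),
    quantity     INTEGER,
    amount       REAL
);
""")

# Sample staging rows (illustrative values only).
conn.executemany(
    "INSERT INTO stg_orders VALUES (?, ?, ?, ?, ?)",
    [(1, "ALFKI", "Alfreds Futterkiste", 2, 10.0),
     (2, "ALFKI", "Alfreds Futterkiste", 1, 4.5),
     (3, "ANATR", "Ana Trujillo Emparedados y helados", 3, 2.0)],
)

# Load the dimension: one row per distinct business key.
conn.execute("""
INSERT INTO dim_customer (customer_id, company_name)
SELECT DISTINCT customer_id, company_name FROM stg_orders
""")

# Load the fact table: look up the surrogate key and derive the amount.
conn.execute("""
INSERT INTO fact_sales (order_id, customer_key, quantity, amount)
SELECT s.order_id, d.customer_key, s.quantity, s.quantity * s.unit_price
FROM stg_orders s JOIN dim_customer d ON d.customer_id = s.customer_id
""")

# Index the join column that reporting queries filter and join on.
conn.execute("CREATE INDEX idx_fact_sales_customer ON fact_sales (customer_key)")

# A typical reporting query: total amount per customer.
totals = conn.execute("""
SELECT d.customer_id, SUM(f.amount)
FROM fact_sales f JOIN dim_customer d USING (customer_key)
GROUP BY d.customer_id ORDER BY d.customer_id
""").fetchall()
```

Loading dimensions before facts matters here: the fact load depends on the surrogate keys that the dimension load generates.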
- Customer Dimension
- Employee Dimension
- Product Dimension
- Date Dimension
- Fact Table 1
- Fact Table 2
- Schema
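Unlike the other dimensions, a date dimension is usually generated rather than extracted from a source system. The sketch below shows one common way to do that in Python. The function name `build_date_dimension`, the column names, and the sample date range are assumptions for illustration and may differ from the Date Dimension built in this project.

```python
from datetime import date, timedelta


def build_date_dimension(start: date, end: date) -> list:
    """Return one row per calendar day from start to end, inclusive."""
    rows = []
    current = start
    while current <= end:
        rows.append({
            "date_key": int(current.strftime("%Y%m%d")),  # integer key in YYYYMMDD form
            "full_date": current.isoformat(),
            "year": current.year,
            "quarter": (current.month - 1) // 3 + 1,
            "month": current.month,
            "day": current.day,
            "weekday": current.isoweekday(),  # 1 = Monday ... 7 = Sunday
        })
        current += timedelta(days=1)
    return rows


# Arbitrary one-week sample range.
date_dim = build_date_dimension(date(1996, 7, 4), date(1996, 7, 10))
```

An integer `YYYYMMDD` key keeps fact rows compact and still readable when inspected by hand.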
To replicate this project, follow these steps:
- Clone the repository:
  git clone https://github.com/your-username/Building-Northwind-DWH-Using-Talend.git