Introduction to Data Ingestion
Impact_Analytics
Module 1: Introduction to Data Ingestion - Duration: 1/2 Hour
-
-
- Overview of Data Ingestion
- Importance and Applications
- Key Concepts and Terminology
-
Module 2: Sourcing Pipeline - Duration: 1/2 Hour
-
-
- Understanding Business Requirements
- Identifying Data Sources and Formats
- Determining Data Volume and Velocity
- Selecting data connectors and extraction mechanisms
-
Module 3: Ingestion Pipeline - Duration: 1/2 Hour
-
-
- Understanding data standardization and data transformation (sourcing) queries
- Identifying data sanity steps and role of QC Bot
- Understanding product and store hierarchies and attributes
-
Module 4: Derived tables Pipeline - Duration: 1/2 Hour
-
-
- Understanding flow of data into different applications
- Understanding derived table transformations queries and its related SPs
- Understanding Backsync jobs and its dependencies
-
Module 5: Modeling Pipeline - Duration: 1/2 Hour
-
-
- Understanding flow of data into ADA/modeling pipelines
- Introduction to clustering and Loss sale imputation
- FMT - Modeling data preparation
-