Skip to main content

Introduction to Data Ingestion


Impact_Analytics

Module 1: Introduction to Data Ingestion - Duration: 1/2 Hour

      • Overview of Data Ingestion
      • Importance and Applications
      • Key Concepts and Terminology

Module 2: Sourcing Pipeline - Duration: 1/2 Hour

      • Understanding Business Requirements 
      • Identifying Data Sources and Formats
      • Determining Data Volume and Velocity
      • Selecting data connectors and extraction mechanisms

Module 3: Ingestion Pipeline - Duration: 1/2 Hour

      • Understanding data standardization and data transformation (sourcing) queries
      • Identifying data sanity steps and role of QC Bot
      • Understanding product and store hierarchies and attributes

Module 4: Derived tables Pipeline - Duration: 1/2 Hour

      • Understanding flow of data into different applications
      • Understanding derived table transformations queries and its related SPs
      • Understanding Backsync jobs and its dependencies

Module 5: Modeling Pipeline - Duration: 1/2 Hour

      • Understanding flow of data into ADA/modeling pipelines
      • Introduction to clustering and Loss sale imputation
      • FMT - Modeling data preparation
Enroll