Skip to main content

BigQuery Usage


Impact_Analytics

About This Course

Module 1: Best Practices in writing SQL Queries

      1. Best Practices While Creating Tables 
          1. Implementing Proper Schema Design
              • Use case specific schema design
              • Usage of appropriate data types
              • Avoiding complex data structures and repeated fields
          2. Clustering & Partitioning tables
              • How to Cluster & Partition tables?
              • How does it improve performance?
              • Additional Scenarios where clustering and Partitioning helps in optimization
      2. Best Practices While Querying Tables
          1. Selective Projection (Avoid Select *)
          2. Optimal Usage of Where clauses to scan minimal data 
          3. Join Optimizations
          4. Optimal Usage of Analytic Functions  
          5. Use Data Sampling for testing queries

Module 2: Cost Structures & Cost Optimization Techniques

      1. Cost Structure of GBQ
          1. Cost components of GBQ
          2. Active vs Long Term Storage Pricing
          3. Flat Rate vs On Demand Pricing
      2. Techniques to make query runs cost effective ?
          1. Caching policy management
          2. Data lifecycle management
          3. Query Profiling
          4. Query Cost Monitoring

Module 3: Protocols & Standard Operating Procedures At IA

      1. Restrictions Imposed on Querying By IA
          1. Restrictions of Service Accounts
          2. Restrictions on Individual Accounts
          3. Time Based Restrictions
      2. Standard Operating Procedures
          1. When does a ticket get raised? 
          2. What is the process involved in resolving a ticket? Who are the stakeholders involved?
          3. What are the standard error messages given by BigQuery? What is the protocol for each of the error messages?
          4. What is the procedure to raise exceptions to restrictions imposed?
Enroll