BigQuery Usage
Impact_Analytics
About This Course
Module 1: Best Practices in writing SQL Queries
-
-
- Best Practices While Creating Tables
-
- Implementing Proper Schema Design
-
- Use case specific schema design
- Usage of appropriate data types
- Avoiding complex data structures and repeated fields
-
- Clustering & Partitioning tables
-
- How to Cluster & Partition tables?
- How does it improve performance?
- Additional Scenarios where clustering and Partitioning helps in optimization
-
- Implementing Proper Schema Design
-
- Best Practices While Querying Tables
-
- Selective Projection (Avoid Select *)
- Optimal Usage of Where clauses to scan minimal data
- Join Optimizations
- Optimal Usage of Analytic Functions
- Use Data Sampling for testing queries
-
- Best Practices While Creating Tables
-
Module 2: Cost Structures & Cost Optimization Techniques
-
-
- Cost Structure of GBQ
-
- Cost components of GBQ
- Active vs Long Term Storage Pricing
- Flat Rate vs On Demand Pricing
-
- Techniques to make query runs cost effective ?
-
- Caching policy management
- Data lifecycle management
- Query Profiling
- Query Cost Monitoring
-
- Cost Structure of GBQ
-
Module 3: Protocols & Standard Operating Procedures At IA
-
-
- Restrictions Imposed on Querying By IA
-
- Restrictions of Service Accounts
- Restrictions on Individual Accounts
- Time Based Restrictions
-
- Standard Operating Procedures
-
- When does a ticket get raised?
- What is the process involved in resolving a ticket? Who are the stakeholders involved?
- What are the standard error messages given by BigQuery? What is the protocol for each of the error messages?
- What is the procedure to raise exceptions to restrictions imposed?
-
- Restrictions Imposed on Querying By IA
-