- 14 Sections
- 114 Lessons
- 8 Weeks
- Introduction to Databricks & Apache Spark (5 lessons)
- Working with Data Formats (7 lessons)
- DataFrame Operations (17 lessons)
  - 3.1 Select Columns
  - 3.2 Add New Column
  - 3.3 Rename Column
  - 3.4 Case When in DataFrame
  - 3.5 Filter Data
  - 3.6 Sort Data
  - 3.7 Drop Columns & Duplicates
  - 3.8 Handle Null Values
  - 3.9 Group By
  - 3.10 Collect List vs Collect Set
  - 3.11 Explode Function
  - 3.12 Row_Number vs Rank vs Dense Rank
  - 3.13 Join DataFrames
  - 3.14 Types of Views
  - 3.15 Write Data into Table
  - 3.16 Managed vs External Table
  - 3.17 User-Defined Functions (UDFs)
- Delta Lake Essentials (10 lessons)
- Batch & Streaming Data (5 lessons)
- Databricks Utilities (3 lessons)
- Unity Catalog & Governance (12 lessons)
  - 7.1 Introduction to Unity Catalog
  - 7.2 Create Databricks, ADLS Gen2 and Connector
  - 7.3 Set Up Access Connector & Metastore
  - 7.4 Create External Location & Storage Credential
  - 7.5 Create Catalog, Schema, and Tables
  - 7.6 Lineage
  - 7.7 Delta Sharing (Non-Databricks Customer)
  - 7.8 Masking Columns – Sensitive Data
  - 7.9 Row-Level Access Control
  - 7.10 Create SPN and Grant Access
  - 7.11 SQL Commands for Unity Catalog
  - 7.12 Compute Types and Group Management
- Spark Optimization Techniques (15 lessons)
  - 8.1 Spark Architecture in Databricks
  - 8.2 Z-Order Optimization
  - 8.3 Predictive Optimization
  - 8.4 Cluster Tuning
  - 8.5 Shuffle Explained and Optimization
  - 8.6 Partitioning
  - 8.7 Serialization
  - 8.8 Storage Optimization
  - 8.9 Skew and Skew Solutions
  - 8.10 Spill
  - 8.11 Performance Debugging
  - 8.12 Liquid Clustering
  - 8.13 Deletion Vectors
  - 8.14 Vacuum Revisited
  - 8.15 Databricks UI Simulator
- Testing & CI/CD Integration (8 lessons)
- Delta Live Tables (DLT) (3 lessons)
- DevOps & Job Orchestration (4 lessons)
- Cloud Integration - Azure & AWS (11 lessons)
  - 12.1 Connect Databricks with Azure Data Lake Gen2
  - 12.2 Set Up Access Connector and Mount Storage in Azure
  - 12.3 Create Unity Catalog Metastore for Azure
  - 12.4 Connect Databricks with AWS S3 using IAM Role
  - 12.5 Assume Role Integration with External AWS Accounts
  - 12.6 Read & Write Data from S3 Buckets
  - 12.7 AWS Glue as Metastore Integration (optional)
  - 12.8 Token-Based Access vs Role-Based Access
  - 12.9 Securing Secrets with Databricks CLI or Azure Key Vault
  - 12.10 Cross-Cloud Architecture: Move Data from AWS to Azure and Vice Versa
  - 12.11 Configure External Locations for AWS and Azure
- Real-Time Project (LLM POC) (6 lessons)
- Bonus: Suggested Additional Topics (8 lessons)
