Databricks Certified Data Engineer Professional Certification || Resources/ Tips/ Tricks/ Preparation
Important Resources and concepts to prepare
Apache Spark is most used parallel processing engine used in Data Engineering and Databricks is the best platform providing efficient version of Apache Spark with extra features. So, This Certification will help you a lot as a Data Engineering to crack interviews and in your Career.
In this blog, I have prepared complete guide mentioning Topics to prepare for the Exam and Resource I followed to crack this Exam!
Important Note:
First of all, I want to mention,It’s not an easy exam to crack, So you need to learn so many concepts in Databricks and Apache Spark andI would Highly suggestyou to completeDatabricks Certified Data Engineer Associate Certification.
Associate Certification isnot an mandatory/pre-requisites,but you need all the skills of Associate Certification.Personally I did not attended for Associate Certification but I had learned all the skills of Associate and then started to learn required skills for Professional one.
Let’s break down all the important concepts of Spark and Databricks which you need to learn to easily crack this Certification.
Using Change Data Capture (CDC) to propagate changes
Optimizing workloads
Structured Streaming
Incremental Data Ingestion
Auto Loader
Databricks SQL
3. Data Modeling — 20%:
Medallion/Multi-hop Architecture
Bronze, Silver, Gold Layer of Medallion Architecture
Slowly Changing Dimensions (SCD)
Constraints
Lookup Tables
4. Security and Governance — 10%:
Dynamic Views
Propagating Deletes
Managing clusters and jobs permissions with ACLs
Creating row- and column-oriented dynamic views to control user/group access
Securely delete data as requested according to GDPR & CCPA
Unity Catalog
5. Monitoring and Logging — 10%:
Managing Cluster
Recording logged metrics
Debugging errors
6. Testing and Deployment — 10%:
Data Pipeline Testing
Relative Import
Scheduling Jobs
Orchestration Jobs
7. Performance Tuning:
Partitioning Delta Lake Tables
Delta Lake Transaction Log
Auto Optimize Feature
Resources I followed
Very Important Note:Before proving you with resources, let me mention that these resources are provided to you keeping in mind that you have good understanding of all the databricks concepts required to crackAssociate Certification:
No comments:
Post a Comment