cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Does Databricks Certified Associate Developer for Apache Spark 3.0 in Python assess knowledge in Spark Streaming, ML, GraphX, RDD and UDF? May I get a link to the syllabus for this exam? Thank you!

Personal1
New Contributor
1 ACCEPTED SOLUTION

Accepted Solutions

Kaniz
Community Manager
Community Manager

Hi @Abhishek Pradhanโ€‹ , The Apache Spark Associate Developer is applied for Data Engineer and Data Scientist learning paths. This exam will assess you in Spark architecture and in the use of Spark DataFram API to manipulate data.

What is covered by the exam? ๐Ÿ“‹

Although the exam covers data manipulation, the SQL language is not assessed. All questions related to data manipulation will be asked to solve using Spark DataFrame API. Spark Streaming is another topic that the exam doesnโ€™t cover.

 The exam questions are distributed into three categories:

Spark Architecture โ€” Conceptual

  • Cluster architecture: nodes, drivers, workers, executors, slots, etc.
  • Spark execution hierarchy: applications, jobs, stages, tasks, etc.
  • Shuffling
  • Partitioning
  • Lazy evaluation
  • Transformations vs Actions
  • Narrow vs Wide transformations

Spark Architecture โ€” Applied

  • Execution deployment modes
  • Stability
  • Storage levels
  • Repartitioning
  • Coalescing
  • Broadcasting
  • DataFrames

Spark DataFrame API

  • Subsetting DataFrames (select, filter, etc.)
  • Column manipulation (casting, creating columns, manipulating existing columns, complex column types)
  • String manipulation (Splitting strings, regex)
  • Performance-based operations (repartitioning, shuffle partitions, caching)
  • Combining DataFrames (joins, broadcasting, unions, etc)
  • Reading/writing DataFrames (schemas, overwriting)
  • Working with dates (extraction, formatting, etc)
  • Aggregations
  • Miscellaneous (sorting, missing values, typed UDFs, value extraction, sampling)

Source

View solution in original post

2 REPLIES 2

Kaniz
Community Manager
Community Manager

Hi @Abhishek Pradhanโ€‹ ! My name is Kaniz, and I'm the technical moderator here. Great to meet you, and thanks for your question! Let's see if your peers in the community have an answer to your question first. Or else I will get back to you soon. Thanks.

Kaniz
Community Manager
Community Manager

Hi @Abhishek Pradhanโ€‹ , The Apache Spark Associate Developer is applied for Data Engineer and Data Scientist learning paths. This exam will assess you in Spark architecture and in the use of Spark DataFram API to manipulate data.

What is covered by the exam? ๐Ÿ“‹

Although the exam covers data manipulation, the SQL language is not assessed. All questions related to data manipulation will be asked to solve using Spark DataFrame API. Spark Streaming is another topic that the exam doesnโ€™t cover.

 The exam questions are distributed into three categories:

Spark Architecture โ€” Conceptual

  • Cluster architecture: nodes, drivers, workers, executors, slots, etc.
  • Spark execution hierarchy: applications, jobs, stages, tasks, etc.
  • Shuffling
  • Partitioning
  • Lazy evaluation
  • Transformations vs Actions
  • Narrow vs Wide transformations

Spark Architecture โ€” Applied

  • Execution deployment modes
  • Stability
  • Storage levels
  • Repartitioning
  • Coalescing
  • Broadcasting
  • DataFrames

Spark DataFrame API

  • Subsetting DataFrames (select, filter, etc.)
  • Column manipulation (casting, creating columns, manipulating existing columns, complex column types)
  • String manipulation (Splitting strings, regex)
  • Performance-based operations (repartitioning, shuffle partitions, caching)
  • Combining DataFrames (joins, broadcasting, unions, etc)
  • Reading/writing DataFrames (schemas, overwriting)
  • Working with dates (extraction, formatting, etc)
  • Aggregations
  • Miscellaneous (sorting, missing values, typed UDFs, value extraction, sampling)

Source

Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.