I have recently been studying for the Databricks Spark Developer Associate exam through the Databricks Academy and have come across an issue that needs to be addressed.
The course in question is Apache Spark Programming with Databricks.
I am working through the course notebooks that Databricks provides, in my own Databricks workspace, and I run into the issue when I execute the first cell (%run ./Includes/Classroom-Setup-SQL). Digging into what that cell references, I found that it executes the following:
spark.sql("""CREATE TABLE IF NOT EXISTS events USING parquet OPTIONS (path "{}")""".format(eventsPath))
spark.sql("""CREATE TABLE IF NOT EXISTS sales USING parquet OPTIONS (path "{}")""".format(salesPath))
spark.sql("""CREATE TABLE IF NOT EXISTS users USING parquet OPTIONS (path "{}")""".format(usersPath))
spark.sql("""CREATE TABLE IF NOT EXISTS products USING parquet OPTIONS (path "{}")""".format(productsPath))
displayHTML("")
The path variables (eventsPath, salesPath, and so on) are defined as follows:
module_name = "spark-programming"
spark.conf.set("com.databricks.training.module-name", module_name)
salesPath = "/mnt/training/ecommerce/sales/sales.parquet"
usersPath = "/mnt/training/ecommerce/users/users.parquet"
eventsPath = "/mnt/training/ecommerce/events/events.parquet"
productsPath = "/mnt/training/ecommerce/products/products.parquet"
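A quick check that might help narrow this down, assuming the problem is that the /mnt/training mount or the Parquet files under it are missing (I have not confirmed that this is the cause). The names find_missing_paths and dataset_paths are made up for illustration; in a Databricks notebook, the callable passed in would be dbutils.fs.ls, which raises an exception when a path does not exist.

```python
def find_missing_paths(paths, list_path):
    """Return the dataset names whose path cannot be listed.

    list_path is any callable that raises an exception when the path
    does not exist. In a Databricks notebook that would be dbutils.fs.ls.
    """
    missing = []
    for name, path in paths.items():
        try:
            list_path(path)
        except Exception:
            # Any failure to list the path is treated as "missing" here.
            missing.append(name)
    return missing


# Same paths as in the setup script above.
dataset_paths = {
    "sales": "/mnt/training/ecommerce/sales/sales.parquet",
    "users": "/mnt/training/ecommerce/users/users.parquet",
    "events": "/mnt/training/ecommerce/events/events.parquet",
    "products": "/mnt/training/ecommerce/products/products.parquet",
}

# In a notebook cell (assumption: dbutils is available there):
# print(find_missing_paths(dataset_paths, dbutils.fs.ls))
```

If every name comes back as missing, the mount itself is probably not set up; if only some do, the individual files are the more likely culprit.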
Another cell also runs and references a long script in Student-Environment, which I've attached to this post below.
While researching this issue, I found another post here on the Community that pointed out the same problem back in April. The solution then was to download an updated version of the same .dbc file from a GitHub repo.
However, Databricks has since removed that repo, stating the following:
The courseware materials for this course are no longer available through GitHub.
To access the current body of courseware, please sign in to Databricks Academy using one of the following three options:
- Customer Academy
- Partner Academy
- Employee Academy
I have submitted a ticket to training-support@databricks.com and have yet to receive a response. The ticket number, for anyone's reference, is #00343266. I do not know how urgent this is on their end; I am posting here in the hope that it gets some traction. I'm fairly new to Databricks and want to continue my studies for the Spark Developer exam, and I can find little to no third-party material that prepares you for this exam other than the Databricks Academy's.
I hope that Databricks can fix this issue quickly, as this is the main course for the exam.