Can we use Databricks, or write code in Databricks, for ETL and data engineering without learning PySpark in depth?

KVNARK
Honored Contributor II

Can we use Databricks, or write code in Databricks, for ETL and data engineering work without learning PySpark in depth? Can someone shed some light on this? I am currently learning PySpark (the basics of Python for handling data) and also learning Databricks SQL through the free training provided on the Databricks portal itself. Any guidance would be much appreciated.

Ajay-Pandey
Databricks MVP

You can use Spark SQL to build your ETL.

Ajay Kumar Pandey

KVNARK
Honored Contributor II

Databricks SQL doesn't require any prior knowledge of PySpark, right? But to create ETL pipelines, we may need to learn PySpark. Is that correct?

If you want, you can use SQL alone to build a pipeline, but knowing PySpark gives you additional advantages.
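To give a rough idea of what "SQL alone" can look like, here is a minimal sketch of one ETL step where the whole transform is a single SQL statement. It runs locally on Python's built-in `sqlite3` purely as a stand-in for Spark SQL, so it is not Databricks code; the table names (`raw_orders`, `customer_totals`), the sample rows, and the `run_pipeline` helper are all made up for illustration. On Databricks, a statement like this would more likely go into a SQL notebook cell or be passed to `spark.sql(...)`, and the exact syntax may need adjusting.

```python
import sqlite3

# Hypothetical raw data: (order_id, customer, amount, status)
RAW_ORDERS = [
    (1, "alice", 120.0, "complete"),
    (2, "bob", 75.5, "cancelled"),
    (3, "alice", 30.0, "complete"),
    (4, "carol", 210.0, "complete"),
]

# The transform is plain SQL; Python below only submits it.
TRANSFORM_SQL = """
CREATE TABLE customer_totals AS
SELECT customer,
       COUNT(*)    AS order_count,
       SUM(amount) AS total_amount
FROM raw_orders
WHERE status = 'complete'
GROUP BY customer
"""


def run_pipeline(rows):
    """Load raw rows, apply the SQL transform, return the result rows."""
    conn = sqlite3.connect(":memory:")  # in-memory stand-in for a warehouse
    try:
        # Extract/load: land the raw rows in a staging table.
        conn.execute(
            "CREATE TABLE raw_orders "
            "(order_id INTEGER, customer TEXT, amount REAL, status TEXT)"
        )
        conn.executemany("INSERT INTO raw_orders VALUES (?, ?, ?, ?)", rows)
        # Transform: filter and aggregate entirely in SQL.
        conn.execute(TRANSFORM_SQL)
        # Read back the curated table in a stable order.
        return conn.execute(
            "SELECT customer, order_count, total_amount "
            "FROM customer_totals ORDER BY customer"
        ).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for row in run_pipeline(RAW_ORDERS):
        print(row)
```

The point of the sketch is that the filtering and aggregation logic lives entirely in the SQL string. PySpark tends to become more useful once you need things that are awkward in SQL, such as reusable functions, loops over many tables, or custom Python logic.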

Ajay Kumar Pandey


KVNARK
Honored Contributor II

Thank you, @Ajay Pandey.

AmarK
Databricks Employee

@KVNARK, could you please select @Ajay Pandey's answer as the best answer if it helped you?

Anonymous
Not applicable

You can also learn the Scala API if you don't want to learn Python, although Python is a valuable skill to have. Either way, you can use SQL or Python for most applications. Some features, such as AutoML, don't require either.

NhatHoang
Valued Contributor II

You can use Scala or R (with SparkR) if you do not want to use Python. :)

KVNARK
Honored Contributor II

Thanks, all, for your valuable suggestions!