cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Does Databricks Runtime 7.3+ include built-in Hadoop S3 connector configurations?

Divya_Bhadauria
New Contributor III

I came across the KB article S3 connection reset error, which mentions not using the following Spark settings for the Hadoop S3 connector for DBR 7.3 and above:

spark.hadoop.fs.s3.impl com.databricks.s3a.S3AFileSystem
spark.hadoop.fs.s3n.impl com.databricks.s3a.S3AFileSystem
spark.hadoop.fs.s3a.impl com.databricks.s3a.S3AFileSystem

My question is: for Databricks Runtime 7.3 and above, are these configurations already built in by default ?

 

1 REPLY 1

hasnat_unifeye
New Contributor

No, you don’t need to set those on DBR 7.3 and above.
From 7.3+ Databricks already uses the newer Hadoop S3A connector by default, so those com.databricks.s3a.S3AFileSystem settings are not part of the default config and shouldn’t be added.

If 
they are present I would assume it has been added manually, and removing it should cause no issues