cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Getting errors while reading data from URL

sharma_kamal
New Contributor III

I'm encountering some issues while trying to read a public dataset from a URL using Databricks. Here's the code snippet(along with errors) I'm working with:

sharma_kamal_1-1710132330915.png

 

I'm confused about Delta format error here.

  • When I read data from a URL, how would it have a Delta log associated with it? Delta logs seem relevant for data stored in Databricks, not external URLs.
  • Why is Databricks suggesting Delta format for this scenario?

I'm a bit stuck here. Any pointers or advice, please?

 

 

 

1 ACCEPTED SOLUTION

Accepted Solutions

MuthuLakshmi
New Contributor III
New Contributor III

@sharma_kamal 

Please disable the formatCheck in notebook and check if you could read the data

The configuration command %sql SET spark.databricks.delta.formatCheck.enabled=false will disable the format check for Delta tables in Databricks.

Databricks delta format check is a feature that validates whether the format of the source files is one of CSV, JSON, AVRO, ORC, PARQUET, TEXT, or BINARYFILE and if that meets that condition it will raise exception that you are facing.

This format check is because to ensure only delta format must be loaded for any operation.

If the format check is disabled, Databricks will not perform this validation and bypass this format check.

View solution in original post

2 REPLIES 2

MuthuLakshmi
New Contributor III
New Contributor III

@sharma_kamal 

Please disable the formatCheck in notebook and check if you could read the data

The configuration command %sql SET spark.databricks.delta.formatCheck.enabled=false will disable the format check for Delta tables in Databricks.

Databricks delta format check is a feature that validates whether the format of the source files is one of CSV, JSON, AVRO, ORC, PARQUET, TEXT, or BINARYFILE and if that meets that condition it will raise exception that you are facing.

This format check is because to ensure only delta format must be loaded for any operation.

If the format check is disabled, Databricks will not perform this validation and bypass this format check.

Thank you @MuthuLakshmi . It helped. 

Join 100K+ Data Experts: Register Now & Grow with Us!

Excited to expand your horizons with us? Click here to Register and begin your journey to success!

Already a member? Login and join your local regional user group! If there isn’t one near you, fill out this form and we’ll create one for you to join!