cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
cancel
Showing results for 
Search instead for 
Did you mean: 

Delta Live Table & Autoloader adding a non-existent column

YSF
New Contributor III

I'm trying to setup autoloader to read some csv files. I tried with both autoloader with the DLT decorator as well as just autoloader by itself. The first column of the data is called "run_id", when I do a spark.read.csv() directly on the file it comes in fine. When I use autoloader then it seems to see the first column as a curly brace "{" can't find anything online what that is or where it's coming from.

Here's a sample of the autoloader call:

df = (spark.readStream.format("cloudFiles")
      .option("cloudFiles.format", "csv")
      .option("cloudFiles.includeExistingFiles","true")
      .option("cloudFiles.schemaLocation", "/dbfs/schema_registry/")
      .load("/dbfs/mnt/folder/data_20230519.csv"))

Anyone know what's going on?

1 REPLY 1

Rishabh264
Honored Contributor II

can you attach the exact output so that I can have a look on that .

Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.