Topics with Label: Schema evolution

Forum Posts

Sorted by:

by SRK • Contributor III

10-01-2022 3:15:10 AM

5627 Views
5 replies
7 kudos

How to handle schema validation for Json file. Using Databricks Autoloader?

Following are the details of the requirement:1. I am using databricks notebook to read data from Kafka topic and writing into ADLS Gen2 container i.e., my landing layer.2. I am using Spark code to read data from Kafka and write into landing...

Data Engineering

5627 Views
5 replies
7 kudos

10-01-2022 3:15:10 AM

View Replies

Latest Reply

maddy08
New Contributor II

10-24-2024 10:01:27 PM

7 kudos

just to clarify, are you reading kafka and writing into adls in json files? like for each message from kafka is 1 json file in adls ?

7 kudos

10-24-2024 10:01:27 PM

4 More Replies

by MRTN • New Contributor III

04-04-2023 12:22:03 PM

13690 Views
4 replies
3 kudos

Load CSV files with slightly different schemas

I have a set of CSV files generated by a system, where the schema has evolved over the years. Some columns have been added, and at least one column has been renamed in newer files. Is there any way to elegantly load these files into a dataframe? I ha...

Data Engineering

13690 Views
4 replies
3 kudos

04-04-2023 12:22:03 PM

View Replies

Latest Reply

MRTN
New Contributor III

04-12-2023 1:08:17 AM

3 kudos

For reference - for anybody struggling with the same issues. All online examples using auto loader are written as one block statement on the form: (spark.readStream.format("cloudFiles") .option("cloudFiles.format", "csv") # The schema location di...

3 kudos

04-12-2023 1:08:17 AM

3 More Replies

by Chris_Konsur • New Contributor III

11-10-2022 3:20:52 PM

1577 Views
0 replies
2 kudos

Schema supported by Autoloader

We do not want to use schema inference with schema evolution in Autoloader. Instead, we want to apply our schema and use the merge option. Our schema is very complex, with multiple nested following levels. When I apply this schema to Autoloader, it r...

Data Engineering

1577 Views
0 replies
2 kudos

11-10-2022 3:20:52 PM

by noimeta • Contributor III

07-28-2022 4:56:56 AM

6023 Views
4 replies
1 kudos

Apply change data with delete and schema evolution

Hi,Currently, I'm using structure streaming to insert/update/delete to a table. A row will be deleted if value in 'Operation' column is 'deleted'. Everything seems to work fine until there's a new column.Since I don't need 'Operation' column in the t...

Data Engineering

6023 Views
4 replies
1 kudos

07-28-2022 4:56:56 AM

View Replies

Latest Reply

User16753725469
Contributor II

09-01-2022 12:33:09 AM

1 kudos

please go through this documentation https://docs.delta.io/latest/api/python/index.html

1 kudos

09-01-2022 12:33:09 AM

3 More Replies

by fshimamoto • New Contributor III

06-29-2022 12:59:26 PM

3769 Views
3 replies
2 kudos

What are the best practices for schema drift using Delta Live tables, in a scenario where the main source is a no sql database and we have a lot of ch...

What are the best practices for schema drift using Delta Live tables, in a scenario where the main source is a no sql database and we have a lot of changes in the schema?

Data Engineering

3769 Views
3 replies
2 kudos

06-29-2022 12:59:26 PM

View Replies

Latest Reply

Vartika
Databricks Employee

08-30-2022 8:28:51 AM

2 kudos

Hey there @Fernando Martin Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from...

2 kudos

08-30-2022 8:28:51 AM

2 More Replies

by ilarsen • Contributor

08-21-2022 5:18:12 PM

1307 Views
0 replies
1 kudos

Trouble referencing a column that has been added by schema evolution (Auto Loader with Delta Live Tables)

Hi,I have a Delta Live Tables pipeline, using Auto Loader, to ingest from JSON files. I need to do some transformations - in this case, converting timestamps. Except one of the timestamp columns does not exist in every file. This is causing the DLT p...

Data Engineering

1307 Views
0 replies
1 kudos

08-21-2022 5:18:12 PM

by Gapy • New Contributor II

10-31-2021 6:43:09 AM

2022 Views
1 replies
1 kudos

Auto Loader Schema-Inference and Evolution for parquet files

Dear all,will (and when) will Auto Loader also support Schema-Inference and Evolution for parquet files, at this point it is only for JSON and CSV supported if i am not mistaken?Thanks and regards,Gapy

Data Engineering

2022 Views
1 replies
1 kudos

10-31-2021 6:43:09 AM

View Replies

Latest Reply

Sandeep
Contributor III

11-10-2021 7:46:01 AM

1 kudos

@Gasper Zerak , This will be available in near future (DBR 10.3 or later). Unfortunately, we don't have an SLA at this moment.

1 kudos

11-10-2021 7:46:01 AM

by saninanda • New Contributor II

09-23-2019 11:48:33 PM

16525 Views
7 replies
0 kudos

how to read schema from text file stored in cloud storage

I have file a.csv or a.parquet while creating data frame reading we can explictly define schema with struct type. instead of write the schema in the notebook want to create schema lets say for all my csv i have one schema like csv_schema and stored ...

Data Engineering

16525 Views
7 replies
0 kudos

09-23-2019 11:48:33 PM

View Replies

Latest Reply

Nakeman
New Contributor II

05-14-2021 2:28:39 AM

0 kudos

@shyampsr big thanks, was searching for the solution almost 3 hours _https://luckycanadian.com/

0 kudos

05-14-2021 2:28:39 AM

6 More Replies