Machine Learning
Forum Posts

js54123875
by New Contributor III
  • 1764 Views
  • 4 replies
  • 3 kudos

Resolved! How to enforce schema with Autoloader?

I have a number of csv files that I am working to ingest using Autoloader. There is an ID field that I want to require to be a STRING, but using schemaHints is not working and is instead setting it as an INT. The first few csv files have just integer va...
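For reference, a minimal sketch of the cloudFiles.schemaHints option being discussed; the paths and column name below are hypothetical placeholders, and this is illustrative rather than the thread's accepted answer:

```python
# Auto Loader read with a schema hint forcing the id column to STRING.
df = (
    spark.readStream.format("cloudFiles")
    .option("cloudFiles.format", "csv")
    .option("header", "true")
    .option("cloudFiles.schemaLocation", "/mnt/schemas/my_table")  # hypothetical schema tracking path
    .option("cloudFiles.schemaHints", "id STRING")                 # keep id as STRING even if early files look numeric
    .load("/mnt/raw/my_table")                                     # hypothetical source path
)
```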

Latest Reply
Anonymous
Not applicable
  • 3 kudos

Hi @Jennette Shepard, we haven't heard from you since the last response from @Suteja Kanuri. Kindly share the information with us, and in return, we will provide you with the necessary solution. Thanks and regards

Thanapat_S
by Contributor
  • 1325 Views
  • 2 replies
  • 5 kudos

Resolved! Is it possible to use both `Dynamic partition overwrites` and `overwriteSchema` options when writing a DataFrame to a Delta table?

In my ETL case, I want to be able to adjust the table schema as needed, meaning the number of columns may increase or decrease depending on the ETL script. Additionally, I would like to use dynamic partition overwrite to avoid potential errors when u...
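As a sketch of the combination being asked about (the table, partition column, and paths are hypothetical, and whether Delta accepts both options on the same write is exactly what this thread resolves):

```python
# Overwrite only the partitions present in df, while also allowing the schema to change.
(
    df.write.format("delta")
    .mode("overwrite")
    .option("partitionOverwriteMode", "dynamic")  # dynamic partition overwrite
    .option("overwriteSchema", "true")            # allow columns to be added or removed
    .partitionBy("event_date")                    # hypothetical partition column
    .saveAsTable("etl.target_table")              # hypothetical table name
)
```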

Latest Reply
Vartika
Moderator
  • 5 kudos

Hi @Thanapat Sontayasara, does @Werner Stinckens's response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly? If not, would you be happy to give us more information? Thanks!

weldermartins
by Honored Contributor
  • 5814 Views
  • 17 replies
  • 13 kudos

Resolved! Create a nested struct schema in Spark - Jira schema

Hello guys, I'm using the Jira API to return "ISSUES". But to be able to use PySpark I need to create the DataFrame, passing in the schema. But I am not able to create the schema based on the model below. Would you have any ideas? root |-- expand: string ...
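Since the schema in the post is truncated, here is only an illustrative sketch of how a nested Jira-style payload can be expressed as a StructType; apart from the expand field, every field name and the path below are hypothetical:

```python
from pyspark.sql.types import StructType, StructField, StringType, ArrayType

# Nested objects map to nested StructTypes; repeated objects map to ArrayType(StructType(...)).
jira_schema = StructType([
    StructField("expand", StringType(), True),
    StructField("issues", ArrayType(StructType([          # hypothetical nested block
        StructField("key", StringType(), True),
        StructField("fields", StructType([
            StructField("summary", StringType(), True),
            StructField("status", StructType([
                StructField("name", StringType(), True),
            ]), True),
        ]), True),
    ])), True),
])

# Apply the schema when reading the API response saved as JSON.
df = spark.read.schema(jira_schema).json("/mnt/raw/jira_issues.json")  # hypothetical path
```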

Latest Reply
-werners-
Esteemed Contributor III
  • 13 kudos

If columns are missing, that particular data is not present in the JSON. I am not aware of Spark skipping columns when reading JSON with inferSchema. There is an option dropFieldIfAllNull, but that is false by default. That makes me think: you might ...
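A short sketch of the dropFieldIfAllNull option mentioned in the reply (the path is hypothetical):

```python
# With dropFieldIfAllNull=true, fields that are null or empty in every record are
# dropped during schema inference; the default is false.
df = (
    spark.read
    .option("dropFieldIfAllNull", "true")
    .json("/mnt/raw/jira_issues.json")   # hypothetical path
)
df.printSchema()
```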

vaver_3
by New Contributor III
  • 8937 Views
  • 1 reply
  • 5 kudos

Resolved! Ingest a .csv file with spaces in column names using Delta Live into a streaming table

How do I ingest a .csv file with spaces in column names using Delta Live into a streaming table? All of the fields should be read using the default behavior for .csv files with the DLT Autoloader, i.e. as strings. Running the pipeline gives me an error about in...

Latest Reply
vaver_3
New Contributor III
  • 5 kudos

After additional googling on "withColumnRenamed", I was able to replace all spaces in column names with "_" all at once by using select and alias instead: @dlt.view( comment="" ) def vw_raw(): return ( spark.readStream.format("cloudF...
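Since the code in the reply is cut off, here is a sketch of the select/alias approach it describes, with a hypothetical source path and view name; each column keeps Auto Loader's default STRING type and only the spaces in the names are replaced:

```python
import dlt
from pyspark.sql import functions as F

@dlt.view(comment="Raw CSV with spaces in column names replaced by underscores")
def vw_raw():
    df = (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "csv")
        .option("header", "true")
        .load("/mnt/raw/input")          # hypothetical source path
    )
    # Rename every column at once, e.g. "Order Date" -> "Order_Date".
    return df.select([F.col(c).alias(c.replace(" ", "_")) for c in df.columns])
```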
