Data Engineering

Forum Posts

Sorted by:

by Christine • Contributor II

05-24-2022 11:42:57 PM

9463 Views
9 replies
5 kudos

Resolved! pyspark dataframe empties after it has been saved to delta lake.

Hi, I am facing a problem that I hope to get some help to understand. I have created a function that is supposed to check if the input data already exist in a saved delta table and if not, it should create some calculations and append the new data to...

Data Engineering

9463 Views
9 replies
5 kudos

05-24-2022 11:42:57 PM

View Replies

Latest Reply

SharathE
New Contributor III

09-23-2023 11:04:59 AM

5 kudos

Hi,im also having similar issue ..does creating temp view and reading it again after saving to a table works?? /

5 kudos

09-23-2023 11:04:59 AM

8 More Replies

by shamly • New Contributor III

11-29-2022 11:41:24 AM

6461 Views
9 replies
2 kudos

Resolved! need to remove doubledagger delimiter from a csv using databricks

My csv data looks like this‡‡companyId‡‡,‡‡empId‡‡,‡‡regionId‡‡,‡‡companyVersion‡‡,‡‡Question‡‡I tried this codedff = spark.read.option("header", "true").option("inferSchema", "true").option("delimiter", "‡,").csv(f"/mnt/data/path/datafile.csv")But I...

Data Engineering

6461 Views
9 replies
2 kudos

11-29-2022 11:41:24 AM

View Replies

Latest Reply

UmaMahesh1
Honored Contributor III

11-29-2022 12:30:49 PM

2 kudos

Hi @shamly pt I took a bit another approach since I guess no one would be sure of the the encoding of the data you showed. Sample data I took :‡‡companyId‡‡,‡‡empId‡‡,‡‡regionId‡‡,‡‡companyVersion‡‡,‡‡Question‡‡‡‡1‡‡,‡‡121212‡‡,‡‡R‡‡,‡‡1.0A‡‡,‡‡NA‡‡...