Data Engineering

Forum Posts

Sorted by:

by Nazar • New Contributor II

09-23-2021 3:06:15 PM

8620 Views
3 replies
4 kudos

Resolved! Incremental write

Hi All,I have a daily spark job that reads and joins 3-4 source tables and writes the df in a parquet format. This data frame consists of 100+ columns. As this job run daily, our deduplication logic identifies the latest record from each of source t...

Data Engineering

8620 Views
3 replies
4 kudos

09-23-2021 3:06:15 PM

View Replies

Latest Reply

Nazar
New Contributor II

09-27-2021 2:55:33 PM

4 kudos

Thanks werners

4 kudos

09-27-2021 2:55:33 PM

2 More Replies

by lprevost • Contributor III

08-09-2021 8:46:26 AM

2579 Views
0 replies
0 kudos

Incremental updates to s3 csv files, autoloader, and delta lake updates

I'm using the Databricks autoloader to incrementally load a series of csv files on s3 which I update with an API. My tyipcal work process is to update only the latest year file each night. But, there are ocassions where previous years also get update...

Data Engineering

2579 Views
0 replies
0 kudos

08-09-2021 8:46:26 AM

by AlaQabaja • New Contributor II

09-19-2019 10:02:46 AM

8155 Views
3 replies
0 kudos

Get last modified date or create date for azure blob container

Hi Everyone, I am trying to implement a way in Python to only read files that weren't loaded since the last run of my notebook. The way I am thinking of implementing this is to keep of the last time my notebook has finished in a database table. Nex...

Data Engineering

8155 Views
3 replies
0 kudos

09-19-2019 10:02:46 AM

View Replies

Latest Reply

Forum_Admin
Databricks Employee

03-18-2020 5:25:37 AM

0 kudos

Hello! I just wanted to share my point of view on the topic of dating sites. I have been looking for a decent Asian catch-up site for a very long time, in addition to them I found https://hookupsearch.org/asian-hookup-sites/. We definitely recommend...

0 kudos

03-18-2020 5:25:37 AM

2 More Replies

Databricks Community

Resolved! Incremental write

Incremental updates to s3 csv files, autoloader, and delta lake updates

Get last modified date or create date for azure blob container