by Nazar • New Contributor II
- 5640 Views
- 3 replies
- 4 kudos
Hi All, I have a daily Spark job that reads and joins 3-4 source tables and writes the df in Parquet format. This data frame consists of 100+ columns. As this job runs daily, our deduplication logic identifies the latest record from each of source t...
- 1612 Views
- 0 replies
- 0 kudos
I'm using the Databricks Auto Loader to incrementally load a series of CSV files on S3 which I update with an API. My typical work process is to update only the latest year file each night. But there are occasions where previous years also get update...
- 5102 Views
- 3 replies
- 0 kudos
Hi Everyone,
I am trying to implement a way in Python to only read files that weren't loaded since the last run of my notebook. The way I am thinking of implementing this is to keep track of the last time my notebook finished in a database table. Nex...
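For what it's worth, here is a minimal sketch of the timestamp idea described in this post, assuming the files live on a filesystem where modification times can be trusted. The names `files_modified_since` and `last_run_ts` are hypothetical, and reading or writing the last-run time in a database table is left out; the sketch only shows the filtering step, not the poster's actual setup.

```python
import os
import tempfile
import time
from pathlib import Path


def files_modified_since(directory, last_run_ts):
    """Return files under `directory` modified after `last_run_ts` (epoch seconds), sorted by path."""
    return sorted(
        p for p in Path(directory).rglob("*")
        if p.is_file() and p.stat().st_mtime > last_run_ts
    )


if __name__ == "__main__":
    # Demo in a throwaway directory; in practice `last_run` would come from
    # wherever the previous run's finish time is stored (assumption).
    with tempfile.TemporaryDirectory() as tmp:
        old_file = Path(tmp) / "old.csv"
        new_file = Path(tmp) / "new.csv"
        old_file.write_text("a\n")
        new_file.write_text("b\n")

        last_run = time.time()
        # Force the modification times so one file predates the last run
        # and the other follows it.
        os.utime(old_file, (last_run - 3600, last_run - 3600))
        os.utime(new_file, (last_run + 60, last_run + 60))

        print([p.name for p in files_modified_since(tmp, last_run)])
```

One caveat with this approach: a file that is still being written when the run finishes, or one whose timestamp is set in the past, can be missed, so recording the timestamp at the start of the run rather than the end may be safer.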