I have a simple job scheduled every 5 minutes. Basically it listens for cloud files on a storage account and writes them into a Delta table, extremely simple. The code is something like this:

df = (spark
    .readStream
    .format("cloudFiles")
    .option('cloudFil...
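
For context, a minimal sketch of what such an Auto Loader job typically looks like; the original snippet is truncated, so the input format, paths, checkpoint location, trigger, and target table below are assumptions rather than the actual code:

df = (spark
    .readStream
    .format("cloudFiles")
    .option("cloudFiles.format", "json")                                     # assumed input format
    .option("cloudFiles.schemaLocation", "/mnt/checkpoints/events/schema")   # hypothetical path
    .load("abfss://landing@mystorage.dfs.core.windows.net/events/"))         # hypothetical source

query = (df.writeStream
    .format("delta")
    .option("checkpointLocation", "/mnt/checkpoints/events")                 # hypothetical checkpoint
    .trigger(availableNow=True)                                              # process new files, then stop
    .toTable("bronze.events"))                                               # hypothetical target table

query.awaitTermination()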
Greetings, I have a similar problem. Did you try using Databricks Workflows instead, scheduling the job there rather than in Data Factory? Inside Workflows it is possible to select a specific branch, so it may actually work. What do you think?
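
To make that suggestion concrete, here is a rough sketch of creating such a Workflows job through the Jobs API 2.1 and pinning it to a branch via git_source; the workspace URL, repository, notebook path, cluster id, and branch name are all placeholders, not anything from the original posts:

import requests

host = "https://<your-workspace>.azuredatabricks.net"    # placeholder workspace URL
token = "<personal-access-token>"                         # placeholder token

payload = {
    "name": "cloudfiles-ingest",
    "git_source": {
        "git_url": "https://github.com/<org>/<repo>",     # placeholder repository
        "git_provider": "gitHub",
        "git_branch": "feature/my-branch"                 # the specific branch to run
    },
    "tasks": [{
        "task_key": "ingest",
        "notebook_task": {
            "notebook_path": "notebooks/ingest_cloudfiles",   # path inside the repo
            "source": "GIT"
        },
        "existing_cluster_id": "<cluster-id>"             # placeholder cluster
    }],
    "schedule": {
        "quartz_cron_expression": "0 0/5 * * * ?",        # every 5 minutes
        "timezone_id": "UTC"
    }
}

resp = requests.post(f"{host}/api/2.1/jobs/create",
                     headers={"Authorization": f"Bearer {token}"},
                     json=payload)
resp.raise_for_status()
print(resp.json())    # returns the new job_id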
I resolved it by using .option('cloudFiles.useIncrementalListing', 'false'). Now, if I understand correctly, RocksDB reads the whole list of files instead of its mini "checkpoints" based on filenames and timestamps. My guess is: my json filenames are comp...
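
For anyone landing here later, a sketch of where that option goes in the read; only the useIncrementalListing setting is the actual fix described above, while the format, schema location, and source path are placeholders:

df = (spark
    .readStream
    .format("cloudFiles")
    .option("cloudFiles.format", "json")
    .option("cloudFiles.useIncrementalListing", "false")   # force a full directory listing on each run
    .option("cloudFiles.schemaLocation", "/mnt/checkpoints/events/schema")   # hypothetical path
    .load("abfss://landing@mystorage.dfs.core.windows.net/events/"))         # hypothetical source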