@Jonathan_ Good that you have given some extra information. Based on that, I think there might be a memory issue: since it is a single-node cluster, both the driver and the executor reside on the same machine and share its memory. It would be better if you could tell me the R...
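A quick way to see how that shared memory is currently split is to inspect the Spark configuration. A minimal sketch, assuming a plain PySpark session; the two property names are standard Spark settings, nothing specific to your cluster:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# On a single-node cluster both settings carve memory out of the same
# machine, so an oversized executor can starve the driver (and vice versa).
for key in ("spark.driver.memory", "spark.executor.memory"):
    # conf.get returns the fallback when the key was never set explicitly
    print(key, "=", spark.conf.get(key, "not set (using default)"))
```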
@bunny1174 It is a common issue that small files get created during streaming. Since you are using the Delta file format, I would suggest two solutions:
1. Try using liquid clustering. This auto-compacts small files into a bigger chunk, mostly of 1...
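A minimal sketch of what enabling liquid clustering and compacting a Delta table can look like (requires a recent Delta/Databricks runtime); the table name `events` and clustering column `event_date` are placeholders, not from the original thread:

```python
# Hypothetical table and column names; adjust to your schema.
# CLUSTER BY switches the Delta table to liquid clustering.
spark.sql("ALTER TABLE events CLUSTER BY (event_date)")

# OPTIMIZE rewrites and compacts the small files; with liquid clustering
# enabled it also incrementally clusters the data by the chosen column.
spark.sql("OPTIMIZE events")
```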
@szymon_dybczak your solution was crisp. @SuMiT1 since you have mentioned your JSON is dynamic, get one of your JSON bodies into a variable:
json_body = df.select("content").take(1)[0][0]
Then get the schema of the JSON:
schema = schema_of_json(json_...
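Putting the pieces together, a minimal sketch of the whole pattern, assuming a DataFrame `df` with the raw JSON strings in a `content` column (the column name comes from the snippet above; everything else is illustrative):

```python
from pyspark.sql.functions import schema_of_json, from_json, col

# Grab one sample JSON body as a plain Python string.
# take(1) already returns a list of Rows, so index into it; there is
# no need to call collect() on the result.
json_body = df.select("content").take(1)[0][0]

# Derive the schema from the sample; schema_of_json accepts a literal
# string and yields the schema as a DDL-formatted string once evaluated.
schema = df.select(schema_of_json(json_body).alias("s")).take(1)[0][0]

# Parse every row's JSON using the inferred schema.
parsed = df.withColumn("parsed", from_json(col("content"), schema))
parsed.select("parsed.*").show(truncate=False)
```

One caveat: inferring from a single sample only captures the fields present in that row, so for truly dynamic JSON you may want to sample several rows and merge the resulting schemas.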
@Hritik_Moon Try to read the file as Delta. The directory layout looks like:
path/delta_file_name/
- parquet files
- _delta_log/
Since you are using Spark, use this: spark.read.format("delta").load("path/delta_file_name"). Delta internally stores the data as parquet, and the delta log contain...
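For completeness, a minimal sketch of reading the table and peeking at the transaction log's history; the path is the placeholder from the reply above, and `DeltaTable` assumes the delta-spark package is installed:

```python
from delta.tables import DeltaTable

# Read via the Delta format (not the bare parquet files) so the
# transaction log in _delta_log/ is honored and you get a consistent snapshot.
df = spark.read.format("delta").load("path/delta_file_name")
df.show(5)

# The same log powers table history and time travel.
dt = DeltaTable.forPath(spark, "path/delta_file_name")
dt.history().select("version", "timestamp", "operation").show()
```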