cancel
Showing results for 
Search instead for 
Did you mean: 
Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
cancel
Showing results for 
Search instead for 
Did you mean: 

Autoloader file latency

Phani1
Valued Contributor II

Hi Team,

I would like to understand if there is a metadata table for the autoloader in Databricks that captures information about file arrival and processing.

The reason we are experiencing data issues is because our table A receives hundreds of files that are processed by an autoloader,
and in some scenarios, we have noticed that old files are processed after new files, possibly due to a problem in the source system.
However, if we have clear details about autoloder metadata, it will be easier to identify the root cause analysis.

could you please share the best practices for organizing data in a storage location that an autoloader can effectively process?

4 REPLIES 4

jose_gonzalez
Moderator
Moderator

Are you using file listing or file notification for auto loader? 

Phani1
Valued Contributor II

we are using the default.

jose_gonzalez
Moderator
Moderator

Kaniz_Fatma
Community Manager
Community Manager

Hey there! Thanks a bunch for being part of our awesome community! 🎉 

We love having you around and appreciate all your questions. Take a moment to check out the responses – you'll find some great info. Your input is valuable, so pick the best solution for you. And remember, if you ever need more help , we're here for you! 

Keep being awesome! 😊🚀

 

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group