Databricks Community

User16826994223 · ‎06-21-2021

I want to know how databricks maintain data recency in databricks

sajith_appukutt · ‎06-22-2021

When using delta tables in databricks, you have the advantage of delta cache which accelerates data reads by creating copies of remote files in nodes’ local storage using a fast intermediate data format. At the beginning of each query delta tables auto-update to the latest version - this way data is always recent.

However, if it is acceptable for results to be stale for a short duration of time, you could lower the latency of queries further. This is done by setting the Spark session configuration variable spark.databricks.delta.stalenessLimit with a time string value, e.g 1h, 15m, 1d

Databricks Community

How do we manage data recency in Databricks

Connect with Databricks Users in Your Area

Get Started With Lakehouse Architecture | Pass a quiz to earn your certificate completion.

Databricks Community Champion - February 2025 - Stefan Koch

Virtual Learning Festival: 9 April - 30 April

Women’s Week Challenge: Play, Engage & Win Swag

Data + AI Summit 2025 — registration now open!