Does Databricks get cached result for a subquery?

whleeman — Wed, 28 Jun 2023 21:27:46 GMT

Suppose I run the query "SELECT fare_amount FROM nyctaxi.trips WHERE fare_amount > 1.5". The query results will be cached for 24 hours.

I then compose a second query that uses the previous query as a subquery: "SELECT * FROM nyctaxi.trips WHERE fare_amount IN (SELECT fare_amount FROM nyctaxi.trips WHERE fare_amount > 1.5)".

Will Databricks reuse the cached result of the first query for the subquery to speed up execution of the second query?

Note that I am not asking Databricks to cache the subquery result. I am asking whether it reuses the result that was already cached when the subquery was run earlier as a standalone query.
