Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

How to cache 500 billion rows

JoseMacedo
New Contributor II

Hello!

I'm using a serverless SQL cluster on Databricks, and I have a Delta table with 500 billion rows. I'm trying to filter it down to around 7 billion rows and then cache that dataset so I can use it in other queries and make them run faster.

When I cache the table, it completes in about 1 second with no error or warning.

When I select from the cached table, I get an error saying it cannot be found.

This is what I'm doing:

CACHE TABLE table_filtered_cache AS
SELECT * FROM prod_datalake.table a
WHERE a.year >= 2023 -- etc.


and then

SELECT count(*) FROM table_filtered_cache

What am I doing wrong, and what would you advise me to do?

3 REPLIES

-werners-
Esteemed Contributor III

Can you try creating a global temp view of the cache?
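
A sketch of what that could look like on a classic Spark cluster (not a SQL warehouse), reusing the names from the original post; the view name table_filtered_view is purely illustrative:

CACHE TABLE table_filtered_cache AS
SELECT * FROM prod_datalake.table a
WHERE a.year >= 2023; -- plus the other filters

-- Global temp views are registered under the global_temp schema
-- and are visible across sessions on the same cluster
CREATE GLOBAL TEMPORARY VIEW table_filtered_view AS
SELECT * FROM table_filtered_cache;

SELECT count(*) FROM global_temp.table_filtered_view;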

JoseMacedo
New Contributor II

I tried:

CREATE GLOBAL TEMPORARY VIEW table_filtered_cache AS SELECT ...

and got an error:

"GLOBAL TEMPORARY VIEW is not supported on a SQL warehouse."

-werners-
Esteemed Contributor III

I missed the 'serverless SQL' part. CACHE is a Spark feature; I don't think it works on serverless SQL.
Here is how caching works on DBSQL.
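
On a SQL warehouse, one common alternative is to materialize the filtered subset as a regular Delta table with CREATE TABLE ... AS SELECT and query that instead; the warehouse's automatic disk and result caching then speeds up repeated reads. A minimal sketch, reusing the filter from the post (the target table name is an assumption):

-- Materialize the ~7 billion-row subset once
CREATE OR REPLACE TABLE prod_datalake.table_filtered AS
SELECT * FROM prod_datalake.table a
WHERE a.year >= 2023; -- plus the other filters

-- Later queries read the much smaller table; hot files are
-- kept in the warehouse's disk cache automatically
SELECT count(*) FROM prod_datalake.table_filtered;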
