Databricks Community

sajith_appukutt · 06-09-2021

I was expecting filter operations to be pushed down to Redshift by the optimizer. However, the entire dataset is getting loaded from Redshift.

sajith_appukutt · 06-21-2021

The Redshift data source for Spark pushes the following operators down into Redshift:

Filter
Project
Sort
Limit
Aggregation
Join

However, it does not currently push down expressions that operate on dates and timestamps, so a filter on such a column is applied in Spark after the data has been loaded. If you need this, please submit a feature request via https://docs.databricks.com/resources/ideas.html
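One possible workaround, sketched here rather than taken from the answer above, is to write the date filter in Redshift SQL yourself and pass it through the data source's `query` option, so Redshift evaluates it before anything is unloaded. The host, bucket, IAM role, and the `orders` table with its columns are all hypothetical placeholders. The format name `com.databricks.spark.redshift` is the long-standing one; depending on your runtime a shorter alias may also work. Calling `explain()` on the result is a simple way to inspect what Spark planned to send to Redshift.

```python
from typing import Dict


def redshift_read_options(
    jdbc_url: str, temp_dir: str, iam_role: str, query: str
) -> Dict[str, str]:
    """Build reader options that hand `query` to Redshift as written."""
    return {
        "url": jdbc_url,           # JDBC connection string
        "tempdir": temp_dir,       # S3 location used for the unload
        "aws_iam_role": iam_role,  # role Redshift assumes to write to S3
        "query": query,            # executed by Redshift, including the filter
    }


def load_from_redshift(spark, options: Dict[str, str]):
    """Load the query result and print the plan for inspection.

    `spark` is an existing SparkSession (for example the one provided
    in a Databricks notebook).
    """
    df = (
        spark.read.format("com.databricks.spark.redshift")
        .options(**options)
        .load()
    )
    df.explain()  # shows the physical plan, including the Redshift scan
    return df


# Hypothetical values for illustration only.
options = redshift_read_options(
    jdbc_url="jdbc:redshift://example-host:5439/dev?user=USER&password=PASS",
    temp_dir="s3a://example-bucket/tmp/",
    iam_role="arn:aws:iam::123456789012:role/example-redshift-role",
    query=(
        "SELECT order_id, order_ts FROM orders "
        "WHERE order_ts >= '2021-06-01'"
    ),
)
```

In a notebook, `df = load_from_redshift(spark, options)` would then return only the rows Redshift selected. The trade-off is that the filter lives in a SQL string instead of a DataFrame expression, so Spark does not validate it.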


I'm using the Redshift data source to load data into Spark SQL DataFrames. However, I'm not seeing predicate pushdown for the queries I run against Redshift. Is that expected?
