filter push down into redis when querying using spark connector

- - Certifications
- - Learning Paths
- - Databricks Product Tours
- - Get Started Guides

- - Get Started Resources
- - Announcements
- - Community Articles
- - Databricks TV
- - Learning Events
- - MVP Articles
- - Product Platform Updates
- - Support FAQs
- - Technical Blog
- - Community Events
- - BrickTalks TV

- - Databricks Academy Learners
  - Databricks Academy Learners Forum
- - Regional and Interest Groups
- - Private Groups

- - Databricks Community Champions
- - Khoros Community Forums Support (Not for Databricks Product Questions)
- - Databricks Community Code of Conduct
- - DAIS 2026

Data Engineering

Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

I'm loading df from redis using this code:

df = (spark.read.format("org.apache.spark.sql.redis")
        .option("table", f"state_store_ready_to_sell")
        .option("key.column", "msid").option("infer.schema", "true").load()

and then i'm running filter , for example:

ready_to_sell = df.filter("msid in ('12321','12432')")

I looked at spark plan and spark does not push the msid filter to redis.

Which means that all redis records are loaded and filtered on spark memory (according to the sql tab is spark ui)

msid is key.column in redis of course.

How do i make spark push down the filter the fetch only the relevant records?

Thanks!

Almog

0 REPLIES 0

never-displayed

You must be signed in to add attachments

never-displayed

Announcements

Congratulations Databricks Partners! You're Now Officially Recognized in the Databricks Community

Solution Accelerator Series | Measure Ad Effectiveness With Multi-Touch Attribution

Govern AI Spend at Scale: A Data-Driven Approach to AI Governance | Webinar

Databricks AMER Learning Festival | Virtual Training

Databricks Community

filter push down into redis when querying using spark connector

Congratulations Databricks Partners! You're Now Officially Recognized in the Databricks Community

Solution Accelerator Series | Measure Ad Effectiveness With Multi-Touch Attribution

Govern AI Spend at Scale: A Data-Driven Approach to AI Governance | Webinar

Databricks AMER Learning Festival | Virtual Training

Introducing the Genie Hub: Ask Questions, Share Builds, and Master Conversational Analytics