cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
cancel
Showing results for 
Search instead for 
Did you mean: 

collect_set/ collect_list Pushdown

Sam
New Contributor III

Hello,

I've noticed that Collect_Set and Collect_List are not pushed down to the database?

Runtime DB 9.1LTS

Spark 3.1.2

Database: Snowflake

Is there any way to get a distinct set from a group by in a way that will push down the query to the database?

1 REPLY 1

-werners-
Esteemed Contributor III

Hm so collect_set does not get translated to listagg.

Can you try the following?

  • use a more recent version of dbrx
  • use delta lake as spark source
  • use the latest version of the snowflake connector
  • check if pushdown to snowflake is enabled

Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.