cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
cancel
Showing results for 
Search instead for 
Did you mean: 

using Spark SQL or particularly %SQL in a databricks notebook, is there a way to use pagination or offset or skip ?

User16783853501
New Contributor II
New Contributor II

using Spark SQL or particularly %SQL in a databricks notebook, is there a way to use pagination or offset or skip ? 

2 REPLIES 2

aladda
Honored Contributor II
Honored Contributor II

Can you clarify what are you looking for and what your use case is? Are you asking whether there's a preference for using Spark SQL or just direct SQL with %sql or something else?

sajith_appukutt
Honored Contributor II

There is no offset support yet. Here are a few possible workarounds

  • If you data is all in one partition ( rarely the case 🙂 ) , you could create a column with monotonically_increasing_id and apply filter conditions. if there are multiple partitions, monotonically_increasing_id won't be consecutive

  • Use except ( in your case sql equivalent of code below) . This however would be an expensive operation
df1 = df.limit(10); 
df2 = df.except(df1); 
df2.limit(20);

Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.