by aladda • Honored Contributor II
- 659 Views
- 0 replies
- 0 kudos
It is best to avoid collecting stats on long strings. You typically want to collect stats on columns that are used in filters, WHERE clauses, and joins, and on which you tend to perform aggregations, typically numerical values. You can avoid collecting s...
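As a rough illustration of the advice above: Delta Lake collects file-level statistics on the first 32 columns of a table by default, and the `delta.dataSkippingNumIndexedCols` table property changes that count. One way to keep long strings out of the stats, sketched here under those assumptions, is to place the filter, join, and aggregation columns first and set the property to their count. The table name, column names, and the `plan_stats_columns` helper are hypothetical, and this is not necessarily what the truncated reply goes on to describe.

```python
# Types treated as "long string" candidates; an assumption for this sketch.
WIDE_TYPES = {"string", "binary"}


def plan_stats_columns(schema, table="events"):
    """Order columns so non-string columns come first, and build the
    ALTER TABLE statement that limits stats to that leading group.

    schema: list of (column_name, type_name) pairs (hypothetical input).
    """
    indexed = [name for name, typ in schema if typ.lower() not in WIDE_TYPES]
    skipped = [name for name, typ in schema if typ.lower() in WIDE_TYPES]
    sql = (
        f"ALTER TABLE {table} "
        f"SET TBLPROPERTIES ('delta.dataSkippingNumIndexedCols' = '{len(indexed)}')"
    )
    return indexed + skipped, sql


# Hypothetical schema: one long string column and three narrow columns.
schema = [
    ("payload", "string"),
    ("event_date", "date"),
    ("user_id", "bigint"),
    ("amount", "double"),
]
column_order, alter_sql = plan_stats_columns(schema)
print(column_order)
print(alter_sql)
```

The helper only produces a column order and a statement to review; the property would need to be set before the data is written for the stats to reflect it.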
- 741 Views
- 2 replies
- 0 kudos
What is the best way to deal with concurrent exceptions in Delta when you have multiple writers on the same Delta table?
Latest Reply
While you can try-catch-retry, retrying is expensive because the underlying table snapshot will have changed. The best approach is to avoid conflicts as much as possible by using partitioning and disjoint command conditions.
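To make "disjoint command conditions" concrete, here is a minimal sketch, assuming the table is partitioned by a hypothetical `event_date` column and each writer is handed its own set of dates. Each writer's statement names its partitions explicitly in the predicate, so concurrent writers target different partitions. The table, column, and helper names are illustrative only.

```python
def assign_partitions(dates, num_writers):
    """Split partition values round-robin so no two writers share a value."""
    return [dates[i::num_writers] for i in range(num_writers)]


def update_statement(table, dates, set_clause):
    """Build an UPDATE whose predicate names its partitions explicitly."""
    in_list = ", ".join(f"'{d}'" for d in dates)
    return f"UPDATE {table} SET {set_clause} WHERE event_date IN ({in_list})"


# Hypothetical partition values, divided between two writers.
dates = ["2024-01-01", "2024-01-02", "2024-01-03", "2024-01-04"]
assignments = assign_partitions(dates, 2)

# Sanity check: the writers' partition sets must not overlap.
assert not set(assignments[0]) & set(assignments[1])

statements = [
    update_statement("events", part, "processed = true") for part in assignments
]
for statement in statements:
    print(statement)
```

The point of the design is that the partition filter appears literally in each command's condition, rather than being implied by the data each writer happens to hold.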
- 1222 Views
- 2 replies
- 1 kudos
Using Spark SQL, or particularly %SQL in a Databricks notebook, is there a way to use pagination, offset, or skip?
Latest Reply
There is no offset support yet. Here are a few possible workarounds. If your data is all in one partition (rarely the case), you could create a column with monotonically_increasing_id and apply filter conditions. If there are multiple partitions...
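One commonly used way to emulate paging without an offset clause is to number the rows with the `ROW_NUMBER()` window function and filter on a range. The sketch below builds such a query as a string; it is an illustration under assumptions, not necessarily the workaround the truncated reply describes. The `page_query` helper, the table name, and the ordering column are hypothetical.

```python
def page_query(table, order_by, page, page_size):
    """Build a query that returns one page of rows, with pages counted from 0.

    Rows are numbered by ROW_NUMBER() over a global ordering. A window
    with no PARTITION BY makes Spark move all rows into a single
    partition, so this suits modest result sets rather than huge tables.
    """
    start = page * page_size + 1
    end = start + page_size - 1
    return (
        f"SELECT * FROM ("
        f"SELECT *, ROW_NUMBER() OVER (ORDER BY {order_by}) AS rn FROM {table}"
        f") t WHERE rn BETWEEN {start} AND {end}"
    )


# Third page (page index 2) of 25 rows from a hypothetical table.
query = page_query("events", "event_id", page=2, page_size=25)
print(query)
```

The ordering column should be unique, or at least stable, so that consecutive pages neither repeat nor skip rows.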