by andrej • New Contributor II
- 2655 Views
- 4 replies
- 1 kudos
I have a large table which contains a date_time column. The table contains two generated columns, year and month, which are extracted from the date_time values and are used for partitioning. I have the following question. If I run the query SELECT * FROM tab...
Latest Reply
Hi @Andrej Znidarsic, hope all is well! Just wanted to check in: were you able to resolve your issue, and would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you. T...
- 25253 Views
- 16 replies
- 3 kudos
Imagine the following setup: I have log files stored as JSON files partitioned by year, month, day and hour in physical folders:
"""
/logs
|-- year=2020
|-- year=2021
`-- year=2022
    |-- month=01
    `-- month=02
        |-- day=01
        |-- day=.....
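
The folder layout above follows the Hive-style `key=value` naming convention, so the partition values can be recovered from a file path alone. The following is a minimal sketch of that idea in plain Python; the function name `parse_partitions`, the `hour=13` segment, and the file name `events.json` are assumptions made for illustration and do not come from the original post.

```python
from pathlib import PurePosixPath


def parse_partitions(path: str) -> dict:
    """Collect key=value folder segments from a Hive-style partitioned path."""
    partitions = {}
    for segment in PurePosixPath(path).parts:
        key, sep, value = segment.partition("=")
        # Keep only segments that really look like key=value folders.
        if sep and key and value:
            partitions[key] = value
    return partitions


# Hypothetical example path; the hour value and file name are made up.
example = parse_partitions("/logs/year=2022/month=02/day=01/hour=13/events.json")
print(example)
```

Values are returned as strings (for example `"02"` rather than `2`), which preserves the zero-padded folder names; converting them to integers is left to the caller.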
- 5594 Views
- 2 replies
- 1 kudos
Hello everybody, I have another strange issue and would like to confirm whether this is a bug or expected behaviour: I'm joining a large dataset with a dimension table and, as expected, DPP is activated. I was trying to deactivate the feature as it change...
Latest Reply
Hello @Kaniz Fatma, thank you for taking the time to answer. The issue in this case was that spark.databricks.optimizer.deltaTableFilesThreshold was activating DPP even though it had been formally deactivated by setting all available "enabled" properties to f...