cancel
Showing results for 
Search instead for 
Did you mean: 
dotan
New Contributor II
since ‎11-02-2022
‎06-26-2023

User Stats

  • 3 Posts
  • 0 Solutions
  • 1 Kudos given
  • 0 Kudos received

User Activity

I have a hive table in Delta format with over 1B rows, when I check the Data Explorer in the SQL section of Databricks it notes that the table size is 139.3GiB with 401 files but when I check the S3 bucket where the files are located (dbfs:/user/hive...
I setup a notebook to ingest data using Auto Loader from an S3 bucket that contains over 500K CSV files into a hive table.Recently the amount of rows (and input files) in the table grew from around 150M to 530M and now each batch takes around an hour...
Kudos given to