Hey there,

Here’s a simple step-by-step way to load a CSV in Databricks Free Edition:

Step 1: Upload the file to your workspace (not DBFS)
On the left menu, go to "Workspace".
Right-click any folder (like Shared or your username) → click "Upload".
Ch...
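Once the file is uploaded, one common way to query it directly is the read_files SQL function — a minimal sketch, where the workspace path and file name below are placeholders for wherever you actually uploaded your CSV:

```sql
-- Sketch: query an uploaded CSV directly (replace the path with your own)
SELECT *
FROM read_files(
  '/Workspace/Users/your_user/your_file.csv',  -- placeholder path
  format => 'csv',
  header => true
);
```

From there you can wrap the query in CREATE TABLE ... AS SELECT if you want a managed Delta table instead of re-reading the file each time.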
Hi again! Great question — when using Liquid Clustering, you do not need to run OPTIMIZE manually, even if you're truncating and reloading the entire dataset.

Once you’ve defined CLUSTER BY (x, y, z), Liquid Clustering automatically organizes the data ...
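For reference, declaring the clustering keys is a one-time step at table creation (x, y, z here stand in for whatever columns you actually cluster on, and the table and source names are illustrative):

```sql
-- Sketch: create a Delta table with Liquid Clustering keys
CREATE OR REPLACE TABLE my_catalog.my_schema.events  -- hypothetical name
CLUSTER BY (x, y, z)
AS SELECT * FROM my_catalog.my_schema.source_events;
```

After this, normal writes (including your truncate-and-reload pattern) are clustered without you scheduling anything extra.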
Hey there,

Processing many .txt files with different formats and validations is something Databricks handles well. Here’s a simple approach:

Recommended Approach: Use DLT (Delta Live Tables) or LakeFlow to build a pipeline per format (if each format ...
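As a sketch of what one such pipeline could look like in DLT SQL — assuming the .txt files land in a Unity Catalog volume (the path, table name, and validation rule below are all illustrative):

```sql
-- Sketch: one streaming table per format, with a validation expectation
CREATE OR REFRESH STREAMING TABLE format_a_raw (
  -- Drop rows that fail this (hypothetical) validation rule
  CONSTRAINT non_empty_line EXPECT (value IS NOT NULL AND value != '') ON VIOLATION DROP ROW
)
AS SELECT *
FROM STREAM read_files(
  '/Volumes/my_catalog/my_schema/landing/format_a/',  -- placeholder path
  format => 'text'
);
```

You would repeat this pattern per format, each with its own expectations, and Auto Loader (read_files in streaming mode) picks up new files incrementally.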
Yes, the VACUUM table_x RETAIN 720 HOURS; command will indeed override your table-level retention properties and potentially compromise your 6-month time travel capability. When you explicitly specify a retention period in the VACUUM command, it tak...
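If the goal is to preserve 6 months of time travel, the safer pattern is to set retention at the table level and then run VACUUM without an explicit RETAIN clause, so the table properties apply. A sketch (the table name mirrors the example above; the 180-day interval assumes "6 months" means roughly 180 days):

```sql
-- Sketch: set ~6-month retention at the table level
ALTER TABLE table_x SET TBLPROPERTIES (
  'delta.deletedFileRetentionDuration' = 'interval 180 days',
  'delta.logRetentionDuration'         = 'interval 180 days'
);

-- Vacuum with no RETAIN clause, so the table-level setting is honored
VACUUM table_x;
```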
Hi there,

Thanks for sharing your issue — working with 150 billion rows monthly is definitely a serious scale, and optimizing performance matters a lot. Let me try to address both your questions clearly:

Q1) Can we insert data in parallel into a cluste...
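On the parallel-insert point: Delta Lake's optimistic concurrency control generally lets multiple append-only writers commit against the same table without conflicting, so running the same kind of INSERT from several jobs at once is a workable pattern. A sketch with illustrative names:

```sql
-- Sketch: append-only inserts from two jobs running concurrently.
-- Blind appends don't read existing data, so they rarely conflict
-- under Delta's optimistic concurrency control.
INSERT INTO my_catalog.my_schema.big_table
SELECT * FROM my_catalog.my_schema.staging_batch_01;  -- job 1

INSERT INTO my_catalog.my_schema.big_table
SELECT * FROM my_catalog.my_schema.staging_batch_02;  -- job 2
```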