AbhaySingh
Databricks Employee
since 10-15-2025
Wednesday

User Stats

  • 41 Posts
  • 6 Solutions
  • 3 Kudos given
  • 23 Kudos received

User Activity

We've all been there. You're excited about the lakehouse, you see the value clear as day, and then you try explaining it to a coworker and their eyes glaze over. Slides don't cut it. Documentation links get ignored. What actually works? Showing them...
You can now run distributed ML (Spark MLlib in Python, Optuna tuning, MLflow Spark, Joblib Spark) on serverless notebooks/jobs and on standard clusters, not just dedicated ML clusters. It reuses the same Unity Catalog + Lakeguard stack you already use...
Last month, our nightly CDC pipeline started timing out. What used to complete in 20 minutes was now crawling past the 4-hour mark—and failing. The culprit? A MERGE statement against a 2.3TB Delta table with 800 million rows that had grown steadily o...
Delta Lake 4.0 is the next major open-source release aligned with Spark 4.x, adding first-class Variant for semi-structured data, safer Type Widening, improved DROP FEATURE, better transaction log handling, and a new multi-engine story via Delta Kern...
If you’ve ever hacked together a one-off script to pull data from some random API into Spark, you’re exactly who the new Python Data Source API is for. Databricks has made this API generally available on Apache Spark™ 4.0 with Databricks Runtime 1...