Databricks Platform Discussions
Dive into comprehensive discussions covering various aspects of the Databricks platform. Join the co...
Dive into comprehensive discussions covering various aspects of the Databricks platform. Join the co...
Engage in vibrant discussions covering diverse learning topics within the Databricks Community. Expl...
Dear Databricks Community,I am trying to programmatically access the narrative text summary that Databricks Genie displays in the UI, but I cannot find it in the public API responses.What I see in the UI: When I ask Genie a question, the UI displays ...
Hey, thank you very much! Confirm the text summary is in the response now.
Hi Team,We have a few scripts stored in a folder on a DBFS path. Recently, we've noticed that when we navigate to this path manually, the folder appears to be empty, and we are unable to see the scripts.However, the jobs that reference and access the...
Hi @Shivani_Komma99, Thanks for flagging this. Based on the behaviour you described, this appears to be consistent with a DBFS browser UI issue/regression rather than a problem with the underlying files, especially since the files are still accessibl...
When attempting to change sharing settings of published dashboards from Only people with access can view to anyone in my account can view or vice versa we get a "Failed to change permissions on dashboard..." error. This happens for all dashboards, bo...
Wow @Ashwin_DSA that's a quick way to escalate and resolve!
I recently upgraded my Databricks Connect version to 15.4 and got set up for Serverless, but ran into the following error when I ran the standard code to enable Arrow on Pyspark: >>> spark.conf.set(key='spark.sql.execution.arrow.pyspark.enabled', val...
When might this limitation be lifted? Is it due to security concerns that serverless doesn't support this?
Hi,I want to install Databricks Lakebridge on my Databricks environment and use the analyze and transpile commands through a python script.The usecase is that we need to create an automated pipeline that will migrate the existing scripts from snowfla...
Hello,I have a problemimport socket print(socket.gethostbyname("google.com"))[Trace ID: 00-9e5667936e24418dae4c39a9ed003a22-a43a5f2ca9b14451-00] --------------------------------------------------------------------------- gaierror ...
ABAC in Unity Catalog already gave teams a better way to apply row filters and column masks at scale. Instead of rebuilding the same access logic table by table, teams can classify data with governed tags and let policies apply automatically across a...
Interview Question:Many people start with the row count.I would start with the architecture.Billions of records are not new in enterprise data engineering. The real challenge is designing a pipeline that runs predictably, efficiently, and within SLA....
Here’s What Nobody Tells You.A hands-on look at Genie MCP and DBSQL MCP — what works, what doesn’t, and when to skip MCP entirely.Let me start with something most MCP content skips. MCP servers don’t just expose tools. They expose three things — tool...
I am using Azure Databricks DBFS. I have stored some python scripts, notebooks in folders in DBFS root and have mounted ADLS Gen2 to DBFS. Currently I do not have Unity Catalog enabled and this was setup 3-4 years ago. I came across this article http...
In current state, when DBFS is disabled in a workspace, both the root filestore and mount paths become inaccessible.What is deprecated are:DBFS root storage the workspace‑provisioned filestore - /dbfs/FileStore/, /dbfs/root/ etc.DBFS mounts (/dbfs/mn...
Hi everyone,I'm currently working on implementing a data dictionary, and I'm struggling to understand what the best approach is when using a Medallion architecture in Databricks.My company is migrating its data to Databricks so that users can consume...
Hi, This is a challenging area to get right, especially with so many tables and schemas. Unity Catalog is definitely a good bet for storing this metadata. There are a few approaches that you could take to the data definitions part, and it depends on ...
Hello.Ive created a custom cluster - m4.large. When i try to execute some code in this cluster the behaviour is as follows:- Cluster starts, have running status- I run code, for example print("Hello")- Code runs indefinitely- I click interrupt, it st...
The fact that print("Hello") eventually works but SELECT 1 never completes suggests the cluster may be running but not fully initialized for Spark workloads.A few things I’d check first:Cluster Event Log for any provisioning or startup errors.Spark U...
The conversation around AI often starts with models. In reality, long-term value comes from how well data pipelines, governance, and workflows work together.The teams moving fastest seem to be simplifying their stack instead of expanding it.What tren...
Coming from a statistics background, this makes a lot of sense to me. Before any model, we always spend most of the time just cleaning and validating data. the actual modeling is usually the smaller part.I think that's why the medallion architecture ...
We're currently setting up Databricks Asset Bundles (DAB) with a CI/CD pipeline using Azure DevOps.Our planned development workflow is as follows:Main branch → Developer creates a feature branch → Implement changes → Create a Pull Request → Senior de...
You can create Databricks Asset Bundles that are decoupled by domain, managed via multi target declarations within configuration and also driven by immutable, versioned artifacts stored securely within Unity Catalog Volumes. You can rely on explicit ...
Hi everyone,I'm using #Declarative Automation Bundles (DAB) to deploy data pipelines, and I've run into an issue with concurrent job runs and deploymentWhat happened:I started a job that depends on a wheel file built by the bundle (timestamped artifa...
Do you have source_linked_deployment set as false? That's probably causing it.
| User | Count |
|---|---|
| 1899 | |
| 1028 | |
| 930 | |
| 480 | |
| 362 |