cancel
Showing results for 
Search instead for 
Did you mean: 
Administration & Architecture
Explore discussions on Databricks administration, deployment strategies, and architectural best practices. Connect with administrators and architects to optimize your Databricks environment for performance, scalability, and security.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

noorbasha534
by Contributor III
  • 502 Views
  • 3 replies
  • 1 kudos

Azure Databricks Status

Dear all,I wanted to check if anyone implemented the solution of capturing information from Databricks status page in real-time 24x7 and load that into a log or table...https://learn.microsoft.com/en-us/azure/databricks/resources/statuswhat is the be...

  • 502 Views
  • 3 replies
  • 1 kudos
Latest Reply
TheRealOliver
Contributor
  • 1 kudos

It seems that the webhook is the way!There is nothing about system status in Databricks REST API.There is nothing about system status in the System Tables schema.

  • 1 kudos
2 More Replies
david_btmpl
by New Contributor II
  • 218 Views
  • 1 replies
  • 2 kudos

for_each_task with pool clusters

I am trying to run a `for_each_task` across different inputs of length `N` and `concurrency` `M` where N >> M.  To mitigate cluster setup time I want to use pool clusters.Now, when I set everything up, I notice that instead of `M` concurrent clusters...

  • 218 Views
  • 1 replies
  • 2 kudos
Latest Reply
SP_6721
Contributor
  • 2 kudos

Hi @david_btmpl When you set up a Databricks workflow using for_each_task with a cluster pool (instance_pool_id), Databricks will, by default, reuse the same cluster for all concurrent tasks in that job. So even if you’ve set a higher concurrency (li...

  • 2 kudos
janhouf
by New Contributor II
  • 1838 Views
  • 2 replies
  • 2 kudos

Query has been timed out due to inactivity.

Hi,We're experiencing an issue with SQL Serverless Warehouse when running queries through the dbx-sql-connector in Python. The error we get is: "Query has been timed out due to inactivity."This happens intermittently, even for queries that should com...

  • 1838 Views
  • 2 replies
  • 2 kudos
Latest Reply
AiswaryaS
New Contributor II
  • 2 kudos

Getting the same error while trying to run Tableau flow on Databricks. Is there a solution for this issue?

  • 2 kudos
1 More Replies
naumanmunir
by New Contributor II
  • 281 Views
  • 1 replies
  • 2 kudos

Service Principal Authentication / Terraform

Hello Databricks Community,I'm encountering an issue when trying to apply my Terraform configuration to create a Databricks MWS network on GCP. The terraform apply command fails with the following error: Error: cannot create mws networks: failed duri...

  • 281 Views
  • 1 replies
  • 2 kudos
Latest Reply
mnorland
Contributor
  • 2 kudos

Databricks account-level APIs can only be called by account owners and account admins and can only be authenticated using Google-issued OIDC tokens.In Terraform 0.13 and later, data resources have the same dependency resolution behavior as defined fo...

  • 2 kudos
jreh
by New Contributor III
  • 2459 Views
  • 7 replies
  • 2 kudos

Exact cost for job execution calculation

Hi everybody,I want to calculate the exact cost of single job execution. In all examples I can find on the internet it uses the tables system.billing.usage and system.billing.list_prices. It makes sense to calculate the sum of DBUs consumed and multi...

jreh_0-1728643622955.png
  • 2459 Views
  • 7 replies
  • 2 kudos
Latest Reply
vziog
New Contributor II
  • 2 kudos

And what about the costs for the disks of the VMs of the cluster?

  • 2 kudos
6 More Replies
Teo12333
by New Contributor II
  • 392 Views
  • 2 replies
  • 2 kudos

Impossible to access Terraform created external location?!

Hi all,There seems to be an external location created that nobody within the organization can actually see or manage, because it has been created with a Google service account in Terraform.Here is the problem:DESCRIBE EXTERNAL LOCATION `gcsbucketname...

  • 392 Views
  • 2 replies
  • 2 kudos
Latest Reply
mnorland
Contributor
  • 2 kudos

I would agree that the metastore admin(s) should be able to see the external location.  This issue can happen with terraform scripts if the script doesn't grant additional rights on the external location.

  • 2 kudos
1 More Replies
Isi
by Contributor III
  • 1477 Views
  • 3 replies
  • 3 kudos

Unable to access Databricks Volume from job triggered via API (Container Services)

Hi everyone,We’re facing a strange issue when trying to access a Databricks Volume from a job that is triggered via the Databricks REST API (not via Workflows). These jobs are executed using container services, which may be relevant, perhaps due to i...

  • 1477 Views
  • 3 replies
  • 3 kudos
Latest Reply
rcdatabricks
New Contributor III
  • 3 kudos

@Isi Are you using databricks-sdk library to access this volumes?example: https://docs.databricks.com/aws/en/dev-tools/sdk-python#files-in-volumes:~:text=Catalog%20volume.-,Python,-from%20databricks.

  • 3 kudos
2 More Replies
antonionuzzo
by New Contributor III
  • 328 Views
  • 1 replies
  • 0 kudos

Unexpected Behavior with Azure Databricks and Entra ID SCIM Integration

Hi everyone,I'm currently running some tests for a company that uses Entra ID as the backbone of its authentication system. Every employee with a corporate email address is mapped within the organization's Entra ID.Our company's Azure Databricks is c...

  • 328 Views
  • 1 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hello @antonionuzzo, This behavior is occurring because Azure Databricks allows workspace administrators to invite users from their organization's Entra ID directory into the Databricks workspace. This capability functions independently of whether th...

  • 0 kudos
antonionuzzo
by New Contributor III
  • 518 Views
  • 3 replies
  • 1 kudos

Monitor workspace admin activities

Hello everyone,I am conducting tests on Databricks AWS and have noticed that in an organization with multiple workspaces, each with different workspace admins, a workspace admin can invite a user who is not mapped within their workspace but is alread...

  • 518 Views
  • 3 replies
  • 1 kudos
Latest Reply
BigRoux
Databricks Employee
  • 1 kudos

You do have some control over what workspace admins can do. Databricks allows account admins to restrict workspace admin permissions by enabling the RestrictWorkspaceAdmins setting. Have a look here: https://docs.databricks.com/aws/en/admin/workspace...

  • 1 kudos
2 More Replies
KIRKQUINBAR
by New Contributor II
  • 615 Views
  • 1 replies
  • 2 kudos

Resolved! Predictive Optimization with multiple workspaces

We currently have an older instance of Azure Databricks that i migrated to Unity Catalog. Unfortunately i ran into some weird issues that don't seem fixable so i created a new instance and pointed it to the same metastore. The setting at the metastor...

  • 615 Views
  • 1 replies
  • 2 kudos
Latest Reply
Renu_
Contributor
  • 2 kudos

Hi @KIRKQUINBAR, if you enable Predictive Optimization at the metastore level in Unity Catalog, it automatically applies to all Unity Catalog managed tables within that metastore, no matter which workspace is accessing them. PO runs centrally, so the...

  • 2 kudos
RicksDB
by Contributor III
  • 565 Views
  • 2 replies
  • 0 kudos

Restrict serverless options to a subset of users

Hi,It seems as if there is no way to restrict serverless options to only only a subset of users. If a user has no budget policy, I assumed he could not run a serverless workload. Unfortunately, this is not the case and it will become a cost governanc...

  • 565 Views
  • 2 replies
  • 0 kudos
Latest Reply
gboso
New Contributor II
  • 0 kudos

Any news on this? This is a major blocker for us to enable serverless as we only have a handful of expert users

  • 0 kudos
1 More Replies
alexacas
by New Contributor II
  • 1818 Views
  • 3 replies
  • 0 kudos

Resolved! Help with Databricks SQL Queries

Hi everyone,I’m relatively new to Databricks and trying to optimize some SQL queries for better performance. I’ve noticed that certain queries take longer to run than expected. Does anyone have tips or best practices for writing efficient SQL in Data...

  • 1818 Views
  • 3 replies
  • 0 kudos
Latest Reply
lowedirect
New Contributor II
  • 0 kudos

When working with large datasets in Databricks SQL, here are some practical tips to boost performance:Leverage Partitioning: Partition large Delta tables on columns with high cardinality and frequent filtering (like date or region). It helps Databric...

  • 0 kudos
2 More Replies
raffael
by New Contributor III
  • 798 Views
  • 3 replies
  • 1 kudos

Resolved! How does reported billing in Azure relate to Databricks?

Hi,I'm confused by how costs in Azure relate to costs in Databricks. I'm currently on Azure Pay-as-you-Go and Databricks Trial. There's nothing on my Azure account going on apart from Databricks.This is the costs bar chart on Azure (€):This is the co...

raffael_2-1744702361699.png raffael_1-1744702336209.png raffael_3-1744702381360.png
  • 798 Views
  • 3 replies
  • 1 kudos
Latest Reply
raffael
New Contributor III
  • 1 kudos

Thanks.I can't find a documentation on how DBU translates to $/€. The pricing calculator only works for AWS/GCP. Where would I find that info?

  • 1 kudos
2 More Replies