cancel
Showing results for 
Search instead for 
Did you mean: 
Administration & Architecture
Explore discussions on Databricks administration, deployment strategies, and architectural best practices. Connect with administrators and architects to optimize your Databricks environment for performance, scalability, and security.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

RaulPino
by New Contributor III
  • 11230 Views
  • 3 replies
  • 1 kudos

Resolved! Networking reduction cost for NATGateway and Shared Catalog

Use case and context:We have a databricks workspace in a specific region, reading and writing files from/to the same region.We also read from a Shared Catalog in a different company, a data provider, which is pointing to multi-region s3 buckets.The r...

Administration & Architecture
natgateway
networking
S3
shared catalog
VPC
  • 11230 Views
  • 3 replies
  • 1 kudos
Latest Reply
RaulPino
New Contributor III
  • 1 kudos

Thanks @Retired_mod for all the suggestions.After some days of monitoring NAT cost, I realized that the implementation of the S3 Gateway Endpoint it was actually working, the problem was that I thought that this change would be reflected right away i...

  • 1 kudos
2 More Replies
6502
by New Contributor III
  • 37398 Views
  • 1 replies
  • 0 kudos

Resolved! Error: default auth: cannot configure default credentials, please check...

Hola all, I'm experiencing a quite strange error. The problem is that and happens inside a GITLAB pipeline:$ databricks current-user meError: default auth: cannot configure default credentials, please check https://docs.databricks.com/en/dev-tools/au...

  • 37398 Views
  • 1 replies
  • 0 kudos
Latest Reply
6502
New Contributor III
  • 0 kudos

Hola Kaniz, the problem is not on Databricks CLI but is due to some interactions happening inside the Gitlab pipeline. According to the documentation reported here: Databricks personal access token authentication | Databricks on AWS ( at the bottom o...

  • 0 kudos
Debi-Moha
by Databricks Partner
  • 11305 Views
  • 1 replies
  • 2 kudos

External locations being shared across workspaces

Currently, we have 3 Unity Catalog enabled workspaces sharing the same metastore. Now, when we create an external location or storage credential in any of the workspaces, it gets reflected across all workspaces. We are looking for some best practices...

  • 11305 Views
  • 1 replies
  • 2 kudos
Latest Reply
Allia
Databricks Employee
  • 2 kudos

Hi @Debi-Moha  Currently we do not have a mechanism to isolate the external locations and storage credentials based on workspaces, since the metastore is shared across the workspaces. Please check below document for recommendations on securing extern...

  • 2 kudos
Carsten03
by New Contributor III
  • 22504 Views
  • 2 replies
  • 0 kudos

Bitbucket Cloud Repo Integration with Token

Hey,I am using Bitbucket Cloud and I want to connect my repository to Databricks. I am able to connect with my personal app password but what I am looking for is an authentication of a technical user.I need the integration to point to my dbt repo, wh...

  • 22504 Views
  • 2 replies
  • 0 kudos
Latest Reply
Carsten03
New Contributor III
  • 0 kudos

Hi @Retired_mod,thank you for your response! With this link you provided, I was able to authenticate with Bitbucket Cloud. The solution was to use x-token-auth as a username. I have tried with the generated email address before which didn't work. Tha...

  • 0 kudos
1 More Replies
smehta_0908
by New Contributor II
  • 4425 Views
  • 1 replies
  • 0 kudos

Monitor and Alert Databricks Resource Utilization and Cost Consumption

We want to build monitoring and Alerting solution for Azure Databricks that should capture Resource Utilization details (like Aggregated CPU%, Memory% etc.) and Cost consumption at the Account Level.We have Unity Catalog Enabled and there are multipl...

  • 4425 Views
  • 1 replies
  • 0 kudos
Latest Reply
Allia
Databricks Employee
  • 0 kudos

@smehta_0908 Greetings! You can utilize Datadog for monitoring CPU and memory of clusters. https://docs.datadoghq.com/integrations/databricks/?tab=driveronly For Cost consumption at accounts level you can make use of billable usage logs using the Acc...

  • 0 kudos
m997al
by Contributor III
  • 10582 Views
  • 2 replies
  • 0 kudos

Is there a way to configure a cluster to have no internet access?

When experimenting with LLMs on Databricks clusters, I have become interested in knowing if the LLM (Llama2 or otherwise), tries to make calls to the internet (i.e., the settings for use_remote_code=True in Huggingface models, as just one example).Mo...

  • 10582 Views
  • 2 replies
  • 0 kudos
Latest Reply
m997al
Contributor III
  • 0 kudos

Thanks!  I see the general direction there.

  • 0 kudos
1 More Replies
FlavioSM
by New Contributor II
  • 4718 Views
  • 1 replies
  • 1 kudos

Resolved! Install xml maven library

When I`m trying to install xml package, I`m getting error PERMISSION_DENIED 

  • 4718 Views
  • 1 replies
  • 1 kudos
Latest Reply
Palash01
Valued Contributor
  • 1 kudos

Hey @FlavioSM The error message indicates that the library you're trying to install (com.databricks:spark-xml_2.13:0.17.0) is not on the allowlist for shared clusters in your Databricks workspace. Shared clusters are clusters that multiple users can ...

  • 1 kudos
rfreitas
by New Contributor II
  • 9072 Views
  • 4 replies
  • 0 kudos

Resolved! SCIM Synchronization for Email Change Cases in Azure AD

Hi everyone,I would like to know if the following behavior is expected or if it is a misconfiguration in SCIMWe are going through a change in the email of some users. So we did a test, changing the email of one of them, but the result was not OK beca...

  • 9072 Views
  • 4 replies
  • 0 kudos
Latest Reply
Ayushi_Suthar
Databricks Employee
  • 0 kudos

Hi @rfreitas , It is an expected behaviour because we don't support email or username updates/edits in the SCIM as of now. As per Databricks documentation, an email update is not supported in SCIM. You cannot update the username or email address of a...

  • 0 kudos
3 More Replies
danmlopsmaz
by New Contributor III
  • 17766 Views
  • 4 replies
  • 0 kudos

Resolved! Databricks job creator update with API

Hi team,Greetings.Do you know if there is a way to update the creator of a databricks_job using the API? the Documentation does not show "the creator" property and when I tried setting the creator, this property is not updated in the workspace UI.The...

  • 17766 Views
  • 4 replies
  • 0 kudos
Latest Reply
Ayushi_Suthar
Databricks Employee
  • 0 kudos

Hi @danmlopsmaz , Thanks for bringing up your concerns, always happy to help  I understand the customer wanted to change the creator of the job but at this moment we could change the owner but not the creator. But you should be able to clone an exist...

  • 0 kudos
3 More Replies
edmundsecho
by New Contributor II
  • 15339 Views
  • 2 replies
  • 0 kudos

how register a published app

Is there a way to register a web app with Databricks to enable it to access a user's data using OAuth2? i.e., be included in the list of published apps with a pre-negotiated scope? 

  • 15339 Views
  • 2 replies
  • 0 kudos
Latest Reply
Yeshwanth
Databricks Employee
  • 0 kudos

@edmundsecho could you please elaborate more on the use case? 

  • 0 kudos
1 More Replies
Carsten03
by New Contributor III
  • 7768 Views
  • 7 replies
  • 2 kudos

Resolved! CloudWatch Agent Init Script Fails

Hi,I am trying to install the CloudWatch log agent on my cluster, using this tutorial from AWS https://aws.amazon.com/blogs/mt/how-to-monitor-databricks-with-amazon-cloudwatch/They provide an init script there but when I try to start my cluster I get...

  • 7768 Views
  • 7 replies
  • 2 kudos
Latest Reply
alexbishop
New Contributor II
  • 2 kudos

Hi @Yeshwanth I have the exact same issue here. I have tried to update my cluster with the init script that you kindly shared in this thread. However, my error is still:Init script failure:Cluster scoped init script s3://databricks-init-scripts-caden...

  • 2 kudos
6 More Replies
Datazilla
by New Contributor III
  • 10786 Views
  • 5 replies
  • 1 kudos

Is a central UC Catalog management a Good Practice?

I am working at large company with many more or less independent divisions and we are currently working on the roll out of Unity Catalog in Azure. The idea was to have a central infrastructure repository (deployed via Terraform) to manage all central...

  • 10786 Views
  • 5 replies
  • 1 kudos
Latest Reply
Datazilla
New Contributor III
  • 1 kudos

Workspace Admins: Consider configuring permissions for workspace admins in the Account console to strike a balance between autonomy and governance.@Retired_mod Do you have any information about this configuration? I cannot find such thing in the Acco...

  • 1 kudos
4 More Replies
sheridan06
by Databricks Partner
  • 2483 Views
  • 2 replies
  • 0 kudos

Resolved! Architecture With the Enhanced Security and Compliance Add-On

Morning everyone!  Wondering if someone has a high-level reference architecture of Databricks on AWS that includes the Enhanced Security and Compliance Add-On??thanks,Brad

  • 2483 Views
  • 2 replies
  • 0 kudos
Latest Reply
Yeshwanth
Databricks Employee
  • 0 kudos

@sheridan06 please check this documentation: https://www.databricks.com/trust/security-features/protect-your-data-with-enhanced-security-and-compliance Let us know if it helps.

  • 0 kudos
1 More Replies