cancel
Showing results for 
Search instead for 
Did you mean: 
Administration & Architecture
Explore discussions on Databricks administration, deployment strategies, and architectural best practices. Connect with administrators and architects to optimize your Databricks environment for performance, scalability, and security.
cancel
Showing results for 
Search instead for 
Did you mean: 
Data + AI Summit 2024 - Data Lakehouse Architecture


Forum Posts

Mathias
by New Contributor II
  • 7372 Views
  • 6 replies
  • 3 kudos

Different settings per target with Asset bundles

When generating the standard setup with databricks bundle init we will get databricks.yml that references resources/*. The targets are set in the databricks.yml and the resources (pipelines and jobs) are set in different files.I have dlt pipelines th...

  • 7372 Views
  • 6 replies
  • 3 kudos
Latest Reply
MohsenJ
Contributor
  • 3 kudos

I also like to know the solution to this problem

  • 3 kudos
5 More Replies
robmcclain
by New Contributor II
  • 648 Views
  • 2 replies
  • 1 kudos

Resolved! Using the API, get the list of the schemas and tables a group or user has permissions for

I am attempting to use the Databricks API to get a list of the schemas and tables a group or user has permissions for.  Is this possible?  Is there another method I should be using instead?I see the Unity Catalog > Grants > Get permissions endpoint c...

  • 648 Views
  • 2 replies
  • 1 kudos
Latest Reply
robmcclain
New Contributor II
  • 1 kudos

Thanks, @Walter_C.  In my case, I was able to get the data I needed by using the Databricks SQL Driver for Node.js , querying the information_schema.table_privileges table.

  • 1 kudos
1 More Replies
lgepp11
by New Contributor III
  • 4580 Views
  • 5 replies
  • 0 kudos

Azure Entra SSO Error: Your user has not been registered

I have set up SSO within databricks and automatic user provisioning with Azure Entra and confirmed it is working for all users. However 1 user is presented with this when signing in. The user is in the enterprise app within Azure Entra and the user i...

lgepp11_0-1696914264539.png
Administration & Architecture
azure
Entra
Error
Sign In
sso
  • 4580 Views
  • 5 replies
  • 0 kudos
Latest Reply
nj28sharp
New Contributor II
  • 0 kudos

Figured out the issue, it seems like Email is case sensitive 

  • 0 kudos
4 More Replies
VJ3
by New Contributor III
  • 970 Views
  • 1 replies
  • 0 kudos

Security Consideration for OAUTH Secrets to use Service Principal to authenticate with Databricks

What are the security consideration we need to keep in mind when we want to us OAUTH Secrets to use a Service Principal to access Azure Databricks when Identity federation is disabled and workspace is not yet on boarded on to Unity Catalog? Can we co...

  • 970 Views
  • 1 replies
  • 0 kudos
Latest Reply
VJ3
New Contributor III
  • 0 kudos

Thank you @Retired_mod for the response. I do have follow up questions.- What kind of encryption is used to store OAUTH secret?-  Is there any way OAUTH can be generated by someone else who is not a manager of that SPN? We need this as a part of segr...

  • 0 kudos
ShankarM
by New Contributor III
  • 968 Views
  • 0 replies
  • 0 kudos

Databricks Clean Room costing

hi,Can you throw some light on how the compute, data sharing costing is done for various scenarios:1. Collaborator 1 and Collaborator 2 are having Databricks accounts in the same region and same cloud. Is there a DBU cost and who will pay for it? I a...

  • 968 Views
  • 0 replies
  • 0 kudos
a_user12
by New Contributor II
  • 1463 Views
  • 2 replies
  • 0 kudos

Resolved! Databricks Spot Instance: Completion Guarantee

Databricks allows to use spot instances for worker nodes. I consider to use them for interactive clusters. Do I have a gurantee that code will be completed without any errors even if spot instances are evicted? I would accept execution delays but no ...

a_user12_5-1719901164567.png
  • 1463 Views
  • 2 replies
  • 0 kudos
Latest Reply
imsabarinath
New Contributor III
  • 0 kudos

You could explore their "SPOT_WITH_FALLBAK" feature. If you don't want your jobs to fail because of eviction but this currently is not supported with interactive clusters. Hoping that they may extend this to all compute options soonCreate a pipeline ...

  • 0 kudos
1 More Replies
erigaud
by Honored Contributor
  • 15972 Views
  • 9 replies
  • 9 kudos

Resolved! Installing libraries on job clusters

Simple question : what is the way to go to install libraries on job clusters ? There does not seem to be a "Libraries" tab on the UI as opposed to regular clusters. Does it mean that the only option is to use init scripts ? 

  • 15972 Views
  • 9 replies
  • 9 kudos
Latest Reply
imsabarinath
New Contributor III
  • 9 kudos

You may want to copy required libs to a volume and load it during cluster setup to avoid downloading the libs for every run.

  • 9 kudos
8 More Replies
alm
by New Contributor III
  • 3864 Views
  • 4 replies
  • 0 kudos

Show all privileges granted to principal

Given the name of a principal in Databricks (I'm using account-level groups) is there an easy way to query or in other way obtain all privileges granted to this principal?I know I can obtain the information by querying in several of the system.inform...

  • 3864 Views
  • 4 replies
  • 0 kudos
Latest Reply
sakthi_sujitha
Databricks Employee
  • 0 kudos

This link will provide details on how to verify all the privileges granted to Service Principals 

  • 0 kudos
3 More Replies
sk_databricks
by New Contributor
  • 633 Views
  • 1 replies
  • 0 kudos

Databricks Cache Options

Hi,We are working on Databricks solution hosted on AWS. We are exploring the caching options in Databricks. Apart from the Databricks cache and spark cache? What are the options? Is it feasible to use 3rd party Cache solutions like AWS Elastic Cache ...

  • 633 Views
  • 1 replies
  • 0 kudos
Latest Reply
Walter_C
Databricks Employee
  • 0 kudos

Databricks provides several caching options to enhance performance by minimizing Input and Output (I/O) read and write operations. These include: Databricks Disk Cache: This cache accelerates data reads by creating copies of remote Parquet data file...

  • 0 kudos
ccsong
by New Contributor II
  • 1287 Views
  • 3 replies
  • 0 kudos

Resolved! How could we share the Databricks ML runtime cluster among users when enable Unity Catalog

Hi team,Currently, we use the Databricks ML runtime to run our workflows and sometimes do the EDA. What we need is that we want to create a Databricks ML runtime for the team to share. When enabling Unity Catalog, how could we create a shared ML runt...

  • 1287 Views
  • 3 replies
  • 0 kudos
Latest Reply
Walter_C
Databricks Employee
  • 0 kudos

Right now there is not plan to support ML runtime in shared clusters. Engineering is working on additional solutions but no ETA is currently available.In regards why it is not supported, principal reason is due to isolation which is not available in ...

  • 0 kudos
2 More Replies
KevinGagnon
by New Contributor
  • 629 Views
  • 0 replies
  • 0 kudos

Delta live table : run_as

Does Databricks have any plans to decouple the owner from the "run_as" identity in Delta Live Table like it can be done in jobs?The problem arise specially when using DABs. The service principal used to deploy DLTs shouldn't be the owner AND the runn...

  • 629 Views
  • 0 replies
  • 0 kudos
macmiller1
by New Contributor II
  • 3291 Views
  • 0 replies
  • 0 kudos

Pass secret in spark config when value is in form a.b.c={{secrets/scope/secret}}

I am configuring the Cluster for a spark-submit task and I am trying to specify `spark.executor.extraJavaOptions a.b.c={{secrets/scope/secret}}` but the literal {{secrets/scope/secret}} is being passed in rather than the secret value itself.I know th...

  • 3291 Views
  • 0 replies
  • 0 kudos
sandipkumar
by New Contributor II
  • 827 Views
  • 1 replies
  • 0 kudos

Updating python version from 3.8 to 3.12 for s3 ingestion

I went to AWS Cloudformation stack and edited the template from python 3.8 to 3.12 and updated. I did this for both the workspace stack and the s3 ingestion stack. Will it break anything? Do I need to make any changes in the python code in the templa...

  • 827 Views
  • 1 replies
  • 0 kudos
Latest Reply
sandipkumar
New Contributor II
  • 0 kudos

Hi @Retired_mod ,Thanks a lot! I will look at StackSets.As I mentioned the code is not written by me but Databricks. Why does Databricks not use a new python version in its default stacks? We are low on resources and heavily rely on the default Datab...

  • 0 kudos
dbrx_user
by New Contributor III
  • 1897 Views
  • 3 replies
  • 5 kudos

Resolved! Move to 100% Serverless

Hi all,A few questions about the upcoming transition to 100% serverless, if anyone has any info that would be great!When will the move to serverless occur? I understand from 1st July (today) but has anyone seen a roadmap?What will the move to serverl...

  • 1897 Views
  • 3 replies
  • 5 kudos
Latest Reply
m997al
Contributor III
  • 5 kudos

Hi, so our Databricks contact just assured us the following, after we asked about this issue:Databricks is officially (but won’t be GA in every region till end of July) 100% Serverless OPTIONAL.We understand many of our customers have begged for 100%...

  • 5 kudos
2 More Replies
RamlaS
by New Contributor II
  • 938 Views
  • 1 replies
  • 0 kudos

Resolved! Unity Catalog Pandas on Spark Limitation

According to Databricks UC Documentation, below are the some of the limitations on Shared Mode Cluster.1. In Databricks Runtime 13.3 LTS and above, Python scalar UDFs and Pandas UDFs are supported. Other Python UDFs, including UDAFs, UDTFs, and Panda...

  • 938 Views
  • 1 replies
  • 0 kudos
Latest Reply
raphaelblg
Databricks Employee
  • 0 kudos

@RamlaS, UDFs on Unity Catalog is a feature that, at the current moment is still on the Public Preview stage. This means that the development has yet not finished.  UDFs can be used on DBR 13.3 and above, UDAFs are already available for DBR 15.2 on G...

  • 0 kudos

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group
Labels