cancel
Showing results for 
Search instead for 
Did you mean: 
Administration & Architecture
Explore discussions on Databricks administration, deployment strategies, and architectural best practices. Connect with administrators and architects to optimize your Databricks environment for performance, scalability, and security.
cancel
Showing results for 
Search instead for 
Did you mean: 
Data + AI Summit 2024 - Data Lakehouse Architecture


Forum Posts

erigaud
by Honored Contributor
  • 15596 Views
  • 9 replies
  • 9 kudos

Resolved! Installing libraries on job clusters

Simple question : what is the way to go to install libraries on job clusters ? There does not seem to be a "Libraries" tab on the UI as opposed to regular clusters. Does it mean that the only option is to use init scripts ? 

  • 15596 Views
  • 9 replies
  • 9 kudos
Latest Reply
imsabarinath
New Contributor III
  • 9 kudos

You may want to copy required libs to a volume and load it during cluster setup to avoid downloading the libs for every run.

  • 9 kudos
8 More Replies
alm
by New Contributor III
  • 3742 Views
  • 4 replies
  • 0 kudos

Show all privileges granted to principal

Given the name of a principal in Databricks (I'm using account-level groups) is there an easy way to query or in other way obtain all privileges granted to this principal?I know I can obtain the information by querying in several of the system.inform...

  • 3742 Views
  • 4 replies
  • 0 kudos
Latest Reply
sakthi_sujitha
Databricks Employee
  • 0 kudos

This link will provide details on how to verify all the privileges granted to Service Principals 

  • 0 kudos
3 More Replies
sk_databricks
by New Contributor
  • 608 Views
  • 1 replies
  • 0 kudos

Databricks Cache Options

Hi,We are working on Databricks solution hosted on AWS. We are exploring the caching options in Databricks. Apart from the Databricks cache and spark cache? What are the options? Is it feasible to use 3rd party Cache solutions like AWS Elastic Cache ...

  • 608 Views
  • 1 replies
  • 0 kudos
Latest Reply
Walter_C
Databricks Employee
  • 0 kudos

Databricks provides several caching options to enhance performance by minimizing Input and Output (I/O) read and write operations. These include: Databricks Disk Cache: This cache accelerates data reads by creating copies of remote Parquet data file...

  • 0 kudos
ccsong
by New Contributor II
  • 1230 Views
  • 3 replies
  • 0 kudos

Resolved! How could we share the Databricks ML runtime cluster among users when enable Unity Catalog

Hi team,Currently, we use the Databricks ML runtime to run our workflows and sometimes do the EDA. What we need is that we want to create a Databricks ML runtime for the team to share. When enabling Unity Catalog, how could we create a shared ML runt...

  • 1230 Views
  • 3 replies
  • 0 kudos
Latest Reply
Walter_C
Databricks Employee
  • 0 kudos

Right now there is not plan to support ML runtime in shared clusters. Engineering is working on additional solutions but no ETA is currently available.In regards why it is not supported, principal reason is due to isolation which is not available in ...

  • 0 kudos
2 More Replies
KevinGagnon
by New Contributor
  • 623 Views
  • 0 replies
  • 0 kudos

Delta live table : run_as

Does Databricks have any plans to decouple the owner from the "run_as" identity in Delta Live Table like it can be done in jobs?The problem arise specially when using DABs. The service principal used to deploy DLTs shouldn't be the owner AND the runn...

  • 623 Views
  • 0 replies
  • 0 kudos
macmiller1
by New Contributor II
  • 3276 Views
  • 0 replies
  • 0 kudos

Pass secret in spark config when value is in form a.b.c={{secrets/scope/secret}}

I am configuring the Cluster for a spark-submit task and I am trying to specify `spark.executor.extraJavaOptions a.b.c={{secrets/scope/secret}}` but the literal {{secrets/scope/secret}} is being passed in rather than the secret value itself.I know th...

  • 3276 Views
  • 0 replies
  • 0 kudos
sandipkumar
by New Contributor II
  • 812 Views
  • 1 replies
  • 0 kudos

Updating python version from 3.8 to 3.12 for s3 ingestion

I went to AWS Cloudformation stack and edited the template from python 3.8 to 3.12 and updated. I did this for both the workspace stack and the s3 ingestion stack. Will it break anything? Do I need to make any changes in the python code in the templa...

  • 812 Views
  • 1 replies
  • 0 kudos
Latest Reply
sandipkumar
New Contributor II
  • 0 kudos

Hi @Retired_mod ,Thanks a lot! I will look at StackSets.As I mentioned the code is not written by me but Databricks. Why does Databricks not use a new python version in its default stacks? We are low on resources and heavily rely on the default Datab...

  • 0 kudos
dbrx_user
by New Contributor III
  • 1861 Views
  • 3 replies
  • 5 kudos

Resolved! Move to 100% Serverless

Hi all,A few questions about the upcoming transition to 100% serverless, if anyone has any info that would be great!When will the move to serverless occur? I understand from 1st July (today) but has anyone seen a roadmap?What will the move to serverl...

  • 1861 Views
  • 3 replies
  • 5 kudos
Latest Reply
m997al
Contributor III
  • 5 kudos

Hi, so our Databricks contact just assured us the following, after we asked about this issue:Databricks is officially (but won’t be GA in every region till end of July) 100% Serverless OPTIONAL.We understand many of our customers have begged for 100%...

  • 5 kudos
2 More Replies
RamlaS
by New Contributor II
  • 915 Views
  • 1 replies
  • 0 kudos

Resolved! Unity Catalog Pandas on Spark Limitation

According to Databricks UC Documentation, below are the some of the limitations on Shared Mode Cluster.1. In Databricks Runtime 13.3 LTS and above, Python scalar UDFs and Pandas UDFs are supported. Other Python UDFs, including UDAFs, UDTFs, and Panda...

  • 915 Views
  • 1 replies
  • 0 kudos
Latest Reply
raphaelblg
Databricks Employee
  • 0 kudos

@RamlaS, UDFs on Unity Catalog is a feature that, at the current moment is still on the Public Preview stage. This means that the development has yet not finished.  UDFs can be used on DBR 13.3 and above, UDAFs are already available for DBR 15.2 on G...

  • 0 kudos
billraper
by New Contributor II
  • 671 Views
  • 2 replies
  • 0 kudos

New Users don't receive onboarding email

When I create a new user in Databricks, the new user does not receive their onboarding email.  It is not in their junkmail, deleted items or in their inbox.However, when I reset that user's password, they do receive the password reset link, and are a...

  • 671 Views
  • 2 replies
  • 0 kudos
Latest Reply
Rishabh_Tiwari
Databricks Employee
  • 0 kudos

Hi @billraper , I'm sorry to hear about the trouble. Would you mind sharing more about whether this is happening with a community portal profile or with a product profile? Please share the link to the profile with which you're experiencing this issue...

  • 0 kudos
1 More Replies
JoyceZhang
by New Contributor II
  • 537 Views
  • 1 replies
  • 0 kudos

cannot login to account management

HiI am not able to login to account  management (https://accounts.cloud.databricks.com), It somehow reninforce SSO, cannot login with username and password.  

  • 537 Views
  • 1 replies
  • 0 kudos
Latest Reply
" src="" />
This widget could not be displayed.
This widget could not be displayed.
This widget could not be displayed.
  • 0 kudos

This widget could not be displayed.
HiI am not able to login to account  management (https://accounts.cloud.databricks.com), It somehow reninforce SSO, cannot login with username and password.  

This widget could not be displayed.
  • 0 kudos
This widget could not be displayed.
camilo_s
by Contributor
  • 1215 Views
  • 1 replies
  • 1 kudos

Coarse-grained access management for jobs

Are there any perspectives in Databricks' roadmap for enabling coarse-grained access management for jobs?Currently, access to jobs has to be managed on a job by job basis: https://docs.databricks.com/en/security/auth-authz/access-control/index.html#j...

  • 1215 Views
  • 1 replies
  • 1 kudos
Latest Reply
camilo_s
Contributor
  • 1 kudos

Hi @Retired_mod, thanks for your reply.A more mature access management concept in Databricks would be definitely terrific. I understand it's not entirely along the AI-lines that Databricks is pushing hard currently, but it would improve a lot the pla...

  • 1 kudos
Carsten03
by New Contributor III
  • 15248 Views
  • 8 replies
  • 3 kudos

Resolved! Run workflow using git integration with service principal

Hi,I want to run a dbt workflow task and would like to use the git integration for that. Using my personal user I am able to do so but I am running my workflows using a service principal.I added git credentials and the repository using terraform. I a...

  • 15248 Views
  • 8 replies
  • 3 kudos
Latest Reply
camilo_s
Contributor
  • 3 kudos

I created that link using the "Share" button in the post but it's broken, sorry Here's a working link to the discussion: https://community.databricks.com/t5/data-engineering/git-credentials-for-service-principals-running-jobs/td-p/73802

  • 3 kudos
7 More Replies
Avvar2022
by Contributor
  • 3555 Views
  • 3 replies
  • 3 kudos

Is there a setting which restricts users from Creating Job and Pipeline?

as far i know currently ((as of 03-25-2024) databricks don't any workspace admin settings option to restrict users from creating a workflow/job or delta pipelines. Here is the use case for it Example: you have 3 tier landscape Dev, Qa and Prod.It is ...

Administration & Architecture
administration
jobs
pipelines
  • 3555 Views
  • 3 replies
  • 3 kudos
Latest Reply
camilo_s
Contributor
  • 3 kudos

 I notice that a separate discussion overlaps with the OPs issue: https://community.databricks.com/t5/data-engineering/restricting-workflow-creation-and-implementing-approval/td-p/4336@Retired_mod do you have a mechanism for clustering discussions fo...

  • 3 kudos
2 More Replies
RamlaS
by New Contributor II
  • 522 Views
  • 0 replies
  • 0 kudos

Can you all share your experiences on rolling out new features in Workspaces managed by you as Admin

As Data Platform Admins, do you follow some standard process or self service approach towards rolling out new features in your workspaces? Is the process automated? How is the testing done? Please share your thoughts. Given UC is introducing new feat...

  • 522 Views
  • 0 replies
  • 0 kudos

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group
Labels