Administration & Architecture
Explore discussions on Databricks administration, deployment strategies, and architectural best practices. Connect with administrators and architects to optimize your Databricks environment for performance, scalability, and security.


Forum Posts

sk_databricks
by New Contributor
  • 426 Views
  • 1 reply
  • 0 kudos

Databricks Cache Options

Hi, we are working on a Databricks solution hosted on AWS and are exploring the caching options in Databricks. Apart from the Databricks cache and the Spark cache, what are the options? Is it feasible to use third-party cache solutions like AWS ElastiCache ...

Latest Reply
Walter_C
Honored Contributor
  • 0 kudos

Databricks provides several caching options that improve performance by minimizing input/output (I/O) operations. These include: Databricks Disk Cache: This cache accelerates data reads by creating copies of remote Parquet data file...

ccsong
by New Contributor II
  • 872 Views
  • 3 replies
  • 0 kudos

How can we share a Databricks ML runtime cluster among users when Unity Catalog is enabled?

Hi team, currently we use the Databricks ML runtime to run our workflows and sometimes to do EDA. We want to create a Databricks ML runtime cluster for the team to share. With Unity Catalog enabled, how could we create a shared ML runt...

Latest Reply
Walter_C
Honored Contributor
  • 0 kudos

Right now there is no plan to support the ML runtime in shared clusters. Engineering is working on additional solutions, but no ETA is currently available. As for why it is not supported, the principal reason is isolation, which is not available in ...

2 More Replies
sandipkumar
by New Contributor II
  • 684 Views
  • 2 replies
  • 0 kudos

Updating Python version from 3.8 to 3.12 for S3 ingestion

I went to the AWS CloudFormation stack, edited the template from Python 3.8 to 3.12, and updated it. I did this for both the workspace stack and the S3 ingestion stack. Will it break anything? Do I need to make any changes in the Python code in the templa...

Latest Reply
sandipkumar
New Contributor II
  • 0 kudos

Hi @Kaniz_Fatma, thanks a lot! I will look at StackSets. As I mentioned, the code was written not by me but by Databricks. Why does Databricks not use a newer Python version in its default stacks? We are low on resources and rely heavily on the default Datab...

1 More Replies
dbrx_user
by New Contributor III
  • 1325 Views
  • 4 replies
  • 6 kudos

Resolved! Move to 100% Serverless

Hi all, a few questions about the upcoming transition to 100% serverless; any info would be great! When will the move to serverless occur? I understand from 1st July (today), but has anyone seen a roadmap? What will the move to serverl...

Latest Reply
m997al
Contributor III
  • 6 kudos

Hi, our Databricks contact just assured us of the following after we asked about this issue: Databricks is officially (but won’t be GA in every region until the end of July) 100% Serverless OPTIONAL. We understand many of our customers have begged for 100%...

3 More Replies
Sadam97
by New Contributor
  • 567 Views
  • 1 reply
  • 0 kudos

Databricks (GCP) Cluster not resolving Hostname into IP address

We have #mongodb hosts that must be resolved to private internal load balancer IPs (of another cluster), and we are unable to add host aliases in the Databricks GKE cluster so that Spark can connect to MongoDB and resolve t...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Sadam97, first, verify that your DNS server is responding. You can do this by running a ping command from a notebook in Databricks to reach your secondary DNS server. Launch a Web Terminal from the cluster workspace. Edit the /etc/resolv.conf fil...
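
Alongside a ping, name resolution itself can be checked from a notebook cell. The following is only a minimal sketch using Python's standard `socket` module; the helper name `resolve_host` and the hostname in the usage note are hypothetical, and the check only reflects the node the cell runs on (typically the driver), not every worker.

```python
import socket
from typing import Callable, List


def resolve_host(hostname: str, resolver: Callable = socket.getaddrinfo) -> List[str]:
    """Return the sorted, unique IP addresses a hostname resolves to, or [] on failure.

    `resolver` defaults to socket.getaddrinfo; it is a parameter only so the
    helper can be exercised without a live DNS server (an assumption of this sketch).
    """
    try:
        infos = resolver(hostname, None)
    except socket.gaierror:
        # The name could not be resolved by the resolvers configured on this node.
        return []
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the IP.
    return sorted({info[4][0] for info in infos})
```

Calling `resolve_host("mongodb.internal.example")` (a placeholder name) returns an empty list when the node cannot resolve the host, which would point to a DNS configuration problem rather than a MongoDB connectivity problem.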

RamlaS
by New Contributor II
  • 772 Views
  • 1 reply
  • 0 kudos

Resolved! Unity Catalog Pandas on Spark Limitation

According to the Databricks UC documentation, below are some of the limitations of shared mode clusters. 1. In Databricks Runtime 13.3 LTS and above, Python scalar UDFs and Pandas UDFs are supported. Other Python UDFs, including UDAFs, UDTFs, and Panda...

Latest Reply
raphaelblg
Honored Contributor II
  • 0 kudos

@RamlaS, UDFs on Unity Catalog is a feature that is currently still in Public Preview. This means that development is not yet finished. UDFs can be used on DBR 13.3 and above; UDAFs are already available for DBR 15.2 on G...

billraper
by New Contributor II
  • 590 Views
  • 2 replies
  • 0 kudos

New Users don't receive onboarding email

When I create a new user in Databricks, the new user does not receive their onboarding email. It is not in their junk mail, deleted items, or inbox. However, when I reset that user's password, they do receive the password reset link, and are a...

Latest Reply
Rishabh_Tiwari
Community Manager
  • 0 kudos

Hi @billraper, I'm sorry to hear about the trouble. Would you mind sharing whether this is happening with a community portal profile or with a product profile? Please share the link to the profile with which you're experiencing this issue...

1 More Replies
JoyceZhang
by New Contributor II
  • 487 Views
  • 2 replies
  • 0 kudos

Cannot log in to account management

Hi, I am not able to log in to account management (https://accounts.cloud.databricks.com). It somehow enforces SSO, and I cannot log in with username and password.

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @JoyceZhang, Thank you for contacting Databricks Community Discussion Forum.   Please note that for any issues related to the Databricks Community Edition product, you can find helpful resources here. If you encounter any difficulties beyond what'...

1 More Replies
RamlaS
by New Contributor II
  • 410 Views
  • 1 reply
  • 0 kudos

How are platform admin teams managing the "allow list" for libraries feature in UC?

We have many teams using Maven libraries and are in the process of UC migration. These Maven coordinates need to be added to the "allow list" before they can be used in clusters. What is the standard process followed by admin teams for this feature? Do...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @RamlaS, managing the “allow list” for libraries in Databricks involves ensuring that specific Maven coordinates are approved for use in clusters. Manual Approval Process: Admin teams manually review and approve Maven coordinates. Users submit...

RamlaS
by New Contributor II
  • 479 Views
  • 1 reply
  • 0 kudos

Can you all share your experiences on rolling out new features in Workspaces managed by you as Admin

As Data Platform Admins, do you follow a standard process or a self-service approach to rolling out new features in your workspaces? Is the process automated? How is the testing done? Please share your thoughts. Given UC is introducing new feat...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @RamlaS, Rolling out new features in Databricks Workspaces as an admin involves several considerations. While practices can vary based on your organization’s specific needs, here are some general guidelines and best practices: Communication ...

camilo_s
by Contributor
  • 1148 Views
  • 2 replies
  • 1 kudos

Coarse-grained access management for jobs

Are there any plans in Databricks' roadmap for enabling coarse-grained access management for jobs? Currently, access to jobs has to be managed on a job-by-job basis: https://docs.databricks.com/en/security/auth-authz/access-control/index.html#j...

Latest Reply
camilo_s
Contributor
  • 1 kudos

Hi @Kaniz_Fatma, thanks for your reply. A more mature access management concept in Databricks would definitely be terrific. I understand it's not entirely along the AI lines that Databricks is currently pushing hard, but it would greatly improve the pla...

1 More Replies
Carsten03
by New Contributor III
  • 13353 Views
  • 8 replies
  • 3 kudos

Resolved! Run workflow using git integration with service principal

Hi, I want to run a dbt workflow task and would like to use the Git integration for that. Using my personal user I am able to do so, but I am running my workflows using a service principal. I added Git credentials and the repository using Terraform. I a...

Latest Reply
camilo_s
Contributor
  • 3 kudos

I created that link using the "Share" button in the post, but it's broken, sorry. Here's a working link to the discussion: https://community.databricks.com/t5/data-engineering/git-credentials-for-service-principals-running-jobs/td-p/73802

7 More Replies
Avvar2022
by Contributor
  • 3231 Views
  • 4 replies
  • 3 kudos

Is there a setting which restricts users from Creating Job and Pipeline?

As far as I know, currently (as of 03-25-2024) Databricks doesn't have any workspace admin setting to restrict users from creating a workflow/job or Delta pipelines. Here is the use case for it. Example: you have a 3-tier landscape: Dev, QA, and Prod. It is ...

Administration & Architecture
administration
jobs
pipelines
Latest Reply
camilo_s
Contributor
  • 3 kudos

I notice that a separate discussion overlaps with the OP's issue: https://community.databricks.com/t5/data-engineering/restricting-workflow-creation-and-implementing-approval/td-p/4336 @Kaniz_Fatma, do you have a mechanism for clustering discussions fo...

3 More Replies
RozaZaharieva
by New Contributor
  • 3312 Views
  • 3 replies
  • 3 kudos

Get Azure Databricks Account ID

Hi everyone, is it possible with Terraform, the Azure CLI, or any other non-manual method to get the Azure Databricks Account ID, rather than using the manual method described here - https://learn.microsoft.com/en-us/azure/databricks/administrati...

Administration & Architecture
azuredatabricks
iac
Terraform
Latest Reply
SHeisterkamp
New Contributor II
  • 3 kudos

I am also looking for a solution to this. The path suggested by @Kaniz_Fatma does not work! When I run the proposed az cli, this is what I get:```$> az databricks workspace show --resource-group $my_rg --name $my_ws --query 'id'"/subscriptions/64e..<...

2 More Replies
adb-rm
by New Contributor II
  • 1213 Views
  • 4 replies
  • 0 kudos

Unity catalog and metadata to backup site

Hi, I am looking to create a Databricks DR (disaster recovery) site. My primary setup is in West US, and my primary Databricks workspace is created in the primary site with Unity Catalog enabled. With the same metadata, I want to bring up Databricks in my secondary sid...

Latest Reply
Walter_C
Honored Contributor
  • 0 kudos

As of now, UC does not have a DR setting. This is on our prioritized roadmap, but no ETA is currently available. Some customers host an external metastore and replicate it across regions to achieve this.

3 More Replies
