cancel
Showing results for 
Search instead for 
Did you mean: 
Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

GeKo
by Contributor
  • 25292 Views
  • 5 replies
  • 1 kudos

Insufficient privileges:User does not have permission SELECT on any file

Hello,after switching to "shared cluster" usage a python job is failing with error message:  Py4JJavaError: An error occurred while calling o877.load. : org.apache.spark.SparkSecurityException: [INSUFFICIENT_PERMISSIONS] Insufficient privileges: User...

Get Started Discussions
permissions
privileges
python
  • 25292 Views
  • 5 replies
  • 1 kudos
Latest Reply
Uj337
New Contributor III
  • 1 kudos

Hi @GeKo The checkpoint directory, is that set on cluster level or how do we set that ? Can you please help me with this ?

  • 1 kudos
4 More Replies
RobsonNLPT
by Contributor III
  • 1652 Views
  • 1 replies
  • 0 kudos

Databricks UC Data Lineage Official Limitations

Hi all.I have a huge data migration project using medallion architecture,  UC, notebooks and workflows . One of the relevant requirements we have is to capture all data dependencies (upstreams and downstreams) using data lineage. I've followed all re...

  • 1652 Views
  • 1 replies
  • 0 kudos
Latest Reply
MathieuDB
Databricks Employee
  • 0 kudos

Hello @RobsonNLPT , Yes SQL CTE are supported by the data lineage service. You can track table that were created using CTEs. Here is an example that demonstrate the feature. CREATE TABLE IF NOT EXISTS mpelletier.dbdemos.menu ( recipe_id INT, ...

  • 0 kudos
OlehSemeniuk
by New Contributor II
  • 3368 Views
  • 3 replies
  • 1 kudos

Resolved! Ingesting and Transforming NetCDF Data in Delta Table on Databricks Cluster

Hi,I need to ingest and transform historical climate data into a Delta table. The data is stored in .nc format (NetCDF). To work with this format, specific C libraries for Python are required, along with particular versions of Python libraries (e.g.,...

  • 3368 Views
  • 3 replies
  • 1 kudos
Latest Reply
Walter_C
Databricks Employee
  • 1 kudos

Great, please let us know in case any assistance is needed

  • 1 kudos
2 More Replies
Brianhourigan
by New Contributor II
  • 2370 Views
  • 5 replies
  • 0 kudos

Service Principal Access to Users Directory in Databricks - Creating Git Folders

I am trying to automate the creation of git folders in user workspace directories triggered by GitHub feature branch creation. When developers create feature branches in GitHub, we want a service principal to automatically create corresponding git fo...

  • 2370 Views
  • 5 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hi @Brianhourigan, Can you please DIM your suggestions? I can add it to our internal AHA idea.

  • 0 kudos
4 More Replies
iptkrisna
by New Contributor III
  • 3805 Views
  • 5 replies
  • 0 kudos

Restore deleted databricks jobs and job runs

Hi All,Is there a way to restore deleted databricks jobs?Thank you.

Get Started Discussions
Databricks
job-runs
Workflows
  • 3805 Views
  • 5 replies
  • 0 kudos
Latest Reply
hari-prasad
Valued Contributor II
  • 0 kudos

Hi @iptkrisna ,Currently, there is no option to recover deleted items. In architectures, it not necessary to control or manage the final code available in the system. Instead, the focus should be controlling and managing how code and jobs are deploye...

  • 0 kudos
4 More Replies
shwetamagar
by New Contributor II
  • 3068 Views
  • 1 replies
  • 1 kudos

Resolved! Unity Catalog : RDD Issue

In our existing notebooks, the scripts are reliant on RDDs. However, with the upgrade to Unity Catalog, RDDs will no longer be supported. We need to explore alternative approaches or tools to replace the use of RDDs. Could you suggest the best practi...

  • 3068 Views
  • 1 replies
  • 1 kudos
Latest Reply
Walter_C
Databricks Employee
  • 1 kudos

To transition from using RDDs (Resilient Distributed Datasets) to alternative approaches supported by Unity Catalog, you can follow these best practices and migration strategies: Use DataFrame API: The DataFrame API is the recommended alternative to...

  • 1 kudos
mrstevegross
by Contributor III
  • 3301 Views
  • 7 replies
  • 0 kudos

Resolved! Tutorial docs for running a job using serverless?

I'm exploring whether serverless (https://docs.databricks.com/en/jobs/run-serverless-jobs.html#create-a-job-using-serverless-compute) could be useful for our use case. I'd like to see an example of using serverless via the API. The docs say "To learn...

  • 3301 Views
  • 7 replies
  • 0 kudos
Latest Reply
mrstevegross
Contributor III
  • 0 kudos

Thanks!

  • 0 kudos
6 More Replies
aonurdemir
by Contributor
  • 1172 Views
  • 1 replies
  • 1 kudos

Resolved! Is there a cluster option for dashboards?

Hi everyone,I do not want to use 4 DBU/h XS warehouse since I have very tiny data on the new startup. I want to create a minimal cluster and run it as the underlying SQL engine for my dashboard.Thanks.

  • 1172 Views
  • 1 replies
  • 1 kudos
Latest Reply
Walter_C
Databricks Employee
  • 1 kudos

Unfortunately no, as dashboards are part of the SQL service on the platform they are designed to work with SQL warehouses only, you can create Notebook dashboards that will be able to work with regular clusters but functionalities will be limited in ...

  • 1 kudos
h2p5cq8
by New Contributor III
  • 2201 Views
  • 5 replies
  • 1 kudos

Resolved! Databricks workflow with sequenced tasks

I have a continuous workflow. It is continuous because I would like it to run every minute and if it has stuff to do the first task will take several minutes. As I understand, continuous workflows won't requeue while a job is currently running, where...

  • 2201 Views
  • 5 replies
  • 1 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 1 kudos

Hi @h2p5cq8, No problem! and you can have the queue option disabled to stop it. Go to the Advanced settings in the Job details side panel and toggle off the Queue option to prevent jobs from being queued

  • 1 kudos
4 More Replies
vicky403
by New Contributor
  • 1312 Views
  • 1 replies
  • 0 kudos

How Development Target works for multiple users?

Hi, I'm using the Databricks asset bundle to deploy my job to Azure Databricks.I want to configure the Databricks bundle so that when anyone runs the Azure pipeline, a job is created under their name in the format dev_username_job.Using a personal ac...

  • 1312 Views
  • 1 replies
  • 0 kudos
Latest Reply
zuzsad
New Contributor II
  • 0 kudos

Were you able to solve this?

  • 0 kudos
ahsan_aj
by Contributor II
  • 7611 Views
  • 5 replies
  • 0 kudos

Azure Databricks Enterprise Application User Impersonation Token Group Claims Issue

Hi all, I am using the Azure Databricks Microsoft Managed Enterprise Application scope (2ff814a6-3304-4ab8-85cb-cd0e6f879c1d/user_impersonation) to fetch an access token on behalf of a user. The authentication process is successful; however, the acce...

  • 7611 Views
  • 5 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hi @ahsan_aj, You can modify your token request by adding a claims parameter     const claimsRequest = {         "access_token": {             "groups": null         } https://learn.microsoft.com/en-us/security/zero-trust/develop/configure-tokens-gro...

  • 0 kudos
4 More Replies
cheerwthraj
by New Contributor
  • 2334 Views
  • 1 replies
  • 0 kudos

Best practices for tableau to connect to Databricks

Having problem in connecting to Databrikcs with service principal from tableau . Wanted to how how tableau extracts refreshing connecting to databricks , is it via individual Oauth or service principal

  • 2334 Views
  • 1 replies
  • 0 kudos
Latest Reply
saikumar246
Databricks Employee
  • 0 kudos

Hi @cheerwthraj,  To connect Tableau to Databricks and refresh extracts, you can use either OAuth or service principal authentication. For best practices, please refer to the below link, https://docs.databricks.com/en/partners/bi/tableau.html#best-pr...

  • 0 kudos
AbhishekNegi
by New Contributor
  • 1926 Views
  • 1 replies
  • 1 kudos

New Cluster 90% memory already consumed

Hi, seeing this on all new clusters (single or multi-node) I am creating. As soon as the metrics start showing up, the memory consumption shows 90% already consumed between Used and Cached (something like below). This is the case with higher or lower...

AbhishekNegi_0-1725911074420.png AbhishekNegi_1-1725911119189.png
  • 1926 Views
  • 1 replies
  • 1 kudos
Latest Reply
saikumar246
Databricks Employee
  • 1 kudos

Hi @AbhishekNegi I understand your concern. The reason for you to see memory consumption before initiating any task and regarding the comment taking time to execute. This is how Spark internally works. The memory consumption observed in a Spark clust...

  • 1 kudos
RobsonNLPT
by Contributor III
  • 8908 Views
  • 15 replies
  • 3 kudos

Delta Live Tables Permissions

Hi allI'm the owner of delta live tables pipelines but I don't see the option described on documentation to grant permissions for different users. The options available are "settings" and "delete"In the sidebar, click Delta Live Tables.Select the nam...

  • 8908 Views
  • 15 replies
  • 3 kudos
Latest Reply
Walter_C
Databricks Employee
  • 3 kudos

Ok might be that the version of the workspaces could be different and the new patch will be implemented soon.

  • 3 kudos
14 More Replies
Nandhini_Kumar
by New Contributor III
  • 3955 Views
  • 1 replies
  • 0 kudos

How the Scale up process done in the databricks cluster?

For my AWS databricks cluster, i configured shared computer with 1min worker node and 3 max worker node, initailly only one worker node and driver node instance is created in the AWS console. Is there any rule set by databricks for scale up the next ...

  • 3955 Views
  • 1 replies
  • 0 kudos
Latest Reply
NandiniN
Databricks Employee
  • 0 kudos

Databricks uses autoscaling to manage the number of worker nodes in a cluster based on the workload. When you configure a cluster with a minimum and maximum number of worker nodes, Databricks automatically adjusts the number of workers within this ra...

  • 0 kudos

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now
Labels