cancel
Showing results for 
Search instead for 
Did you mean: 
Community Platform Discussions
Connect with fellow community members to discuss general topics related to the Databricks platform, industry trends, and best practices. Share experiences, ask questions, and foster collaboration within the community.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

DavidKxx
by Contributor
  • 2538 Views
  • 4 replies
  • 0 kudos

Resolved! Have code stay hidden even when the notebook is copied

When I save a certain Python notebook where I have selected Hide Code and Hide Results on certain cells, those conditions persist.  For example, when I come back the next day in a new session, the hidden material is still hidden.When the notebook is ...

  • 2538 Views
  • 4 replies
  • 0 kudos
Latest Reply
Alok7661
New Contributor II
  • 0 kudos

In my situation we cannot split this notebook as ADF pipeline is already in PROD ,I have tried to use the option %%capture .It helps to ran the notebook within size limits but somehow it is corrupting the output. Also checked in the Databricks AI and...

  • 0 kudos
3 More Replies
canbirlik
by New Contributor
  • 125 Views
  • 1 replies
  • 0 kudos

Databricks Asset Bundles - Failed to install provider

Hello All,When I try to deploy my bundle, I get the following error.I can't edit the bundle.tf.json, I suppose it is created automatically. Does anyone have a solution for the same problem?Many Thanks,Can$ databricks bundle deploy -t devBuilding my_p...

  • 125 Views
  • 1 replies
  • 0 kudos
Latest Reply
chris0991
New Contributor II
  • 0 kudos

Hi Can, you're encountering frustrating issues with your bundle deployment. Have you checked your internet connection or tried again later? Sometimes, a simple retry can help resolve authentication errors. Also, if you're looking for useful tools to ...

  • 0 kudos
Databricks_info
by New Contributor II
  • 4527 Views
  • 5 replies
  • 0 kudos

Concurrent Update to Delta - Throws error

Team,I get a ConcurrentAppendException: Files were added to the root of the table by a concurrent update when trying to update a table which executes via jobs with for each activity in ADF,I tried with Databricks run time 14.x and set the delete vect...

  • 4527 Views
  • 5 replies
  • 0 kudos
Latest Reply
rpiotr
New Contributor II
  • 0 kudos

In case of such an issue, I would like to suggest apply retry and try except logic (you can use one of existing libraries) in both concurrent updates - it should help, and jobs won't report any error.

  • 0 kudos
4 More Replies
PushkarDeole
by New Contributor III
  • 161 Views
  • 3 replies
  • 0 kudos

Unable to set shuffle partitions on DLT pipeline

Hello,We are using a 5 worker node DLT job compute for a continuous mode streaming pipeline. The worker configuration is Standard_D4ads_v5 i.e. 4 cores so total cores across 5 workers is 20 cores.We have wide transformation at some places in the pipe...

  • 161 Views
  • 3 replies
  • 0 kudos
Latest Reply
szymon_dybczak
Contributor III
  • 0 kudos

Hi @PushkarDeole ,Each Delta Live Tables pipeline has two associated clusters:The updates cluster processes pipeline updates.The maintenance cluster runs daily maintenance tasks.According to docs, if you want to configure settings at the pipeline lev...

  • 0 kudos
2 More Replies
ramesh7598
by New Contributor II
  • 337 Views
  • 1 replies
  • 0 kudos

Clarification on Acceptable ID Proof for Databricks Associate Exam

Hi,I am planning to take the Databricks Associate exam in the upcoming week. My current ID proof is my original driving license issued by the Tamil Nadu government; however, it is laminated rather than a hard plastic card.Could you please confirm if ...

  • 337 Views
  • 1 replies
  • 0 kudos
Latest Reply
ramesh7598
New Contributor II
  • 0 kudos

Hi Databricks community and @Cert-Team,Can you please help me here? The reason I am asking is Microsoft certification doesn’t approve laminated id proof’s.

  • 0 kudos
lakshmikanth-01
by New Contributor II
  • 192 Views
  • 1 replies
  • 0 kudos

Unable to access Community Edition

I'm trying to create community edition account but everytime i try it shows" An error has occurred. Please try again later."I have attached the screenshot.I also tried accesiing Databricks in a differenct pc based on the answers from previous threads...

  • 192 Views
  • 1 replies
  • 0 kudos
Latest Reply
gchandra
Valued Contributor II
  • 0 kudos

By any chance is that email address already registered with another Databricks Tier (like company or paid)?  Can you try in incognito mode with all ad blockers/popup blockers disabled

  • 0 kudos
AcrobaticMonkey
by New Contributor II
  • 190 Views
  • 1 replies
  • 1 kudos

Instance profiles are not working in Shared access mode

I’m trying to fetch billing data from an AWS account using boto3 to assume a role that has access to this information. This operation works fine in No Isolation and Single User access modes, but it fails in Shared access mode. Since I need to store t...

  • 190 Views
  • 1 replies
  • 1 kudos
Latest Reply
gchandra
Valued Contributor II
  • 1 kudos

Instance Profiles are open for all, they won't work in Shared access mode. Using Storage credentials, create  External Locations. Cleaner way to Govern the access.

  • 1 kudos
RahulChaubey
by New Contributor III
  • 2146 Views
  • 4 replies
  • 1 kudos

Resolved! Can we get notebook owner using notebook path as parameter in api ?

I need to get the notebook owner using api or some other way by passing notebook path as parameter.

  • 2146 Views
  • 4 replies
  • 1 kudos
Latest Reply
nkumar18
New Contributor II
  • 1 kudos

Hi @RahulChaubey Did you find any solution to your problem?

  • 1 kudos
3 More Replies
Kinger
by New Contributor
  • 1828 Views
  • 2 replies
  • 0 kudos

Associating a Git Credential with a Service Principal using Terraform Provider (AWS)

I am attempting to create a Databrick Repo in a workspace via Terraform. I would like the Repo and the associated Git Credential to be associated with a Service Principal. In my initial run, the Terraform provider is associated with the user defined ...

  • 1828 Views
  • 2 replies
  • 0 kudos
Latest Reply
nicole_lu_PM
Contributor III
  • 0 kudos

Hi Kinger and Debi-Moha, Do the steps in the "Use a service principal with Databricks Git folders" documentation work for you?  Specifically for Terraform: https://docs.databricks.com/en/repos/ci-cd-techniques-with-repos.html#terraform-integration Th...

  • 0 kudos
1 More Replies
dbx_687_3__1b3Q
by New Contributor III
  • 1301 Views
  • 3 replies
  • 1 kudos

Impersonating a user

How do I impersonate a user? I can't find any documentation that explains how to do this or even hint that it's possible.Use case: I perform administrative tasks like assign grants and roles to catalogs, schemas, and tables for the benefit of busines...

  • 1301 Views
  • 3 replies
  • 1 kudos
Latest Reply
TMD
New Contributor III
  • 1 kudos

Hello, This is an important feature. Here's an idea submitted just now - https://ideas.databricks.com/ideas/DBE-I-1511The idea has the link to this discussion.    

  • 1 kudos
2 More Replies
datastones
by Contributor
  • 524 Views
  • 6 replies
  • 3 kudos

Resolved! Data loss after writing a transformed pyspark dataframe to delta table in unity catalog

Hey guys, after some successful data preprocessing without any errors, i have a final dataframe shape with the shape of ~ (200M, 150). the cluster i am using has sufficient ram + cpus + autoscaling, all metrics look fine after the job was done.The pr...

Community Platform Discussions
Data Processing
help
  • 524 Views
  • 6 replies
  • 3 kudos
Latest Reply
datastones
Contributor
  • 3 kudos

@szymon_dybczak i could resolve it now! basically, i broke the process down into further subprocesses, for each sub process, i cached and wrote them all into delta table (without overwritting), the next subprocess needs to read data in the delta tabl...

  • 3 kudos
5 More Replies
DavidKxx
by Contributor
  • 320 Views
  • 3 replies
  • 3 kudos

Display data as multi-line in dashboard table

I am displaying a table in a notebook dashboard.  One column of the data is conceptually a list of strings.  I can originate or convert the list as whatever format would be useful (as a string representing a JSON array, as an ARRAY struct, etc.). I w...

solution1.png solution2.png
  • 320 Views
  • 3 replies
  • 3 kudos
Latest Reply
filipniziol
New Contributor III
  • 3 kudos

Hi @DavidKxx ,What you can do is convert your array to into an HTML formatted string with bullet points.Here is the code: # Sample data with an array column data = [ (1, ['Apple', 'Banana', 'Cherry']), (2, ['Dug', 'Elephant']), (3, ['Fish...

  • 3 kudos
2 More Replies
DavidKxx
by Contributor
  • 214 Views
  • 1 replies
  • 0 kudos

Word wrap in dashboards

When I'm displaying a Table-style visualization in a notebook dashboard, is there a setting I can apply to a text column so that it automatically word-wraps text longer than the display width of the column?For example, in the following dashboard disp...

solution3.png
  • 214 Views
  • 1 replies
  • 0 kudos
Latest Reply
filipniziol
New Contributor III
  • 0 kudos

Hi @DavidKxx ,That is quite similar question to one about displaying array as bullet list. Since you were successful in implementing displayHTML, what do you think about doing similar in this case? # Sample DataFrame with long text data = [ (1, '...

  • 0 kudos

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group
Top Kudoed Authors