cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

210573
by New Contributor
  • 6374 Views
  • 4 replies
  • 1 kudos

Internal error. Attach your notebook to a different cluster or restart the current cluster.

Started getting this error while running all the scripts. All the scripts were running fine before. I tried de-attaching and also restart nothing seems to work.Internal error. Attach your notebook to a different cluster or restart the current cluste...

  • 6374 Views
  • 4 replies
  • 1 kudos
Latest Reply
tieu_quyen
New Contributor II
  • 1 kudos

Hi @210573 (Customer)​ ,I got the same error, tried to restart and create a new cluster but the solution does not work. What I did to fix the issue: Instead of putting in function, break the code out to run line by line. I just want to see where the ...

  • 1 kudos
3 More Replies
TaBorjaTa
by New Contributor II
  • 3390 Views
  • 1 replies
  • 2 kudos

Pytest imports of sibling modules when using Databricks for VSCode

Hello all, I am following the Databrick's documentation on unit testing found here: Run tests with pytest for the Databricks extension for Visual Studio Code - Azure Databricks | Microsoft LearnHowever, when taking it a step further I get an ImportEr...

Data Engineering
pytest
VSCode
  • 3390 Views
  • 1 replies
  • 2 kudos
Latest Reply
Trifa
New Contributor II
  • 2 kudos

HelloImport errors happen often with Pytest. To Debug this error you can add this in your "test_myfunction_test.py":import sys # printing all directories for # interpreter to search sys.pathsys.path is a built-in variable within the sys module. I...

  • 2 kudos
AFox
by Contributor
  • 2216 Views
  • 7 replies
  • 0 kudos

databricks-connector: Error: Cluster MASKED is in unexpected state Pending.

Is there a way to make databricks-connector wait for cluster to be running?Details:databricks-connector==13.1.0 and the python minor version of cluster and environment are both 3.10If the cluster is not running this will start it, but any commands af...

  • 2216 Views
  • 7 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @AFox , I want to express my gratitude for your effort in selecting the most suitable solution. It's great to hear that your query has been successfully resolved. Thank you for your contribution. 

  • 0 kudos
6 More Replies
AndyAtINX
by New Contributor III
  • 1534 Views
  • 4 replies
  • 1 kudos

Resolved! Error inviting user to workspace "Failed to add user: A user with email ... or username ... in different cases already exist in the account"

We have 3 workspaces - 1 old version in one AWS account, 2 latest versions in another.We are PAYG full edition, not using SSO.Our admins (existing DBX users in the `admins` group) can invite new users via the Admin Console from the 1 old and 1 new wo...

  • 1534 Views
  • 4 replies
  • 1 kudos
Latest Reply
Schneider-Elect
New Contributor II
  • 1 kudos

We are facing same issue, We are on azure. @AndyAtINX you mean if user exist in workspace with abc@gmail.com we should add the user in workspace2 with abc@gmail.com not ABC@GMAIL.COM. if this the case we tried this and its not working for us.

  • 1 kudos
3 More Replies
OliverCadman
by New Contributor III
  • 3189 Views
  • 2 replies
  • 0 kudos

DUPLICATE: Missing 'DBAcademy DLT' as a Cluster Policy when creating Delta Live Tables pipeline

Good afternoon,I'm currently going through Module 4 of the Data Engineering Associate pathway, specifically lesson 4.1 - DLT UI Walkthrough. We are instructed to specify the Cluster Policy as 'DBAcademy DLT' when configuring the pipeline. However, th...

Data Engineering
Data engineer Associate
dlt
pipeline
pipeline configuration
  • 3189 Views
  • 2 replies
  • 0 kudos
Latest Reply
SeRo
New Contributor II
  • 0 kudos

The policy will be available after running the notebook /Users/<YOUR USER NAME>/Data Engineering with Databricks - v3.1.4/Includes/Workspace-Setup 

  • 0 kudos
1 More Replies
Shankar
by New Contributor III
  • 2626 Views
  • 1 replies
  • 1 kudos

How does deletedFileRetentionDuration and logRetentionDuration associated with Vacuum?

I am trying to learn more about Vacuum operation and came across the two properties: delta.deletedFileRetentionDurationdelta.logRetentionDurationSo, let's say I have a delta table where few records/files have been deleted. The delta.deletedFileRetent...

Data Engineering
delta
deltatables
vacuum
  • 2626 Views
  • 1 replies
  • 1 kudos
Latest Reply
dasiekr
New Contributor II
  • 1 kudos

No answers for those question?I also find it not clear enough to understand this process of underlying parquet files retention.Can someone help here?

  • 1 kudos
rbricks007
by New Contributor II
  • 1067 Views
  • 2 replies
  • 0 kudos

Resolved! Trying to use pivot function with pyspark for count aggregate

I'm trying this code but getting the following error testDF = (eventsDF .groupBy("user_id") .pivot("event_name") .count("event_name")) TypeError: _api() takes 1 positional argument but 2 were givenPlease guide how to fix...

Data Engineering
count
pivot
python
  • 1067 Views
  • 2 replies
  • 0 kudos
Latest Reply
Krishnamatta
New Contributor III
  • 0 kudos

Try thisfrom pyspark.sql import functions as F testDF = (eventsDF .groupBy("user_id") .pivot("event_name") .agg(F.count("event_name")))  

  • 0 kudos
1 More Replies
rt-slowth
by Contributor
  • 538 Views
  • 1 replies
  • 0 kudos

Resolved! how to use dlt module in streaming pipeline

If anyone has example code for building a CDC live streaming pipeline generated by AWS DMS using import dlt, I'd love to see it.I'm currently able to see the parquet file starting with Load on the first full load to S3 and the cdc parquet file after ...

  • 538 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @rt-slowth ,  Certainly! Let’s explore how to create a Change Data Capture (CDC) live streaming pipeline using Delta Live Tables and AWS Database Migration Service (DMS). Delta Live Tables and AWS DMS: Delta Live Tables is an open-source storage ...

  • 0 kudos
alexiswl
by Contributor
  • 3182 Views
  • 4 replies
  • 0 kudos

Resolved! Create a UDF Table Function with DLT in UC

Hello, I am trying to generate a DLT but need to use a UDF Table Function in the process.  This is what I have so far, everything works (without e CREATE OR REFRESH LIVE TABLE wrapper)```sqlCREATE OR REPLACE FUNCTION silver.portal.get_workflows_from_...

  • 3182 Views
  • 4 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @alexiswl , I want to express my gratitude for your effort in selecting the most suitable solution. It's great to hear that your query has been successfully resolved. Thank you for your contribution. 

  • 0 kudos
3 More Replies
JonLaRose
by New Contributor III
  • 1518 Views
  • 4 replies
  • 0 kudos

Resolved! Max amount of tables

Hi!What is the maximum amount of tables that is possible to create in a Unity catalog?Is there any difference between managed and external tables? If so, what is the limit for external tables? Thanks,Jonathan.

  • 1518 Views
  • 4 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @JonLaRose , I want to express my gratitude for your effort in selecting the most suitable solution. It's great to hear that your query has been successfully resolved. Thank you for your contribution. 

  • 0 kudos
3 More Replies
js54123875
by New Contributor III
  • 3706 Views
  • 3 replies
  • 0 kudos

Resolved! Power BI - Databricks Connection using Service Principal PAT Refresh

What is best practice for automatically refreshing service princpal PAT in Power BI for a connection to a Databricks dataset? Ideally when the PAT is updated it will automatically be stored in Azure Key Vault, is there a way that Power BI can pick it...

Data Engineering
Azure Key Vault
Personal Access Token
Power BI
Service Principal
  • 3706 Views
  • 3 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @js54123875 , Certainly! Refreshing a Power BI dataset with a Service Principal and managing PATs can be achieved through a combination of best practices.    Let’s explore some approaches:   Service Principal and Azure Key Vault: Create a Service ...

  • 0 kudos
2 More Replies
rt-slowth
by Contributor
  • 2451 Views
  • 2 replies
  • 0 kudos

Resolved! how to run @dlt pipeline in vscode

I want to test a pipeline created using dlt and python in vscode.

  • 2451 Views
  • 2 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @rt-slowth , To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers your question?This will also help other community members who may have similar ques...

  • 0 kudos
1 More Replies
dev_puli
by New Contributor III
  • 2924 Views
  • 3 replies
  • 0 kudos

tracing the history of a workflow

Hi!I use Databricks in Azure and I find it inconvenient not knowing the last modified user and modified time. How can I trace the history of modified time and user details? Would it be possible to deploy the workflows into higher environments?Thanks!

Data Engineering
azure
Workflows
  • 2924 Views
  • 3 replies
  • 0 kudos
Latest Reply
dev_puli
New Contributor III
  • 0 kudos

Sorry! I added another issue at the end without mentioning it was a new issue I encountered. I had challenges in changing the owner of a workflow when I created a workflow. I ended up seeking help from another user with admin privileges to change the...

  • 0 kudos
2 More Replies
Labels
Top Kudoed Authors