Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

brickster_2018
by Databricks Employee
  • 21217 Views
  • 3 replies
  • 1 kudos
Latest Reply
Hugh_Ku
New Contributor II
  • 1 kudos

I've also run into the same issue: a customised Docker image does not expose DATABRICKS_RUNTIME_VERSION as an environment variable. I believe there are still many issues in how customised Docker images are used in Databricks clusters. Can anyone from Databricks help answer this?
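Until this is fixed, reading the variable defensively avoids hard failures on custom images. A minimal sketch (the fallback value is an assumption, pick whatever sentinel suits your code):

```python
import os

def runtime_version(default: str = "unknown") -> str:
    # Standard Databricks runtimes set DATABRICKS_RUNTIME_VERSION, but
    # custom Docker images may not, so fall back to a default instead
    # of raising a KeyError.
    return os.environ.get("DATABRICKS_RUNTIME_VERSION", default)
```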

2 More Replies
varshini_reddy
by New Contributor III
  • 2958 Views
  • 6 replies
  • 0 kudos

Databricks UC enabled but Lineage not found for one table

Databricks UC is enabled but lineage is not found for one table, whereas I can see the lineage for the other two. Any idea why? I'm performing a few transformations on bronze data, taking good_data_transformed as a DataFrame, creating a temp view fo...

Latest Reply
filipniziol
Esteemed Contributor
  • 0 kudos

It is because of the temp view. To debug further you would need to list all the source tables, transformations, target tables, actual lineage and expected lineage, but as a rule of thumb the lineage is lost when using a temp view. Lineage is captu...
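As an illustration of that rule of thumb, materializing the intermediate result as a real table instead of a temp view gives Unity Catalog a node to attach lineage to. A sketch (the helper name and writer options are placeholders):

```python
def materialize_for_lineage(df, target_table: str) -> str:
    # Temp views are session-scoped, so UC records no lineage through them.
    # Writing the intermediate DataFrame to a managed table keeps the
    # bronze -> staging -> silver chain visible in the lineage graph.
    df.write.mode("overwrite").saveAsTable(target_table)
    return target_table
```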

5 More Replies
karolinalbinsso
by New Contributor II
  • 4455 Views
  • 2 replies
  • 3 kudos

Resolved! How to access the job-Scheduling Date from within the notebook?

I have created a job that contains a notebook that reads a file from Azure Storage. The file name contains the date when the file was transferred to the storage. A new file arrives every Monday, and the read job is scheduled to run every Monday. I...

Latest Reply
Hubert-Dudek
Databricks MVP
  • 3 kudos

Hi, I guess the files are in the same directory structure, so you can use the cloudFiles Auto Loader. It will incrementally read only new files: https://docs.microsoft.com/en-us/azure/databricks/spark/latest/structured-streaming/auto-loader So it will ...
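A sketch of that approach: Auto Loader picks up only unseen files, and the file date can be parsed from the path exposed in the _metadata column (the export_YYYY-MM-DD.csv naming pattern is an assumption based on the question):

```python
import re
from datetime import date

def date_from_filename(path: str):
    # Hypothetical naming scheme: files like ".../export_2024-09-02.csv".
    m = re.search(r"(\d{4})-(\d{2})-(\d{2})", path)
    return date(*map(int, m.groups())) if m else None

def read_new_files(spark, source_dir: str):
    # Auto Loader incrementally reads only files not seen before;
    # _metadata.file_path carries the source file path for each row.
    return (spark.readStream
            .format("cloudFiles")
            .option("cloudFiles.format", "csv")
            .load(source_dir)
            .selectExpr("*", "_metadata.file_path AS source_file"))
```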

1 More Replies
csmcpherson
by New Contributor III
  • 1574 Views
  • 1 reply
  • 1 kudos

Resolved! Workflow file watch - capture filename trigger

With respect to the file watch trigger in workflows, how can we capture which file and/or path raised the trigger? I'd like to use this information to set parameters based on the file name and file path. Thank you!  https://...

Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 1 kudos

Hi @csmcpherson, this is currently not supported, but the Databricks team is working on that idea according to the thread below: Solved: File information is not passed to trigger job on f... - Databricks Community - 39266. As a workaround, if you use Auto Loader...
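Sketching that workaround: an Auto Loader stream can recover the triggering file's name and path through the _metadata column, even though the file-arrival trigger itself does not pass them to the job (the landing path and format are illustrative):

```python
def stream_with_file_info(spark, landing_path: str, fmt: str = "json"):
    # Each row keeps a reference to the file that produced it, which can
    # then drive per-file parameters downstream.
    return (spark.readStream
            .format("cloudFiles")
            .option("cloudFiles.format", fmt)
            .load(landing_path)
            .select("*", "_metadata.file_path", "_metadata.file_name"))
```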

Govardhana
by New Contributor
  • 5874 Views
  • 1 reply
  • 1 kudos

Interview question for ADF

Hello, I am attending interviews for Data Engineer roles and have 3 years of experience. I am looking for real-time interview questions; if anyone has some, could you please share? Thank you, Govardhana

Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 1 kudos

Hi @Govardhana, there are plenty of those questions on the internet. Below is one that's actually quite good: Top 40+ Azure Data Factory Interview Questions 2024 (k21academy.com)

Djelany
by New Contributor II
  • 5184 Views
  • 3 replies
  • 1 kudos

Resolved! DLT Event Logs

Hi, does anyone know what details:planning_information:technique_information[0]:cost under the planning_information event type means in my DLT workflow system event logs? For context, I'm trying to track the cost per run of my DLT workflow and I do not ha...

Latest Reply
adriennn
Valued Contributor
  • 1 kudos

You can enable the system.billing schema and see the costs of the runs in the usage table.
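A sketch of such a query against the system billing table, grouping DBU usage per pipeline update (run). The pipeline id is a placeholder, and the DBU totals still need to be joined to list prices for an actual cost figure:

```python
def dlt_cost_query(pipeline_id: str) -> str:
    # system.billing.usage records DBUs with a usage_metadata struct that
    # carries the DLT pipeline and update ids.
    return f"""
        SELECT usage_metadata.dlt_update_id AS update_id,
               SUM(usage_quantity)          AS dbus
        FROM system.billing.usage
        WHERE usage_metadata.dlt_pipeline_id = '{pipeline_id}'
        GROUP BY usage_metadata.dlt_update_id
    """

# On Databricks: spark.sql(dlt_cost_query("<your-pipeline-id>")).display()
```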

2 More Replies
jay971
by New Contributor II
  • 2951 Views
  • 3 replies
  • 0 kudos

Error: Cannot use legacy parameters because the job has job parameters configured.

I created a job which has two job parameters. How can I use the Databricks CLI to pass different values to those parameters?

Latest Reply
jay971
New Contributor II
  • 0 kudos

The job ran but did not pick up the values from the CLI.
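For comparison, a minimal sketch of triggering the run through the Databricks SDK, where job-level parameters must go in job_parameters rather than the legacy notebook_params (the job id and parameter names are placeholders; mixing legacy parameters into such a job raises the error in the title):

```python
def run_with_job_params(job_id: int, params: dict):
    # Requires the databricks-sdk package and configured authentication.
    from databricks.sdk import WorkspaceClient
    w = WorkspaceClient()
    # job_parameters targets job-level parameters defined on the job;
    # passing legacy notebook_params here instead triggers the
    # "Cannot use legacy parameters" error.
    return w.jobs.run_now(job_id=job_id, job_parameters=params)
```

With the CLI, the equivalent should be `databricks jobs run-now` with a JSON payload containing a `job_parameters` object rather than `notebook_params`.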

2 More Replies
Saf4Databricks
by New Contributor III
  • 3532 Views
  • 2 replies
  • 0 kudos

Reading single file from Databricks DBFS

I have a Test.csv file in the FileStore of DBFS in Databricks Community Edition. When I try to read the file using with open, I get the following error: FileNotFoundError: [Errno 2] No such file or directory: '/dbfs/FileStore/tables/Test.csv' import os wi...

Latest Reply
Saf4Databricks
New Contributor III
  • 0 kudos

@EricRM It should work. Please see the accepted response from this same forum here. So, we still need to find the cause of the error. The detailed error message follows; maybe this will help readers understand the issue better and help resolve...
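While the root cause is investigated, reading through the DBFS API instead of the /dbfs FUSE path usually works on Community Edition, where the local mount is not available. A sketch (the path is taken from the question, the byte limit is arbitrary):

```python
def read_head(dbutils, path: str = "dbfs:/FileStore/tables/Test.csv",
              max_bytes: int = 65536):
    # dbutils.fs goes through the DBFS API, so it does not depend on the
    # /dbfs FUSE mount that plain open('/dbfs/...') needs.
    return dbutils.fs.head(path, max_bytes)
```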

1 More Replies
databicky
by Contributor II
  • 23671 Views
  • 13 replies
  • 4 kudos
Latest Reply
FerArribas
Contributor
  • 4 kudos

Hi @Hubert Dudek, the pandas API doesn't support the abfss protocol. You have three options: 1) If you need to use pandas, write the Excel file to the local file system (DBFS) and then move it to ABFSS (for example with dbutils). 2) Write as CSV directly in abfss...
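The first option can be sketched like this (file names are illustrative; pandas' to_excel needs the openpyxl package installed on the cluster):

```python
def excel_to_abfss(pdf, dbutils, abfss_path: str) -> str:
    # pandas cannot write to abfss:// directly, so write to the driver's
    # local disk first, then copy the file to ADLS with dbutils.
    local_path = "/tmp/report.xlsx"
    pdf.to_excel(local_path, index=False)
    dbutils.fs.cp(f"file:{local_path}", abfss_path)
    return abfss_path
```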

12 More Replies
sakuraDev
by New Contributor II
  • 1371 Views
  • 1 reply
  • 2 kudos

Resolved! how does autoloader handle source outage

Hey guys, I've been looking for some docs on how Auto Loader manages a source outage. I am currently running the following code: dfBronze = (spark.readStream .format("cloudFiles") .option("cloudFiles.format", "json") .schema(json_schema_b...

(attached screenshot: sakuraDev_0-1725478024362.png)
Latest Reply
filipniziol
Esteemed Contributor
  • 2 kudos

Hi @sakuraDev, 1. Using the availableNow trigger processes all available data immediately and then stops the query. As you noticed, your data was processed once, and now you need to trigger the process again to pick up new files. 2. Changing the tr...
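The first option looks roughly like this (checkpoint location and target table are placeholders):

```python
def write_available_now(df, checkpoint: str, target_table: str):
    # availableNow processes everything present when the query starts and
    # then stops; files arriving later need another run (or a continuous
    # trigger such as processingTime) to be picked up.
    return (df.writeStream
            .option("checkpointLocation", checkpoint)
            .trigger(availableNow=True)
            .toTable(target_table))
```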

Soma
by Valued Contributor
  • 5588 Views
  • 6 replies
  • 3 kudos

Resolved! Dynamically supplying partitions to autoloader

We have a streaming use case and we see a lot of time spent listing files from Azure. Is it possible to supply partitions to Auto Loader dynamically, on the fly?

Latest Reply
Anonymous
Not applicable
  • 3 kudos

@somanath Sankaran - Thank you for posting your solution. Would you be happy to mark your answer as best so that other members may find it more quickly?

5 More Replies
188386
by New Contributor II
  • 2050 Views
  • 2 replies
  • 0 kudos

Databricks Learning - Get Started with Databricks for Data Engineering -> Next button not active

Hi, Databricks Learning - Get Started with Databricks for Data Engineering (ID: E-03ZW80) got stuck at the lesson where the file "get-started-with-data-engineering-on-databricks-2.1.zip" is downloaded. The "Next" button is not active - see attached picture. Li...

Latest Reply
Ajay-Pandey
Databricks MVP
  • 0 kudos

Hi @188386, as I can see, you have skipped multiple lessons (Introduction to Data on Databricks). First complete them in sequence; then the Next button will be enabled for you.

1 More Replies
IN
by New Contributor II
  • 1099 Views
  • 1 reply
  • 1 kudos

Connect to remote SQL Server (add databricks cluster IP to the whitelist)

Hi, I need to connect from a workspace notebook to a remote SQL Server instance. The server is protected by a firewall, so I need to add an IP address to the whitelist. Ideally, it would be possible to set up/allocate a static IP a...

Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 1 kudos

Hi @IN, in Azure you can deploy the workspace in VNet injection mode and attach a NAT Gateway to your VNet. The NAT gateway requires a public IP, and this IP will be the static egress IP for all clusters in this workspace. I've never worked with GCP, but I think you ...

Henrik_
by New Contributor III
  • 3275 Views
  • 1 reply
  • 1 kudos

Resolved! Optimizing recursive joins on group and UNION-operations.

The code snippet below takes each group (based on id) and performs recursive joins to build parent-child relations (id1 and id2) within a group. The code produces the correct output, an array in column 'path'. However, in my real-world use case, this c...

Latest Reply
-werners-
Esteemed Contributor III
  • 1 kudos

The recursive join is definitely a performance killer; it will blow up the query plan, so I would advise against using it. Alternatives? Well, a fixed number of joins, for example, if that is an option of course. Using a graph algorithm is also an opti...
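As a pure-Python illustration of the graph alternative (run per id group, e.g. via applyInPandas; the function name is hypothetical and acyclic input is assumed), a simple walk from the roots builds each root-to-leaf path without any recursive joins:

```python
from collections import defaultdict

def build_paths(edges):
    # edges: iterable of (parent, child) pairs within one group.
    # Build parent -> children adjacency, find roots (nodes that are
    # never a child), then walk depth-first, emitting each full path.
    children = defaultdict(list)
    has_parent = set()
    nodes = set()
    for a, b in edges:
        children[a].append(b)
        has_parent.add(b)
        nodes.update((a, b))
    roots = sorted(n for n in nodes if n not in has_parent)

    out = []
    def walk(node, path):
        if not children[node]:          # leaf: the path is complete
            out.append(path)
        for c in children[node]:
            walk(c, path + [c])
    for r in roots:
        walk(r, [r])
    return out
```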

n_joy
by Contributor
  • 7003 Views
  • 6 replies
  • 2 kudos

Resolved! Change data feed for tables with allowColumnDefaults property "enabled"

I have a Delta table already created, with both the enableChangeDataFeed option and the allowColumnDefaults property enabled. However, when writing to the CDC table with streaming queries it fails with the following error: [CREATE TABLE command because it ...

Latest Reply
n_joy
Contributor
  • 2 kudos

@filipniziol Yes, that is what I do. Thanks for the feedback!

5 More Replies