Data Engineering

Forum Posts

Sorted by:

by swetha • New Contributor III

08-30-2022 4:42:29 AM

1378 Views
2 replies
1 kudos

I am unable to attach a streaming listener to a spark streaming job. Error: no streaming listener attached to the spark application is the error we are observing post accessing streaming statistics API. Please help us with this issue ASAP. Thanks.

Issue:After adding the listener jar file in the cluster init script, the listener is working (From what I see in the stdout/log4j logs)But when I try to hit the 'Content-Type: application/json' http://host:port/api/v1/applications/app-id/streaming/st...

Data Engineering

1378 Views
2 replies
1 kudos

08-30-2022 4:42:29 AM

View Replies

Latest Reply

Vidula
Honored Contributor

09-15-2022 4:05:18 AM

1 kudos

Hi @swetha kadiyala Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Th...

1 kudos

09-15-2022 4:05:18 AM

1 More Replies

by Sadiq • New Contributor III

08-29-2022 10:15:41 AM

1478 Views
6 replies
4 kudos

Fixed length file from Databricks notebook ( Spark SQL)

Hi ,I need help writing data from azure databricks notebook into Fixed Length .txt.notebook has 10 lakh rows and 86 columns. can anyone suggest me

Data Engineering

1478 Views
6 replies
4 kudos

08-29-2022 10:15:41 AM

View Replies

Latest Reply

Vidula
Honored Contributor

09-15-2022 3:47:10 AM

4 kudos

Hi @sadiq vali Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thanks!

4 kudos

09-15-2022 3:47:10 AM

5 More Replies

by PrebenOlsen • New Contributor III

08-26-2022 4:31:47 AM

1429 Views
4 replies
1 kudos

GroupBy in delta live tables fails with error "RuntimeError: Query function must return either a Spark or Koalas DataFrame"

I have a delta live table that I'm trying to run GroupBy on, but getting an error: "RuntimeError: Query function must return either a Spark or Koalas DataFrame". Here is my code:@dlt.table def groups_hierarchy(): df = dlt.read_stream("groups_h...

Data Engineering

1429 Views
4 replies
1 kudos

08-26-2022 4:31:47 AM

View Replies

Latest Reply

Vidula
Honored Contributor

09-15-2022 3:33:44 AM

1 kudos

Hi @Preben Olsen Does @Debayan Mukherjee response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?We'd love to hear from you.Thanks!

1 kudos

09-15-2022 3:33:44 AM

3 More Replies

by 190809 • Contributor

09-15-2022 3:00:36 AM

554 Views
0 replies
0 kudos

Pulling Data From Stripe to Databricks using the Webhook

I am doing some investigation in how to connect Databricks and Stripe. Stirpe has really good documentation and I have decided to set up a webhook in Django as per their recommendation. This function handles events as they occur in stripe:-----------...

Data Engineering

554 Views
0 replies
0 kudos

09-15-2022 3:00:36 AM

by Munni • New Contributor II

09-14-2022 10:06:20 PM

284 Views
0 replies
0 kudos

Hai,I need somehelp,I am reading csv file through pyspark ,in which one field encoded with double quotes,I should get that value along with double quo...

Hai,I need somehelp,I am reading csv file through pyspark ,in which one field encoded with double quotes,I should get that value along with double quotes.Spark version is 3.0.1.col1,col2,col3"A",""B,C"","D"-----------INPUTOUTPUT:A , "B,C" , D

Data Engineering

284 Views
0 replies
0 kudos

09-14-2022 10:06:20 PM

by KrishZ • Contributor

09-13-2022 11:08:50 PM

1364 Views
2 replies
1 kudos

Where to report a bug with Databricks ?

I have in issue in Pyspark.Pandas to report.Is there a github or some forum where I can register my issue?Here's the issue

Data Engineering

1364 Views
2 replies
1 kudos

09-13-2022 11:08:50 PM

View Replies

Latest Reply

Debayan
Esteemed Contributor III

09-14-2022 9:01:36 PM

1 kudos

Hi, @Krishna Zanwar Could you please raise a support case to report the bug. Please refer https://docs.databricks.com/resources/support.html to engage with Databricks Support.

1 kudos

09-14-2022 9:01:36 PM

1 More Replies

by Andrei_Radulesc • Contributor III

09-14-2022 8:21:33 AM

4451 Views
1 replies
2 kudos

Resolved! Error: cannot create mws credentials: Cannot complete request; user is unauthenticated

I am configuring databricks_mws_credentials through Terraform on AWS. This used to work up to a couple days ago - now, I am getting "Error: cannot create mws credentials: Cannot complete request; user is unauthenticated".My user/pw/account credential...

Data Engineering

4451 Views
1 replies
2 kudos

09-14-2022 8:21:33 AM

View Replies

Latest Reply

Andrei_Radulesc
Contributor III

09-14-2022 11:42:57 AM

2 kudos

Update: after changing the account password, the error went away. There seems to have been a temporary glitch in Databricks preventing Terraform from working with the old password - because the old password was correctly set up.Anyhow, now I have a w...

2 kudos

09-14-2022 11:42:57 AM

by RohitKulkarni • Contributor

09-14-2022 10:24:26 AM

1149 Views
0 replies
2 kudos

Get file from SharePoint to copy into Azure blob storage

Hello Team,I am trying to copy the xlx files from sharepoint and move to the Azure blob storageUSERNAME = app_config_client.get_configuration_setting(key='BIAppConfig:SharepointUsername',label='BIApp').valuePASSWORD = app_config_client.get_configurat...

Data Engineering

1149 Views
0 replies
2 kudos

09-14-2022 10:24:26 AM

by Anonymous • Not applicable

09-14-2022 10:03:38 AM

349 Views
0 replies
0 kudos

Data + AI World Tour &#xd83c;&#xdf0f; ✈️ Data + AI World Tour brings the data lakehouse to the global datacommunity. With content, customers and speakers tai...

Data + AI World Tour Data + AI World Tour brings the data lakehouse to the global datacommunity. With content, customers and speakers tailored to eachregion, the tour showcases how and why the data lakehouse is quicklybecoming the cloud data archite...

Data Engineering

349 Views
0 replies
0 kudos

09-14-2022 10:03:38 AM

by ahuarte • New Contributor III

12-16-2021 2:11:09 AM

6320 Views
18 replies
3 kudos

Resolved! Getting Spark & Scala version in Cluster node initialization script

Hi there, I am developing a Cluster node initialization script (https://docs.gcp.databricks.com/clusters/init-scripts.html#environment-variables) in order to install some custom libraries.Reading the docs of Databricks we can get some environment var...

Data Engineering

6320 Views
18 replies
3 kudos

12-16-2021 2:11:09 AM

View Replies

Latest Reply

Lingesh
New Contributor III

09-14-2022 9:02:25 AM

3 kudos

We can infer the cluster DBR version using the env $DATABRICKS_RUNTIME_VERSION. (For the exact spark/scala version mapping, you can refer to the specific DBR release notes)Sample usage inside a init script, DBR_10_4_VERSION="10.4" if [[ "$DATABRICKS_...

3 kudos

09-14-2022 9:02:25 AM

17 More Replies

by Krish-685291 • New Contributor III

09-14-2022 4:42:53 AM

429 Views
0 replies
0 kudos

Dataframe loses its contents after the write operation to Database.

We had working code as below.print(f"{file_name}Before insert count", datetime.datetime.now(), scan_df_new.count())print(scan_df_new.show())scan_20220908120005_10Before insert count 2022-09-14 11:37:15.853588 3+-------------------+----------+--------...

Data Engineering

429 Views
0 replies
0 kudos

09-14-2022 4:42:53 AM

by nickagel • New Contributor III

08-12-2022 2:51:56 AM

2340 Views
5 replies
4 kudos

AWS Glue Catalog w/ Delta Tables Connected to Databricks SQL Engine - Incompatible format detected.

I've posted the same question on stack overflow to try to maximize reach here & potentially raise this issue to Databricks.I am trying to query delta tables from my AWS Glue Catalog on Databricks SQL Engine. They are stored in Delta Lake format. I ha...

Data Engineering

2340 Views
5 replies
4 kudos

08-12-2022 2:51:56 AM

View Replies

Latest Reply

Vidula
Honored Contributor

09-10-2022 9:41:55 PM

4 kudos

Hi @Nick Agel Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thanks!

4 kudos

09-10-2022 9:41:55 PM

4 More Replies

by tariq • New Contributor III

09-13-2022 7:46:38 AM

3908 Views
4 replies
0 kudos

Importing python module

I'm not sure how a simple thing like importing a module in python can be so broken in such a product. First, I was able to make it work using the following:import sys sys.path.append("/Workspace/Repos/Github Repo/sparkling-to-databricks/src") from ut...

Data Engineering

3908 Views
4 replies
0 kudos

09-13-2022 7:46:38 AM

View Replies

Latest Reply

KrishZ
Contributor

09-13-2022 9:36:54 PM

0 kudos

I too wonder the same thing. How can importing a python module be so difficult and not even documented lol.No need for libraries..Here's what worked for me..Step1: Upload the module by first opening a notebook >> File >> Upload Data >> drag and drop ...

0 kudos

09-13-2022 9:36:54 PM

3 More Replies

by jay_sharma • New Contributor III

09-13-2022 12:43:56 PM

830 Views
0 replies
4 kudos

Function not found when running from another Notebook using %run command.

Hi all,I'm trying to run some functions from another notebook (data_process_notebook) in my main notebook, using the %run command command. When I run the command: %run ../path/to/data_process_notebook, it is able to complete successfully, no path, pe...

Data Engineering

830 Views
0 replies
4 kudos

09-13-2022 12:43:56 PM

by mattmunz • New Contributor III

05-11-2022 12:49:55 PM

14297 Views
5 replies
0 kudos

How can I resolve this SSL error which occurrs when calling databricks-sql-connector/databricks.sql.connect() from my python app?

Error: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: self signed certificate in certificate chain (_ssl.c:997)> python --versionPython 3.10.4This error seems to be coming from the thrift backend. I suspect but have not confirmed that t...

Data Engineering

14297 Views
5 replies
0 kudos

05-11-2022 12:49:55 PM

View Replies

Latest Reply

ziggy
New Contributor II

06-29-2022 9:44:20 AM

0 kudos

I have the same issue and tried the solution mentioned above. It still did not work. I am getting below errorError: ('HY000', '[HY000] [Simba][ThriftExtension] (14) Unexpected response from server during a HTTP connection: SSL_connect: certificate ve...

0 kudos

06-29-2022 9:44:20 AM

4 More Replies

User

Count

1601

736

343

284

247

Databricks

Forum Posts

I am unable to attach a streaming listener to a spark streaming job. Error: no streaming listener attached to the spark application is the error we are observing post accessing streaming statistics API. Please help us with this issue ASAP. Thanks.

Fixed length file from Databricks notebook ( Spark SQL)

GroupBy in delta live tables fails with error "RuntimeError: Query function must return either a Spark or Koalas DataFrame"

Pulling Data From Stripe to Databricks using the Webhook

Hai,I need somehelp,I am reading csv file through pyspark ,in which one field encoded with double quotes,I should get that value along with double quo...

Where to report a bug with Databricks ?

Resolved! Error: cannot create mws credentials: Cannot complete request; user is unauthenticated

Get file from SharePoint to copy into Azure blob storage

Data + AI World Tour &#xd83c;&#xdf0f; ✈️ Data + AI World Tour brings the data lakehouse to the global datacommunity. With content, customers and speakers tai...

Resolved! Getting Spark & Scala version in Cluster node initialization script

Dataframe loses its contents after the write operation to Database.

AWS Glue Catalog w/ Delta Tables Connected to Databricks SQL Engine - Incompatible format detected.

Importing python module

Function not found when running from another Notebook using %run command.

How can I resolve this SSL error which occurrs when calling databricks-sql-connector/databricks.sql.connect() from my python app?

DELTA_EXCEED_CHAR_VARCHAR_LIMIT

Not able to set run_as service_principal_name

Pyspark operations slowness in CLuster 14.3LTS as ...

[Databricks Assets Bundles] Workflow trigger on fi...

Addressing Pipeline Error Handling in Databricks b...