Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

scvbelle
by New Contributor III
  • 4812 Views
  • 3 replies
  • 3 kudos

Resolved! DLT failure: ABFS does not allow files or directories to end with a dot

In my DLT pipeline outlined below, which generically cleans identifier tables, the initial streaming tables are created successfully from the append-only sources, but the pipeline then fails when trying to create the second set of cleaned tables with the following: It's cl...

Data Engineering
abfss
azure
dlt
engineering
Latest Reply
Priyanka_Biswas
Databricks Employee
  • 3 kudos

Hi @scvbelle The error message you're seeing is caused by an IllegalArgumentException error due to the restriction in Azure Blob File System (ABFS) that does not allow files or directories to end with a dot. This error is thrown by the trailingPeriod...
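A minimal sketch of the kind of pre-write sanitization this reply points to, assuming the offending dots come from identifier values in the source data (the helper and column names are illustrative, not from the thread):

from pyspark.sql import functions as F

def strip_trailing_dots(df, cols):
    # Trim trailing dots from values that later become ABFS file or directory
    # names, since ABFS rejects names ending with a dot.
    for c in cols:
        df = df.withColumn(c, F.regexp_replace(F.col(c), r"\.+$", ""))
    return df

# e.g. cleaned_df = strip_trailing_dots(raw_df, ["identifier"])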

2 More Replies
kinsun
by New Contributor II
  • 25065 Views
  • 5 replies
  • 2 kudos

Resolved! DBFS and Local File System Doubts

Dear Databricks Expert, I have some doubts when dealing with DBFS and the local file system. Case 01: Copy a file from ADLS to DBFS. I am able to do so with the Python code below:
#spark.conf.set("fs.azure.account.auth.type", "OAuth")
spark.conf.set("fs.a...
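For reference, a hedged sketch of the ADLS-to-DBFS copy the post describes, assuming OAuth with an Azure service principal; every <placeholder> is illustrative:

# Configure OAuth for the storage account (values are placeholders).
spark.conf.set("fs.azure.account.auth.type.<storage>.dfs.core.windows.net", "OAuth")
spark.conf.set("fs.azure.account.oauth.provider.type.<storage>.dfs.core.windows.net",
               "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider")
spark.conf.set("fs.azure.account.oauth2.client.id.<storage>.dfs.core.windows.net", "<app-id>")
spark.conf.set("fs.azure.account.oauth2.client.secret.<storage>.dfs.core.windows.net",
               dbutils.secrets.get(scope="<scope>", key="<key>"))
spark.conf.set("fs.azure.account.oauth2.client.endpoint.<storage>.dfs.core.windows.net",
               "https://login.microsoftonline.com/<tenant-id>/oauth2/token")

# Copy a single file from ADLS Gen2 into DBFS.
dbutils.fs.cp("abfss://<container>@<storage>.dfs.core.windows.net/path/file.csv",
              "dbfs:/tmp/file.csv")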

Latest Reply
Anonymous
Not applicable
  • 2 kudos

Hi @KS LAU, thank you for posting your question in our community! We are happy to assist you. To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers your q...

4 More Replies
Madison
by New Contributor II
  • 12542 Views
  • 1 reply
  • 0 kudos

AnalysisException: [ErrorClass=INVALID_PARAMETER_VALUE] Missing cloud file system scheme

I am trying to follow along with the Apache Spark Programming training module, where the instructor creates an events table from a parquet file like this:
%sql CREATE TABLE IF NOT EXISTS events USING parquet OPTIONS (path "/mnt/training/ecommerce/events/events.par...

Data Engineering
Databricks SQL
Latest Reply
Madison
New Contributor II
  • 0 kudos

@Retired_mod Thanks for your response. I didn't provide a cloud file system scheme in the path while creating the table using the DataFrame API, but I was still able to create the table.
%python
# File location and type
file_location = "/mnt/training/ecom...
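One way to make the SQL variant work is to spell out the scheme explicitly. A hedged sketch (the path is the training one from the post; the dbfs:/ prefix is the assumption):

spark.sql("""
  CREATE TABLE IF NOT EXISTS events
  USING parquet
  OPTIONS (path "dbfs:/mnt/training/ecommerce/events/events.parquet")
""")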

Nino
by Contributor
  • 9770 Views
  • 8 replies
  • 5 kudos

Resolved! Where in Hive Metastore can the s3 locations of Databricks tables be found?

I have a few Databricks clusters, some share a single Hive Metastore (HMS), call them PROD_CLUSTERS, and an additional cluster, ADHOC_CLUSTER, which has its own HMS. All my data is stored in S3, as Databricks delta tables: PROD_CLUSTERS have read-wri...

Data Engineering
HMS
metastore
Latest Reply
Nino
Contributor
  • 5 kudos

Something went wrong there; here's the last sentence: I expected "location" to be the s3 path, but it's not always so (elaborated in the original posting). Thanks!
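For readers who want the locations programmatically, a sketch that works for Delta tables (the database name is illustrative; DESCRIBE DETAIL exposes a location column):

# List every table in a database together with its storage location.
for t in spark.catalog.listTables("my_db"):
    detail = spark.sql(f"DESCRIBE DETAIL my_db.{t.name}").collect()[0]
    print(t.name, detail["location"])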

7 More Replies
sourander
by New Contributor III
  • 18209 Views
  • 13 replies
  • 7 kudos

Resolved! Protobuf deserialization in Databricks

Hi, let's assume I have these things: a binary column containing protobuf-serialized data, and the .proto file including the message definition. What different approaches have Databricks users chosen to deserialize the data? Python is the programming language that...

Latest Reply
Amou
Databricks Employee
  • 7 kudos

We've now added a native connector that parses protobuf directly into Spark DataFrames: https://docs.databricks.com/en/structured-streaming/protocol-buffers.html
from pyspark.sql.protobuf.functions import to_protobuf, from_protobuf
schema_registry_options = ...
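A hedged completion of that snippet, following the linked docs; the payload column name, subject, and registry address are assumptions:

from pyspark.sql.protobuf.functions import from_protobuf

schema_registry_options = {
    "schema.registry.subject": "my-topic-value",
    "schema.registry.address": "https://schema-registry:8081",
}

# Decode the protobuf-serialized binary column into a struct column.
decoded = df.select(
    from_protobuf("payload", options=schema_registry_options).alias("event")
)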

12 More Replies
mjbobak
by Contributor
  • 31384 Views
  • 5 replies
  • 9 kudos

Resolved! How to import a helper module that uses databricks specific modules (dbutils)

I have a main Databricks notebook that runs a handful of functions. In this notebook, I import a helper.py file that is in the same repo, and when I execute the import everything looks fine. Inside my helper.py there's a function that leverages built-i...

Latest Reply
amitca71
Contributor II
  • 9 kudos

Hi, I'm facing a similar issue when deploying via dbx. I have a helper notebook that works fine when executed via Jobs (without any includes), but when I deploy it via dbx (to the same cluster), the helper notebook fails with: dbutils.fs.ls(path) NameEr...
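One widely used workaround (an assumption here, not the thread's confirmed fix) is to construct dbutils inside the helper module instead of relying on the notebook-injected global:

from pyspark.sql import SparkSession

def get_dbutils(spark: SparkSession):
    # On a Databricks cluster, DBUtils can be built from the SparkSession;
    # the fallback covers notebook contexts where dbutils already exists.
    try:
        from pyspark.dbutils import DBUtils
        return DBUtils(spark)
    except ImportError:
        import IPython
        return IPython.get_ipython().user_ns["dbutils"]

# In helper.py: dbutils = get_dbutils(SparkSession.builder.getOrCreate())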

4 More Replies
jagger9919
by New Contributor II
  • 9701 Views
  • 6 replies
  • 5 kudos

Unable to login to community edition

Hello there, I have successfully created a Databricks account and tried to log in to Community Edition with the exact same login credentials as my account, but it tells me that the email/password are invalid. I can log in with these same exact credenti...

Latest Reply
xxl4tomxu98
New Contributor III
  • 5 kudos

I have the same issue. I had previously been able to log in; the problem started only a few months ago. Can anyone help?

5 More Replies
lnights
by New Contributor II
  • 6180 Views
  • 5 replies
  • 2 kudos

High cost of storage when using structured streaming

Hi there, I read data from Azure Event Hub and, after manipulating the data, write the dataframe back to Event Hub (I use this connector for that):
#read data
df = (spark.readStream
      .format("eventhubs")
      .options(**ehConf)
      ...

[Attached chart: transactions in Azure storage]
Latest Reply
PetePP
New Contributor II
  • 2 kudos

I had the same problem when starting with Databricks. As outlined above, it is the shuffle partitions setting that results in a number of files equal to the number of partitions. Thus, you are writing a low data volume but get taxed on the amount of write (a...
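A sketch of that mitigation, assuming the df and ehConf from the original post; the partition count and checkpoint path are illustrative:

# Fewer shuffle partitions -> fewer files (and storage transactions) per micro-batch.
spark.conf.set("spark.sql.shuffle.partitions", "8")

query = (
    df.coalesce(1)  # collapse to a single output partition before writing
      .writeStream
      .format("eventhubs")
      .options(**ehConf)
      .option("checkpointLocation", "/tmp/checkpoints/eventhub-out")
      .start()
)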

4 More Replies
Gustavo_Az
by Contributor
  • 16460 Views
  • 1 reply
  • 1 kudos

Resolved! Access Azure KeyVault from all executors in Databricks

Hello, I suspect that this can't be done out of the box and want to know a way of doing it; I have been trying without success. So far I have tried this: based on this link, I have created a class and an object (companion and not, both ways) for cip...

Data Engineering
KeyVault
Scala
Latest Reply
Gustavo_Az
Contributor
  • 1 kudos

I found a workaround that makes the secrets from the KeyVault usable on all the executors. So far I have only tested this in notebooks; I want to try it later in a JAR job. First, here is a link to the official documentation that highlights...
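A minimal sketch of the broadcast flavor of this workaround (an assumption based on the description above, not the author's exact code); the scope and key names are illustrative:

# Fetch the secret once on the driver (dbutils is driver-only) ...
secret_value = dbutils.secrets.get(scope="my-keyvault-scope", key="cipher-key")

# ... then broadcast it so executor-side code can read it.
secret_bc = spark.sparkContext.broadcast(secret_value)

def process_partition(rows):
    key = secret_bc.value  # resolved locally on each executor
    for row in rows:
        yield row  # decrypt/encrypt with key here

result = df.rdd.mapPartitions(process_partition)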

Policepatil
by New Contributor III
  • 2705 Views
  • 0 replies
  • 0 kudos

Writing data to RDS table taking more time

Hi, cluster configuration and RDS configuration details are in the attached screenshots. I have 30 files, each with 540,000 records. I read all the files and created one dataframe. When I write the dataframe (16,200,000 records) to a table it takes a long time, more than 1 hour (so...
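Since the post has no replies, here is a hedged sketch of JDBC write tuning that is commonly suggested for this symptom; all values, placeholders, and the MySQL-specific property are assumptions:

(df.repartition(8)  # one JDBC connection per partition
   .write
   .format("jdbc")
   .option("url", "jdbc:mysql://<rds-endpoint>:3306/<db>")
   .option("dbtable", "target_table")
   .option("user", "<user>")
   .option("password", "<password>")
   .option("batchsize", 10000)  # rows per batched INSERT
   .option("rewriteBatchedStatements", "true")  # MySQL driver optimization
   .mode("append")
   .save())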

RC
by Contributor
  • 3879 Views
  • 1 reply
  • 1 kudos

Error while creating table with Glue catalog

Hi, I have a Databricks cluster that was earlier connected to the Hive metastore, and we have started migrating to the Glue catalog. I'm facing an issue while creating a table: Path must be absolute: <table-name>-__PLACEHOLDER__ We have provided full access to Glue and S3 in...
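One commonly suggested mitigation (an assumption, not a confirmed resolution of this post) is to give the table an explicit absolute location rather than relying on a default database path; names and bucket are illustrative:

spark.sql("""
  CREATE TABLE my_db.my_table (id INT, name STRING)
  USING delta
  LOCATION 's3://my-bucket/warehouse/my_db/my_table'
""")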

erigaud
by Honored Contributor
  • 2406 Views
  • 1 reply
  • 0 kudos

Cannot update databricks repos in Devops Pipeline

Hello, I am creating a DevOps pipeline to run unit tests on my notebooks using the Nutter library. When a commit is pushed to a branch, a pipeline triggers that should update my repo in a Staging folder (/Repos/Staging/MyRepo). For that I...

Latest Reply
User16539034020
Databricks Employee
  • 0 kudos

Hello, thanks for contacting Databricks Support. The error message indicates that there is an issue with the URL or endpoint you are using with the Databricks repos update command. It appears that one or more required parameters are not being set corr...
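For reference, a hedged sketch of the Repos API call such a pipeline typically makes (PATCH /api/2.0/repos/{repo_id}); the host, token, and repo id are placeholders:

import requests

host = "https://<workspace-url>"
token = "<databricks-token>"
repo_id = "<repo-id>"  # discover via GET /api/2.0/repos

resp = requests.patch(
    f"{host}/api/2.0/repos/{repo_id}",
    headers={"Authorization": f"Bearer {token}"},
    json={"branch": "main"},  # branch that triggered the pipeline
)
resp.raise_for_status()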

ywaihong6123
by New Contributor
  • 6624 Views
  • 1 reply
  • 0 kudos

Libraries Not Working on Shared Cluster 13.3 LTS

I am facing this error while installing the spark-excel library on the cluster. Does anyone know how to add a library to the artifact allowlist? Jars and Maven Libraries on Shared Clusters must be on the allowlist. Failed Libraries: com.crealytics:spark...

Latest Reply
User16752239289
Databricks Employee
  • 0 kudos

You can add the jar by following these steps.
How to add items to the allowlist: You can add items to the allowlist with Data Explorer or the REST API.
To open the dialog for adding items to the allowlist in Data Explorer, do the following: In your Databricks...
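A hedged sketch of the REST variant mentioned in the reply, using the Unity Catalog artifact-allowlists endpoint; the host and token are placeholders and the matcher details are assumptions:

import requests

host = "https://<workspace-url>"
token = "<databricks-token>"

# Allow Maven artifacts whose coordinates start with the given prefix.
resp = requests.put(
    f"{host}/api/2.1/unity-catalog/artifact-allowlists/LIBRARY_MAVEN",
    headers={"Authorization": f"Bearer {token}"},
    json={"artifact_matchers": [
        {"artifact": "com.crealytics:spark-excel", "match_type": "PREFIX_MATCH"}
    ]},
)
resp.raise_for_status()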

