How do you update a streaming job in production with minimal or no downtime when significant code changes may be incompatible with the existing checkpoint state used to resume stream processing?
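A minimal sketch of one common pattern, assuming a Delta source; the table names, the startingVersion, and the checkpoint path below are all hypothetical placeholders: stop the old query, deploy the new code, and start it against a fresh checkpoint location, pinning where the source resumes so data is neither skipped nor reprocessed.

    # Hypothetical sketch: restart an updated, state-incompatible query
    # against a brand-new checkpoint location.
    (spark.readStream
        .format("delta")
        .option("startingVersion", 1234)   # version the old query had reached (hypothetical)
        .table("events")                   # hypothetical source table
        .writeStream
        .format("delta")
        .option("checkpointLocation", "/mnt/checkpoints/events_v2")  # fresh checkpoint dir
        .toTable("events_agg"))            # hypothetical target table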
I am trying to write data from Databricks to an S3 bucket, but when I submit the code it runs and runs without making any progress. I am not getting any errors, and the logs don't seem to register that I've submitted anything. The cluster also looks un...
I wrote the following code:

data = spark.sql("SELECT A_adjClose, AA_adjClose, AAL_adjClose, AAP_adjClose, AAPL_adjClose FROM deltabase.a_30min_delta, deltabase.aa_30min_delta, deltabase.aal_30min_delta, deltabase.aap_30min_delta, deltabase.aapl_30m...
I just discovered a solution. Today, when I opened Azure Databricks and imported Python libraries, Databricks told me that toPandas() was deprecated and suggested using toPandas. The following solution works: use toPandas instead of toPandas() da...
As the title says, I need to clone code from my private Git repo and use it in my notebook. I do something like:

import subprocess

def cmd(command, cwd=None):
    # run a shell command and capture its stdout
    process = subprocess.Popen(command.split(), stdout=subprocess.PIPE, cwd=cwd)
    output, error = process.communicate()
    return output, error
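A hedged usage sketch of that helper; the token placeholder, organization, and repository names are hypothetical:

    # Clone a private repo with a personal access token embedded in the URL
    # (<token>, my-org, and my-repo are placeholders).
    out, err = cmd("git clone https://<token>@github.com/my-org/my-repo.git", cwd="/tmp")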
Hi @Andy Huang, just a friendly follow-up. Do you still need help, or did @Prabakar Ammeappin's response help you find the solution? Please let us know.
Code example:

# a list of file paths
list_files_path = ["/dbfs/mnt/...", ..., "/dbfs/mnt/..."]
# copy all the files above to this folder
dest_path = "/dbfs/mnt/..."
for file_path in list_files_path:
    # copy function
    copy_file(file_path, dest_path)

I am runni...
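One possible implementation of the copy_file helper used above, a sketch assuming the /dbfs/mnt/... paths are mounted and reachable as local files from the driver:

    import os
    import shutil

    def copy_file(file_path, dest_path):
        # copy a single file into dest_path, keeping its original file name
        shutil.copy(file_path, os.path.join(dest_path, os.path.basename(file_path)))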
Hi, I have run this code:

import matplotlib.pyplot as plt
import numpy as np
plt.style.use('bmh')
%matplotlib inline
x = np.array([5,7,8,7,2,17,2,9,4,11,12,9,6])
y = np.array([99,86,87,88,111,86,103,87,94,78,77,85,86])
p = plt.scatter(x, y)

The display command r...
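For what it's worth, a small sketch of the rendering step, assuming a Databricks notebook: pass the current Figure to display() rather than the PathCollection that plt.scatter returns, or simply end the cell with plt.show().

    p = plt.scatter(x, y)
    display(plt.gcf())  # hand display() the Figure object, not the scatter result p
    # or: plt.show()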
I expected the code below to print "hello" for each partition and "world" for each record, but when I ran it there were no printouts of any kind, and no errors either. What is happening here?

%scala
val rdd = spark.sparkContext.parallelize(S...
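What is most likely happening (sketched in PySpark; the same applies to the Scala snippet above): foreachPartition runs on the executors, so its print output lands in the executor stdout logs, not in the notebook cell on the driver. To see something in the notebook, bring the data back to the driver first, e.g. with glom().

    rdd = spark.sparkContext.parallelize(range(10), 2)

    def show_partition(records):
        # executes on an executor: output goes to the executor's stdout log
        print("hello")
        for rec in records:
            print("world", rec)

    rdd.foreachPartition(show_partition)   # nothing appears in the notebook cell

    for part in rdd.glom().collect():      # collect per-partition lists to the driver
        print("hello", part)               # now the prints show up in the cell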
Hello, here is a small code snippet:

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('example_app').getOrCreate()
spark.sql('SHOW PARTITIONS database.table').show()

The output inside the Databricks notebook:

+-------------+--...
Hi all, I have custom code (PySpark and Spark SQL notebooks) that I want to deploy at a customer location, encapsulated so that end customers don't see the actual code. Currently all our code is in notebooks (PySpark/Spark SQL). Could you please l...
With notebooks that is not possible. You can write your code in Scala/Java and build a jar, which you then run with spark-submit (example), or use Python and deploy a wheel (example); a minimal sketch of the wheel route follows below. This can become quite complex when you have dependencies. Also: a jar et...
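A minimal sketch of the wheel option, assuming the notebook logic has first been moved into a regular Python package; the package name is a hypothetical placeholder:

    # setup.py -- build the wheel with: python setup.py bdist_wheel
    from setuptools import setup, find_packages

    setup(
        name="customer_etl",        # hypothetical package name
        version="0.1.0",
        packages=find_packages(),
        install_requires=[],        # list runtime dependencies here
    )

The resulting .whl can then be installed on the cluster as a library, which keeps the source out of the notebooks, though a wheel still ships readable source rather than true obfuscation.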
def upsertToDelta(microBatchOutputDF, batchId):
    # expose the micro-batch as a temporary view so SQL can reference it
    microBatchOutputDF.createOrReplaceTempView("updates")
    # run the merge through the micro-batch DataFrame's own SparkSession
    microBatchOutputDF._jdf.sparkSession().sql("""
        MERGE INTO old o
        USING updates u
        ON u.id = o.id
        WHEN MATCHED THEN UPDATE SET *
    """)
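For context, a sketch of how such a function is typically wired into a streaming query via foreachBatch; streamingDF here is a hypothetical streaming DataFrame:

    # Apply the upsert on every micro-batch ("old" above is the Delta target).
    (streamingDF.writeStream
        .foreachBatch(upsertToDelta)
        .outputMode("update")
        .start())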
Hello, when I run this code: CREATE DATABASE BackOffice, I see the database as backoffice. Why is everything in lower case? Is it possible to configure Databricks to keep the original name? Thanks.
It is managed by the Hive metastore; names are stored in lower case, which is safer because some databases are case-sensitive and some are not (you can easily test this with standard WHERE syntax). You could probably change it with some Hive settings, but i...
We are planning to write custom code on Databricks that calls the Salesforce Bulk API 2.0 to load data from a Databricks Delta table into Salesforce. My question is: can all the exception handling, retries, and everything else around the Bulk API be coded explicitly in Databricks...
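To the extent the question is whether retries can be coded explicitly: yes, a plain HTTP client with a retry loop works from a Databricks job. A minimal sketch, with a hypothetical endpoint, headers, and payload:

    import time
    import requests

    def post_with_retries(url, headers, payload, max_retries=3, backoff_s=2.0):
        # POST with simple linear backoff; url/headers/payload are placeholders
        for attempt in range(1, max_retries + 1):
            try:
                resp = requests.post(url, headers=headers, json=payload, timeout=60)
                resp.raise_for_status()
                return resp.json()
            except requests.RequestException:
                if attempt == max_retries:
                    raise                       # out of retries: surface the error
                time.sleep(backoff_s * attempt) # wait before the next attempt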
Hello guys. I'm trying to read a JSON file that contains backslashes, and PySpark fails to read it. I've tried a lot of options but haven't solved it yet. I thought of reading the whole JSON as text and replacing all "\" with "/", but PySpark fails to read it as te...
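A sketch of the read-as-text-then-replace idea described above; the input path is a hypothetical placeholder, and note that blindly replacing every backslash will also mangle legitimate JSON escapes:

    from pyspark.sql import functions as F

    # read each line of the file as plain text
    raw = spark.read.text("/mnt/data/input.json")  # hypothetical path
    # the backslash must be escaped in the regex pattern
    fixed = raw.withColumn("value", F.regexp_replace("value", r"\\", "/"))
    # parse the repaired lines as JSON
    df = spark.read.json(fixed.rdd.map(lambda r: r["value"]))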