Data Engineering

Forum Posts

Sorted by:

by nk76 • New Contributor III

07-01-2022 2:09:09 AM

10226 Views
7 replies
5 kudos

Resolved! Custom library import fails randomly with error: not found: value it

Hello,I have an issue with the import of a custom library, in Azure Databricks.(roughly) 95% of the times it works fine, but sometimes it fails.I searched the internet and this community with no luck, so far.It is a scala library in a scala notebook,...

Data Engineering

10226 Views
7 replies
5 kudos

07-01-2022 2:09:09 AM

View Replies

Latest Reply

Naskar
New Contributor II

12-05-2022 5:49:46 PM

5 kudos

Even I also encountered the same error. While Importing a file getting an error as "Import failed with error: Could not deserialize: Exceeded 16777216 bytes (current = 16778609)"

5 kudos

12-05-2022 5:49:46 PM

6 More Replies

by rrussell25 • New Contributor

11-29-2022 7:13:57 PM

1820 Views
1 replies
0 kudos

Read arguments in a scala note invoked by a job.

In a scala note, how to I read input arguments (e.g. those proved by a job that runs a scala notebook). In python, dbutils.notebook.entry_point.getCurrentBindings() works. How about for scala.

Data Engineering

1820 Views
1 replies
0 kudos

11-29-2022 7:13:57 PM

View Replies

Latest Reply

UmaMahesh1
Honored Contributor III

11-30-2022 11:04:00 AM

0 kudos

Hi @Robert Russell You can use dbutils.notebook.getContext.currentRunId in scala notebooks. Other methods are also available likedbutils.notebook.getContext.jobGroupdbutils.notebook.getContext.rootRunId dbutils.notebook.getContext.tags etc...You ...

0 kudos

11-30-2022 11:04:00 AM

by archanarddy • New Contributor

10-11-2022 7:48:55 PM

1525 Views
0 replies
0 kudos

metastore is down

I am trying to run a scala notebook, but my job just spins and says Metastore is down. Can someone help me. Thanks in advance.

Data Engineering

1525 Views
0 replies
0 kudos

10-11-2022 7:48:55 PM

by 齐木木 • New Contributor III

09-15-2022 7:17:04 PM

2693 Views
1 replies
3 kudos

Resolved! The case class reports an error when running in the notebook

As shown in the figure, the case class and the json string are converted through fasterxml.jackson, but an unexpected error occurred during the running of the code. I think this problem may be related to the loading principle of the notebook. Because...

Data Engineering

2693 Views
1 replies
3 kudos

09-15-2022 7:17:04 PM

View Replies

Latest Reply

齐木木
New Contributor III

09-15-2022 7:29:54 PM

3 kudos

code：var str="{\"app_type\":\"installed-app\"}" import com.fasterxml.jackson.databind.ObjectMapper import com.fasterxml.jackson.module.scala.DefaultScalaModule val mapper = new ObjectMapper() mapper.registerModule(DefaultScalaModule) ...

3 kudos

09-15-2022 7:29:54 PM

by sai_731566 • New Contributor II

08-16-2022 6:50:34 AM

2898 Views
1 replies
0 kudos

How to pass parameters/arguments to shell script from scala in databricks.

I was running shell scrip in data bricks using %sh magic command.I am having requirement where I need to pass parameters/arguments to the script. Is there any way we can get this done with scala as base language.

Data Engineering

2898 Views
1 replies
0 kudos

08-16-2022 6:50:34 AM

View Replies

by sriwin • New Contributor

10-30-2021 10:31:29 AM

3379 Views
1 replies
0 kudos

Create gpg file and save to AWS s3 storage in scala

Hi - Could you please help me on how can I create a scala notebook to perform the below tasksEncrypt a text file using the gpgUpload the file to amazon s3 storageverify the file exists in amazon s3decrypt the encrypted file to verify no issuesApprec...

Data Engineering

3379 Views
1 replies
0 kudos

10-30-2021 10:31:29 AM

View Replies

Latest Reply

Anonymous
Not applicable

11-01-2021 8:58:07 AM

0 kudos

Hello! My name is Piper and I'm a community moderator for Databricks. Thanks for your question. Let's give it a bit more to see what our members have to say. If not, we'll circle back around.

0 kudos

11-01-2021 8:58:07 AM

by tarente • New Contributor III

10-08-2021 10:04:23 AM

4881 Views
6 replies
5 kudos

Resolved! How to implement the where not exists pattern in scala?

I have a dataframe with the following columns:Key1Key2Y_N_ColCol1Col2For the key tuple (Key1, Key2), I have rows with Y_N_Col = "Y" and Y_N_Col = "N".I need a new dataframe with all rows with Y_N_Col = "Y" (regardless of the key tuple), plus all Y_N_...

Data Engineering

4881 Views
6 replies
5 kudos

10-08-2021 10:04:23 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

10-11-2021 3:21:51 AM

5 kudos

I'd use a left-anti join.So create a df with all the Y, then create a df with all the N and do a left_anti join (on key1 and key2) on the df with the Y.then a union of those two.

5 kudos

10-11-2021 3:21:51 AM

5 More Replies

by User15787040559 • Databricks Employee

06-07-2021 9:02:46 AM

2774 Views
1 replies
1 kudos

How can I get Databricks notebooks to stop cutting off the explain plans?

(since Spark 3.0)Dataset.queryExecution.debug.toFilewill dump the full plan to a file, without concatenating the output as a fully materialized Java string in memory.

Data Engineering

2774 Views
1 replies
1 kudos

06-07-2021 9:02:46 AM

View Replies

Latest Reply

dazfuller
Contributor III

09-28-2021 12:16:03 PM

1 kudos

Notebooks really aren't the best method of viewing large files. Two methods you could employ areSave the file to dbfs and then use databricks CLI to download the fileUse the web terminalIn the web terminal option you can do something like "cat my_lar...

1 kudos

09-28-2021 12:16:03 PM

by tarente • New Contributor III

09-18-2021 11:09:15 AM

2103 Views
2 replies
3 kudos

Resolved! How to create a csv using a Scala notebook that as " in some columns?

In a project we use Azure Databricks to create csv files to be loaded in ThoughtSpot.Below is a sample to the code I use to write the file:val fileRepartition = 1 val fileFormat = "csv" val fileSaveMode = "overwrite" var fileOptions = Map ( ...

Data Engineering

2103 Views
2 replies
3 kudos

09-18-2021 11:09:15 AM

View Replies

Latest Reply

tarente
New Contributor III

09-21-2021 1:03:14 AM

3 kudos

Hi Shan,Thanks for the link.I now know more options for creating different csv files.I have not yet completed the problem, but that is related with a destination application (ThoughtSpot) not being able to load the data in the csv file correctly.Rega...

3 kudos

09-21-2021 1:03:14 AM

1 More Replies

by Zircoz • New Contributor II

09-11-2021 2:31:52 AM

14828 Views
2 replies
6 kudos

Resolved! Can we access the variables created in Python in Scala's code or notebook ?

If I have a dict created in python on a Scala notebook (using magic word ofcourse):%python d1 = {1: "a", 2:"b", 3:"c"}Can I access this d1 in Scala ?I tried the following and it returns d1 not found:%scala println(d1)

Data Engineering

14828 Views
2 replies
6 kudos

09-11-2021 2:31:52 AM

View Replies

Latest Reply

cpm1
New Contributor II

09-11-2021 7:38:36 AM

6 kudos

Martin is correct. We could only access the external files and objects. In most of our cases, we just use temporary views to pass data between R & Python.https://docs.databricks.com/notebooks/notebooks-use.html#mix-languages

6 kudos

09-11-2021 7:38:36 AM

1 More Replies

by saqib • New Contributor II

08-19-2016 8:40:53 AM

15339 Views
5 replies
2 kudos

Markup in Databricks Notebook

Do Databricks Scala Notebooks support any sort of markup/markdown?

Data Engineering

15339 Views
5 replies
2 kudos

08-19-2016 8:40:53 AM

View Replies

Latest Reply

Anonymous
Not applicable

09-15-2016 8:43:13 AM

2 kudos

Is it possible to reference variables in markdown?

2 kudos

09-15-2016 8:43:13 AM

4 More Replies

Databricks Community

Resolved! Custom library import fails randomly with error: not found: value it

Read arguments in a scala note invoked by a job.

metastore is down

Resolved! The case class reports an error when running in the notebook

How to pass parameters/arguments to shell script from scala in databricks.

Create gpg file and save to AWS s3 storage in scala

Resolved! How to implement the where not exists pattern in scala?

How can I get Databricks notebooks to stop cutting off the explain plans?

Resolved! How to create a csv using a Scala notebook that as " in some columns?

Resolved! Can we access the variables created in Python in Scala's code or notebook ?

Markup in Databricks Notebook