Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Hello everyone, I upgraded my cluster to DBR 13.0, which comes with ipywidgets version 7.7.2 installed. However, I want to use the TagsInput widget, which is new since version 8.0.4. If I upgrade the ipywidgets package to version 8.0.4, none of the widg...
I can confirm that installing a newer ipywidgets library version at a cluster level does not resolve these issues. The arcgis library relies on ipywidgets v8 to render maps. Even when I install ipywidgets > 8 at the cluster level, the widgets still d...
I am using Python notebooks as part of a concurrently running workflow with Databricks Runtime 6.1.
Within the notebooks I am using try/except blocks to return an error message to the main concurrent notebook if a section of code fails. However, I h...
Because dbutils.notebook.exit() raises an 'Exception', it will always trigger the except Exception as e: part of the code. We can use this to our advantage to solve the problem by adding an 'if else' to the except block. query = "SELECT 'a' as Colum...
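A minimal sketch of that pattern, assuming the notebook exits with a success marker string; the query and marker values below are illustrative assumptions, not the original code:

```python
# Hedged sketch of the pattern described above. dbutils.notebook.exit() surfaces as an
# exception, so the except block checks whether it was really the exit call before
# treating it as a failure. Query and marker strings are illustrative assumptions.
query = "SELECT 1 AS test_col"  # hypothetical query standing in for the real one
try:
    spark.sql(query).collect()
    dbutils.notebook.exit("SUCCESS")           # caught below because exit raises internally
except Exception as e:
    if "SUCCESS" in str(e):                    # assumption: the exit value appears in the message
        dbutils.notebook.exit("SUCCESS")       # propagate the intended success result
    else:
        dbutils.notebook.exit(f"FAILED: {e}")  # return the real error to the calling notebook
```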
I am trying to write some unit tests using pytest, but I am coming across the problem of how to mock my dbutils method when dbutils isn't being defined in my notebook. Is there a way to do this so that I can unit test individual functions that are uti...
Fermin_vicente's answer is pretty good already. Below is how you can do something similar with conftest.py:
# conftest.py
import pytest
from unittest.mock import MagicMock
from pyspark.sql import SparkSession
@pytest.fixture(scope="session")
def dbuti...
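For reference, a fuller sketch of such a conftest.py, assuming the functions under test take dbutils (and optionally spark) as parameters; the mocked return values and fixture contents are assumptions, not the original answer:

```python
# conftest.py - hedged sketch of session-scoped spark and dbutils fixtures for pytest.
import pytest
from unittest.mock import MagicMock
from pyspark.sql import SparkSession


@pytest.fixture(scope="session")
def spark():
    # Local SparkSession so DataFrame-producing helpers can be exercised outside Databricks.
    return SparkSession.builder.master("local[1]").appName("unit-tests").getOrCreate()


@pytest.fixture(scope="session")
def dbutils():
    # MagicMock stands in for the real dbutils; configure only what the tests rely on.
    mock = MagicMock()
    mock.secrets.get.return_value = "fake-secret"        # hypothetical secret value
    mock.widgets.get.return_value = "fake-widget-value"  # hypothetical widget value
    return mock
```

Test functions can then accept spark and dbutils as arguments and pytest injects the fixtures automatically.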
When running a notebook using dbutils.notebook.run from a master notebook, a URL to that running notebook is printed, i.e.:
Notebook job #223150 Notebook job #223151
Are there any ways to capture that Job Run ID (#223150 or #223151)? We have 50 or ...
I know this is an old thread, but sharing what works well for me in Python now for retrieving the run_id and building the entire link to that job run: job_id = dbutils.notebook.entry_point.getDbutils().notebook().getContext().jobId().get...
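A hedged sketch along the same lines; getContext().toJson() and the tag keys used below ("jobId", "currentRunId") are assumptions that may differ across DBR versions:

```python
import json

# Hedged sketch: pull run identifiers out of the notebook context instead of parsing
# the printed "Notebook job #..." links. Tag keys and the URL shape are assumptions.
ctx = dbutils.notebook.entry_point.getDbutils().notebook().getContext()
tags = json.loads(ctx.toJson()).get("tags", {})
job_id = tags.get("jobId")          # assumed tag key
run_id = tags.get("currentRunId")   # assumed tag key
print(f"https://<workspace-host>/#job/{job_id}/run/{run_id}")  # hypothetical link format
```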
I have a main Databricks notebook that runs a handful of functions. In this notebook, I import a helper.py file that is in the same repo, and when I execute the import everything looks fine. Inside my helper.py there's a function that leverages built-i...
Hi, I'm facing a similar issue when deploying via dbx. I have a helper notebook that works fine when executed via Jobs (without any includes), but when I deploy it via dbx (to the same cluster), the helper notebook fails on dbutils.fs.ls(path) with NameEr...
I am unable to use dbutils commands, and mkdir etc. also do not work, after upgrading my Databricks Workspace from Standard tier to Premium tier. It throws the following error: py4j.security.Py4JSecurityException: Constructor public com.databricks.back...
Hi @Abhishek Jain, hope all is well! Just wanted to check in to see whether you were able to resolve your issue; if so, would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you. Than...
I am trying to execute the Build your Chat Bot with Dolly Demo using my own VM. In the first steps, they execute this command: %run ./_resources/00-init $catalog=hive_metastore $db=dbdemos_llm
which, as I understand, calls another Python script...
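For what it's worth, the parameterised %run above can also be expressed with dbutils.notebook.run, which takes the parameters as a dictionary; the timeout value below is an arbitrary assumption:

```python
# Hedged sketch: the %run call above expressed via dbutils.notebook.run.
# 600 is an arbitrary timeout in seconds; parameter names come from the demo command.
result = dbutils.notebook.run(
    "./_resources/00-init",
    600,
    {"catalog": "hive_metastore", "db": "dbdemos_llm"},
)
```

Note that %run executes the target in the caller's namespace, whereas dbutils.notebook.run starts a separate run and only returns the string the child passes to dbutils.notebook.exit.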
We're using the following method (generated by using dbx) to access dbutils, e.g. to retrieve parameters from secret scopes:
@staticmethod
def _get_dbutils(spark: SparkSession) -> "dbutils":
try:
from pyspark.dbutils import...
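For context, a complete sketch of that dbx-style helper; the class name and the IPython fallback branch are assumptions based on the commonly used pattern, not the poster's exact code:

```python
from pyspark.sql import SparkSession


class Task:  # hypothetical class name standing in for the dbx-generated job class
    @staticmethod
    def _get_dbutils(spark: SparkSession):
        try:
            # Works when running on a Databricks cluster (jobs, dbx deployments).
            from pyspark.dbutils import DBUtils
            return DBUtils(spark)
        except ImportError:
            # Fallback commonly used in interactive notebooks, where dbutils is
            # already injected into the IPython user namespace (assumption).
            import IPython
            return IPython.get_ipython().user_ns["dbutils"]
```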
Hi there, can you use a %run or dbutils.notebook.run() in a Delta Live Tables (DLT) pipeline? When I try, I get the following error: "IllegalArgumentException: requirement failed: To enable notebook workflows, please upgrade your Databricks subscriptio...
Hi all. @Kaniz Fatma, thanks for your answer. I am on the Premium pricing tier in Azure. After digging around the logs, it would seem that you cannot run magic commands in a Delta Live Tables pipeline. Therefore, you cannot use %run in a DLT pipeline - w...
Square brackets in ADLS are accepted, so why can't I list the files in the folder? I have tried escaping the square brackets manually, but then the escaped values are re-escaped from %5B to %255B and %5D to %255D. I get: URISyntaxException: Illegal ...
@Joshua Stafford: The URISyntaxException error you are encountering is likely due to the fact that square brackets are reserved characters in URIs (Uniform Resource Identifiers) and need to be properly encoded when used in a URL. In this case, it ap...
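One possible workaround (my own suggestion rather than part of the answer) is to avoid passing the bracketed path directly and to filter a parent listing instead; the container, account, and folder names below are hypothetical:

```python
# Hedged workaround sketch: instead of handing a path containing square brackets
# straight to dbutils.fs.ls (where it gets URL-encoded and can be double-escaped),
# list the parent folder and filter in Python.
parent = "abfss://container@account.dfs.core.windows.net/landing/"  # hypothetical path
target = "reports[2023]"  # hypothetical folder name containing brackets
for f in dbutils.fs.ls(parent):
    if target in f.name:
        print(f.path, f.size)
```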
I want to import a Python function stored in the following file path: `<repo>/lib/lib_helpers.py`. I want to import the function from any file in my repo, for instance from these: `<repo>/notebooks/etl/bronze/dlt_bronze_elt`, `<repo>/workers/job_worker`. It ...
Ok, I figured it out. If you just make it a Python module by adding an empty `__init__.py`, Databricks will load it on start. Then, you can just import it.
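A small sketch of what that looks like, assuming the layout from the question; the helper function name and repo path below are hypothetical:

```python
# Hedged sketch, assuming <repo>/lib/__init__.py exists so that lib is a package.
# In Databricks Repos the repo root is normally on sys.path, so a package-style
# import works from notebooks anywhere in the repo.
from lib.lib_helpers import build_greeting  # hypothetical helper function name

# If the repo root is not already on sys.path, it can be added explicitly first:
import sys
sys.path.append("/Workspace/Repos/<user>/<repo>")  # hypothetical repo root path
```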
I'm working with a large text variable, working it into single-line JSON that Spark can process beautifully. Using a single-node 256 GB, 32-core Standard_E32d_v4 "cluster", which should be plenty of memory for this dataset (haven't seen cluster memory u...
Hi @David Toft, the current implementation of dbutils.fs is single-threaded: it performs the initial listing on the driver and subsequently launches a Spark job to perform the per-file operations. So I guess the put operation is running on a single cor...
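One common alternative (my own suggestion, not from this thread) is to skip dbutils.fs.put for large strings and write through the /dbfs FUSE mount with plain Python file I/O; the path and variable name below are illustrative:

```python
# Hedged sketch: write a large in-memory string through the /dbfs FUSE mount
# instead of dbutils.fs.put, then let Spark read it back.
large_json_text = "..."  # stands in for the large single-line JSON from the question
with open("/dbfs/tmp/large_payload.json", "w") as f:   # hypothetical path
    f.write(large_json_text)

df = spark.read.json("dbfs:/tmp/large_payload.json")
```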
I would like to turn off or suppress this message, which is returned from the dbutils library:
%r
files <- dbutils.fs.ls("/dbfs/tmp/")
For prettier results from dbutils.fs.ls(<dir>), please use `%fs ls <dir>`
How can I do this?
Hi @James Smith, hope all is well! Just wanted to check in to see whether you were able to resolve your issue; if so, would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you. Thanks...
I need a solution for the problem below. We have a set of JSON files which keep coming into AWS S3; these files contain details for a property. Please note one property can have 10-12 rows in this JSON file. Attached is a sample JSON file. We need to read...
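A hedged sketch of one way to start on this, assuming the files can be read in batch with Spark's JSON reader; the bucket path and the "property_id" grouping column are assumptions:

```python
from pyspark.sql import functions as F

# Hedged sketch: read the landed JSON files and collapse the 10-12 rows that
# describe one property into a single row per property. Path/column names are assumptions.
raw = spark.read.json("s3://my-bucket/landing/properties/")

per_property = (
    raw.groupBy("property_id")
       .agg(F.collect_list(F.struct(*[F.col(c) for c in raw.columns])).alias("detail_rows"))
)
per_property.show(truncate=False)
```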