Data Engineering

Forum Posts

Sorted by:

by Hubert-Dudek • Esteemed Contributor III

11-17-2021 6:16:14 AM

20063 Views
12 replies
12 kudos

Resolved! dbutils or other magic way to get notebook name or cell title inside notebook cell

Not sure it exists but maybe there is some trick to get directly from python code:NotebookNameCellTitlejust working on some logger script shared between notebooks and it could make my life a bit easier

Data Engineering

20063 Views
12 replies
12 kudos

11-17-2021 6:16:14 AM

View Replies

Latest Reply

rtullis
New Contributor II

11-21-2024 12:22:16 PM

12 kudos

I got the solution to work in terms of printing the notebook that I was running; however, what if you have notebook A that calls a function that prints the notebook name, and you run notebook B that %runs notebook A? I get the notebook B's name when...

12 kudos

11-21-2024 12:22:16 PM

11 More Replies

by T_1 • New Contributor III

05-25-2022 12:31:51 PM

30195 Views
13 replies
3 kudos

Resolved! displayHTML can't seem to be used from Python code, only hand typed into a cell???

Trying to use displayHTML from w/in a Python module gets a Python exception:NameError: name 'displayHTML' is not definedand I've found no way around this. It seems to be something at the UI layer or something, not a Python function that can be refere...

Data Engineering

30195 Views
13 replies
3 kudos

05-25-2022 12:31:51 PM

View Replies

Latest Reply

T_1
New Contributor III

10-31-2023 11:23:07 AM

3 kudos

Holy Guacamole Batman! It works finally!!!! Wow, thanks @ptweir That's awesome! I can go back and update my doc (and code, to just use databricks the same, now, and Jupyter!) and it'll work by default. It's great they fixed it, shame they never told ...

3 kudos

10-31-2023 11:23:07 AM

12 More Replies

by Data_Engineer_3 • New Contributor III

10-27-2021 8:14:48 AM

19690 Views
12 replies
4 kudos

FileNotFoundError: [Errno 2] No such file or directory: '/FileStore/tables/flight_data.zip' The data and file exists in location mentioned above

I am new to learning Spark and working on some practice; I have uploaded a zip file in DBFS /FileStore/tables directory and trying to run a python code to unzip the file; The python code is as: from zipfile import *with ZipFile("/FileStore/tables/fli...

Data Engineering

19690 Views
12 replies
4 kudos

10-27-2021 8:14:48 AM

View Replies

Latest Reply

883022
New Contributor II

07-19-2023 8:36:48 AM

4 kudos

What if changing the runtime is not an option? I'm experiencing a similar issue using the following:%pip install -r /dbfs/path/to/file.txtThis worked for a while, but now I'm getting the Errno 2 mentioned above. I am still able to print the same file...

4 kudos

07-19-2023 8:36:48 AM

11 More Replies

by Oliver_Angelil • Valued Contributor II

05-09-2023 5:56:29 AM

9027 Views
4 replies
0 kudos

Resolved! Python code linter in Databricks notebook

Is it possible to get syntax linting in a DB notebook? Say with flake8, like I do in VS code?

Data Engineering

9027 Views
4 replies
0 kudos

05-09-2023 5:56:29 AM

View Replies

Latest Reply

artsheiko
Databricks Employee

05-10-2023 7:03:46 AM

0 kudos

No linting in a DB notebook available for now. The Notebook is currently in the process of adopting Monaco as the underlying code editor which will offer an improved code authoring experience for notebook cells.Some of the Monaco editor features enab...

0 kudos

05-10-2023 7:03:46 AM

3 More Replies

by KVNARK • Honored Contributor II

12-21-2022 4:11:38 AM

1451 Views
1 replies
4 kudos

Resolved! a usecase to query millions of values.

have a small use case where we need to query the sql database with 1 million values(dynamically returned from python code) in the condition from python function. eg: select * from id in (1,2,23,33........1M). I feel this is very bad approach. Is ther...

Data Engineering

1451 Views
1 replies
4 kudos

12-21-2022 4:11:38 AM

View Replies

Latest Reply

daniel_sahal
Esteemed Contributor

12-22-2022 5:06:06 AM

4 kudos

You can also create a temporary view with the output from python code (one id = one row) and then inner join the view to the table. IMO will improve readability of your code.

4 kudos

12-22-2022 5:06:06 AM

by aben1 • New Contributor

09-29-2022 3:22:49 AM

1449 Views
0 replies
0 kudos

I have created a piece of python code, which lead to some python error.The job have failed with Internal Error, see below.The message after clicking o...

I have created a piece of python code, which lead to some python error.The job have failed with Internal Error, see below.The message after clicking on it states somewhat miseleading info:Meanwhile the real issue is fortunatelly described in Logs I d...

Data Engineering

1449 Views
0 replies
0 kudos

09-29-2022 3:22:49 AM

by kirankv • New Contributor

08-24-2022 12:46:14 PM

903 Views
0 replies
0 kudos

How to get notebookid programmatically using R

Hi, I would like to log the notebook id programmatically in R, Is there any command that exists in R so that I can leverage to grab the notebook id, I tried with python using the below command and grab it without any issues, and looking for similar f...

Data Engineering

903 Views
0 replies
0 kudos

08-24-2022 12:46:14 PM

by gbrueckl • Contributor II

10-14-2021 1:12:48 PM

5940 Views
6 replies
4 kudos

Resolved! CREATE FUNCTION from Python file

Is it somehow possible to create an SQL external function using Python code?the examples only show how to use JARshttps://docs.databricks.com/spark/latest/spark-sql/language-manual/sql-ref-syntax-ddl-create-function.htmlsomething like:CREATE TEMPORAR...

Data Engineering

5940 Views
6 replies
4 kudos

10-14-2021 1:12:48 PM

View Replies

Latest Reply

pts
New Contributor II

02-04-2022 6:11:28 PM

4 kudos

As a user of your code, I'd find it a less pleasant API because I'd have to some_module.some_func.some_func() rather than just some_module.some_func()No reason to have "some_func" exist twice in the hierarchy. It's kind of redundant. If some_func is ...

4 kudos

02-04-2022 6:11:28 PM

5 More Replies

by Development • New Contributor III

12-28-2021 11:37:40 PM

815 Views
0 replies
0 kudos

Hi All, I hope you're doing well I am facing issue while installing an python library on ADB Cluster. lib - PyCaret ( latest version) its not gett...

Hi All,I hope you're doing wellI am facing issue while installing an python library on ADB Cluster.lib - PyCaret ( latest version)its not getting install and showing me 'Failed' Status.It would be great if you can help here !!Thanks

Data Engineering

815 Views
0 replies
0 kudos

12-28-2021 11:37:40 PM

by Nickels • New Contributor II

10-25-2021 2:04:31 AM

2463 Views
4 replies
1 kudos

Resolved! Reply on inline runtime commands

I feel like the answer to this question should be simple, but none the less I'm struggling.I run a python code that prompts me with the following warning:On my local machine, I can accept this through my terminal and my machine do not run out of memo...

Data Engineering

2463 Views
4 replies
1 kudos

10-25-2021 2:04:31 AM

View Replies

Latest Reply

jose_gonzalez
Databricks Employee

10-26-2021 1:41:23 PM

1 kudos

Hi @Nickels Köhling ,In Databricks, you will only be able to see the output in the driver logs. If you go to your driver logs, you will be able to see 3 windows that are displaying the output of "stdout", "stderr" and "log4j".If in your code you do ...

1 kudos

10-26-2021 1:41:23 PM

3 More Replies

by User16826994223 • Honored Contributor III

06-18-2021 3:47:20 AM

2299 Views
1 replies
0 kudos

Resolved! How to find best model using python in mlflow

I have a use case in mlflow with python code to find a model version that has the best metric (for instance, “accuracy”) among so many versions , I don't want to use web ui but to use python code to achieve this. Any Idea?

Data Engineering

2299 Views
1 replies
0 kudos

06-18-2021 3:47:20 AM

View Replies

Latest Reply

User16826994223
Honored Contributor III

06-18-2021 3:48:01 AM

0 kudos

import mlflow client = mlflow.tracking.MlflowClient() runs = client.search_runs("my_experiment_id", "", order_by=["metrics.rmse DESC"], max_results=1) best_run = runs[0]https://mlflow.org/docs/latest/python_api/mlflow.tracking.html#mlflow.tracking.M...

0 kudos

06-18-2021 3:48:01 AM

Databricks Community

Resolved! dbutils or other magic way to get notebook name or cell title inside notebook cell

Resolved! displayHTML can't seem to be used from Python code, only hand typed into a cell???

FileNotFoundError: [Errno 2] No such file or directory: '/FileStore/tables/flight_data.zip' The data and file exists in location mentioned above

Resolved! Python code linter in Databricks notebook

Resolved! a usecase to query millions of values.

I have created a piece of python code, which lead to some python error.The job have failed with Internal Error, see below.The message after clicking o...

How to get notebookid programmatically using R

Resolved! CREATE FUNCTION from Python file

Hi All, I hope you&#39;re doing well I am facing issue while installing an python library on ADB Cluster. lib - PyCaret ( latest version) its not gett...

Resolved! Reply on inline runtime commands

Resolved! How to find best model using python in mlflow

Hi All, I hope you're doing well I am facing issue while installing an python library on ADB Cluster. lib - PyCaret ( latest version) its not gett...