Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
When I run my readStream command using .option("cloudFiles.useNotifications", "true"), it starts reading the files from Azure Blob (please note that I did not provide configuration like subscription ID, client ID, connection string and so on while...
Hi, I would like to share the following docs that might be able to help you with this issue: https://docs.databricks.com/ingestion/auto-loader/file-notification-mode.html#required-permissions-for-configuring-file-notification-for-adls-gen2-and-azure-b...
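For reference, file notification mode needs the storage and service-principal details passed as Auto Loader options. A rough sketch is below; the option names follow the linked docs, and all IDs, secrets and paths are placeholders.
df = (
    spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "json")                 # placeholder source format
        .option("cloudFiles.useNotifications", "true")
        # credentials Auto Loader uses to set up the Event Grid subscription and queue
        .option("cloudFiles.subscriptionId", subscription_id)
        .option("cloudFiles.tenantId", tenant_id)
        .option("cloudFiles.clientId", client_id)
        .option("cloudFiles.clientSecret", client_secret)
        .option("cloudFiles.resourceGroup", resource_group)
        .load("abfss://container@account.dfs.core.windows.net/input/")
)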
I have two GitHub repos configured in the Databricks Repos folder. repo_1 is run using a job, and repo_2 is run/called from repo_1 using the dbutils.notebook.run command: dbutils.notebook.run("/Repos/repo_2/notebooks/notebook", 0, args). I am getting the follo...
I am having a similar issue... ecw_staging_nb_List = ['/Workspace/Repos/PRIMARY/UVVC_DATABRICKS_EDW/silver/nb_UPSERT_stg_ecw_insurance', '/Repos/PRIMARY/UVVC_DATABRICKS_EDW/silver/nb_UPSERT_stg_ecw_facilitygroups'] Adding workspace d...
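For anyone hitting the same error, here is a minimal sketch of the call pattern discussed in this thread: calling a notebook in a second repo from the first by its absolute workspace path. The paths and arguments below are placeholders.
args = {"run_date": "2024-01-01"}                      # illustrative parameters
result = dbutils.notebook.run(
    "/Repos/repo_2/notebooks/notebook",                # absolute workspace path to the child notebook
    0,                                                 # timeout in seconds, as in the post above
    args,
)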
Greetings, when trying to run the following command:
%sh
mlflow sagemaker build-and-push-container
I get the following error: /databricks/python3/lib/python3.9/site-packages/click/core.py:2309: UserWarning: Virtualenv support is still experimental and ...
I am working on a Databricks notebook and trying to display a map using Folium, and I keep getting this error: Command result size exceeds limit: Exceeded 20971520 bytes (current = 20973510). How can I increase the memory limit? I already reduced the...
Hi, I have the same problem with keplergl, and the save-to-disk option, whilst helpful, isn't super practical... So how does one plot large datasets in Kepler? Any thoughts welcome.
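As a concrete version of the save-to-disk workaround mentioned above (the map and file path are illustrative), the map can be written to a file so the cell output stays under the 20 MB result limit, and only a trimmed-down map is rendered inline if needed.
import folium

m = folium.Map(location=[47.6, -122.3], zoom_start=10)   # placeholder map
m.save("/dbfs/FileStore/maps/my_map.html")                # write the HTML out instead of rendering it
# displayHTML(m._repr_html_())                            # only for maps small enough to render inline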
Is there a way to delete files recursively using a command in notebooks? In the below directory I have many combinations of files like .txt, .png, .jpg, but I only want to delete files with .csv, for example dbfs:/FileStore/.csv*
Hi @Rakesh Reddy Gopidi, you can use the os module to iterate over a directory. By looping over the directory, you can check what each file ends with using .endswith(".csv"). After fetching all the files, you can remove them. Hope this helps. Cheers.
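A small sketch of that approach (the directory is an example; DBFS paths are reachable under /dbfs when using local file APIs):
import os

target_dir = "/dbfs/FileStore"
for root, _, files in os.walk(target_dir):       # walk the directory tree recursively
    for name in files:
        if name.endswith(".csv"):                # keep only .csv files
            os.remove(os.path.join(root, name))  # delete the file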
I have a command that is running notebooks in parallel using threading. I want the command to fail whenever one of the notebooks that is running fails; right now it just continues to run. Below is the command that I'm currently ru...
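One way to get that behaviour, sketched below with placeholder notebook paths, is concurrent.futures: calling .result() on each future re-raises any exception thrown inside dbutils.notebook.run, so the cell fails as soon as a child notebook fails.
from concurrent.futures import ThreadPoolExecutor

notebooks = ["/Repos/project/nb_a", "/Repos/project/nb_b"]    # placeholder paths

def run_notebook(path):
    return dbutils.notebook.run(path, 0)                      # raises if the child notebook fails

with ThreadPoolExecutor(max_workers=4) as pool:
    futures = [pool.submit(run_notebook, p) for p in notebooks]
    results = [f.result() for f in futures]                   # .result() re-raises worker exceptions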
I wish to run a Scala command, which I believe would normally be run from a Scala command line rather than from within a notebook. It happens to be: scala [-cp scalatest-<version>.jar:...] org.scalatest.tools.Runner [arguments] (scalatest_2.12__3.0.8.j...
Hi @David Vardy, hope all is well! Just wanted to check in to see if you were able to resolve your issue; if so, would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you. Thanks...
Hello Team, I am using the df.write command and the table is getting created. If you refer to the screenshot below, the table got created in the Tables folder in the dedicated SQL pool, but I need it in the External Tables folder. Regards, RK
If you actually write into Synapse, it is not an external table; the data resides in Synapse. If you want an external table, write the data to your data lake in Parquet/Delta Lake format and then create an external table on that location in s...
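A rough sketch of that suggestion (the storage path is a placeholder): land the data in the lake with Spark, then define the external table in Synapse over that location rather than loading the rows into the dedicated pool.
(
    df.write.format("delta")          # or "parquet"
      .mode("overwrite")
      .save("abfss://lake@account.dfs.core.windows.net/external/my_table")
)
# The external table (or a serverless SQL view) is then created in Synapse
# pointing at that storage path, instead of copying the data into the pool.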
Hi all, I'm trying to run some functions from another notebook (data_process_notebook) in my main notebook, using the %run command. When I run the command %run ../path/to/data_process_notebook, it is able to complete successfully, no path, pe...
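For context, a minimal sketch of the pattern being described (function and path names are illustrative): the helper is defined in data_process_notebook, and after %run it is available in the main notebook's namespace.
# cell in data_process_notebook
def clean_columns(df):
    return df.toDF(*[c.strip().lower() for c in df.columns])

# cell in the main notebook (the %run magic must be the only content of its cell):
#   %run ../path/to/data_process_notebook
# clean_columns(my_df) can then be called directly from the main notebook.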
Hi, since yesterday, for no known reason, some commands that used to run daily are now stuck in a "Running command" state. Commands such as: dataframe.toPandas(), dataframe.show(n=1), dataframe.description(), dataframe.write.format("csv").save(location), ge...
Hi @Luiz Carneiro, could you split your Spark actions across more cells (paragraphs) and run them one at a time to check where the extra time is being spent? Also, Pandas only runs on your driver. Have you tried using the Python or Scala APIs instead? In ca...
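A small sketch of that advice, reusing the names from the post above: keep the heavy work on the executors and only bring a bounded amount of data back to the driver.
display(dataframe.limit(1))                      # inspect a row without collecting the whole DataFrame

sample_pdf = dataframe.limit(1000).toPandas()    # bound what is pulled back to the driver

dataframe.write.format("csv").mode("overwrite").save(location)   # write with Spark's writer, not pandas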
Hi all, So far I have been successfully using the CLI interface to upload files from my local machine to DBFS/FileStore/tables. Specifically, I have been using my terminal and the following command: databricks fs cp -r <MyLocalDataset> dbfs:/FileStor...
Hi @Ignacio Castineiras, if Arjun.kr's reply fully answered your question, would you be happy to mark their answer as best so that others can quickly find the solution? Please let us know if you are still having this issue.
Is there any reason this command works well:
%sql
SELECT * FROM datanase.table WHERE salary > 1000
returning 2 rows, while the one below:
%sql
DELETE FROM datanase.table WHERE salary > 1000
gives the error: Error in SQL statement: AssertionError: assertion failed:...
DELETE FROM (and similarly UPDATE) isn't supported on Parquet files right now on Databricks; it's supported for the Delta format. You can convert your Parquet files into Delta using CONVERT TO DELTA, and then this command will work for you.
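A minimal sketch of that conversion, reusing the table name from the question (the identifiers are just the ones in the post):
spark.sql("CONVERT TO DELTA datanase.`table`")                   # one-time conversion of the Parquet table
spark.sql("DELETE FROM datanase.`table` WHERE salary > 1000")    # DELETE is supported once the table is Delta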