Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

shreyag
by New Contributor II
  • 3194 Views
  • 2 replies
  • 0 kudos

scheduling tasks through CLI

Is there a way to schedule tasks or jobs through the Databricks CLI instead of the GUI? I want to be able to create a job flow with different notebooks through the CLI.

Latest Reply
Atanu
Databricks Employee
  • 0 kudos

I agree with @Kaniz Fatma​. https://docs.databricks.com/dev-tools/cli/jobs-cli.html is the Jobs CLI we currently support, @Shreya Gupta​.

1 More Replies
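As a runnable complement to the Jobs CLI link above: a job can be defined as JSON and created with `databricks jobs create --json-file job.json`. A minimal sketch in Python follows; the notebook paths, cron expression, and cluster id are placeholders, not values from this thread.

```python
import json

# Sketch: build a Jobs API 2.1-style payload with a schedule and two
# dependent notebook tasks, then write it out for the Jobs CLI.
# All paths, the cron expression, and the cluster id are placeholders.
job_settings = {
    "name": "nightly-etl",
    "schedule": {
        "quartz_cron_expression": "0 0 2 * * ?",  # every day at 02:00
        "timezone_id": "UTC",
        "pause_status": "UNPAUSED",
    },
    "tasks": [
        {
            "task_key": "ingest",
            "notebook_task": {"notebook_path": "/Repos/etl/ingest"},
            "existing_cluster_id": "1234-567890-abcde123",
        },
        {
            "task_key": "transform",
            "depends_on": [{"task_key": "ingest"}],
            "notebook_task": {"notebook_path": "/Repos/etl/transform"},
            "existing_cluster_id": "1234-567890-abcde123",
        },
    ],
}

with open("job.json", "w") as f:
    json.dump(job_settings, f, indent=2)

# Then, from a shell:
#   databricks jobs create --json-file job.json
#   databricks jobs run-now --job-id <returned-job-id>
```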
Alx
by Databricks Partner
  • 4296 Views
  • 1 reply
  • 0 kudos

Problem with network security group (NSG) rules in case of VNet injection

Hi everyone, our internal company security policy for the cloud infrastructure requires a custom outbound NSG rule that denies all traffic. The rule's attributes should be as follows: Priority: 4096, Port: Any, Protocol: Any, Source: Any, Destination: An...

Latest Reply
Atanu
Databricks Employee
  • 0 kudos

Hello @Alexey Tyulyaev​, please check https://docs.microsoft.com/en-us/azure/virtual-network/manage-network-security-group

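For illustration only, the deny-all rule described in the question can be written down as plain data, together with a check on the priority range Azure allows for custom NSG rules (100-4096). This mirrors the attributes listed in the post; it is not a call to any real Azure API.

```python
# Sketch: the deny-all outbound rule from the post, as a plain dict.
# Field names are illustrative, not an Azure SDK schema.
deny_all_outbound = {
    "priority": 4096,        # lowest custom precedence, evaluated last
    "direction": "Outbound",
    "access": "Deny",
    "protocol": "*",
    "source": "*",
    "destination": "*",
    "destination_port_range": "*",
}

def is_valid_priority(rule: dict) -> bool:
    """Azure custom NSG rule priorities must be between 100 and 4096."""
    return 100 <= rule["priority"] <= 4096
```

Putting the deny rule at priority 4096 leaves room for higher-precedence allow rules (such as the ones a VNet-injected workspace needs) to be evaluated first.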
alejandrofm
by Valued Contributor
  • 6487 Views
  • 3 replies
  • 3 kudos

Resolved! Delta, the specified key does not exist error

Hi, I'm having this error too frequently on a few tables. I checked on S3 and the partition exists and the file is there in the partition. Error: Spectrum Scan Error: DeltaManifest; code: 15005; context: Error fetching Delta Lake manifest delta/product/sub_...

Latest Reply
alejandrofm
Valued Contributor
  • 3 kudos

@Hubert Dudek​, I'll add that sometimes just running GENERATE symlink_format_manifest FOR TABLE schema.table solves it. But how can the symlink get broken? Thanks!

2 More Replies
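As a sketch of the two statements this thread points at (assuming a Delta table named schema.table): regenerating the manifest by hand, and setting the table property that keeps the manifest updated on every write so external readers such as Redshift Spectrum do not hit a stale symlink. The spark.sql calls are left commented out since they need a live Spark session.

```python
# Sketch: manifest maintenance for a Delta table read by external engines.
# "schema.table" is a placeholder table name.
table = "schema.table"

# One-off regeneration (what the reply above runs by hand):
regenerate = f"GENERATE symlink_format_manifest FOR TABLE {table}"

# Keep the manifest in sync automatically on every write, so the
# symlink cannot go stale between manual regenerations:
auto_update = (
    f"ALTER TABLE {table} SET TBLPROPERTIES "
    "(delta.compatibility.symlinkFormatManifest.enabled = true)"
)

# In a Databricks notebook / Spark session:
# spark.sql(regenerate)
# spark.sql(auto_update)
```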
study_community
by New Contributor III
  • 17872 Views
  • 8 replies
  • 3 kudos

Not able to move files from local to dbfs through dbfs CLI

Hi folks, I have installed and configured the Databricks CLI on my local machine. I tried to copy a local file from my personal computer using dbfs cp to a dbfs:/ path. I can see the file is copied from local, but it is only visible locally. I am not able to ...

Latest Reply
Anonymous
Not applicable
  • 3 kudos

Hi, could you try to save the file from your local machine to the dbfs:/FileStore location?
# Put local file test.py to dbfs:/FileStore/test.py
dbfs cp test.py dbfs:/FileStore/test.py

7 More Replies
shrewdTurtle
by New Contributor II
  • 4117 Views
  • 2 replies
  • 3 kudos

Cannot open Jobs tab in Databricks Community edition.

Hi, I get the following exception when I try to open the Jobs tab: Uncaught TypeError: Cannot read properties of undefined (reading 'apply'). Reload the page and try again. If the error persists, contact support. Reference error code: fd9ae37c18c1400cb15...

Latest Reply
shrewdTurtle
New Contributor II
  • 3 kudos

@Kaniz Fatma​, @Werner Stinckens​, thanks for the clarification. I agree with @Werner Stinckens​ that the error message should be more useful.

1 More Replies
Jan_A
by New Contributor III
  • 7390 Views
  • 3 replies
  • 5 kudos

Resolved! Move/Migrate database from dbfs root (s3) to other mounted s3 bucket

Hi, I have a Databricks database that was created in the dbfs root S3 bucket and contains managed tables. I am looking for a way to move/migrate it to a mounted S3 bucket instead and keep the database name. Any good ideas on how this can be done? T...

Latest Reply
User16753724663
Databricks Employee
  • 5 kudos

Hi @Jan Ahlbeck​, we can use the property below to set the default location:
"spark.sql.warehouse.dir": "S3 URL/dbfs path"
Please let me know if this helps.

2 More Replies
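Beyond changing the default warehouse location, one hedged way to do the migration asked about above is to create a database at the mounted location and DEEP CLONE each managed table into it; keeping the original database name would then mean dropping and recreating it afterwards. The database, table, and mount names below are placeholders.

```python
# Sketch: migrate managed tables to a database backed by a mounted bucket
# using Delta DEEP CLONE. Names below are placeholders.
src_db, dst_db = "mydb", "mydb_s3"
mount_location = "/mnt/my-bucket/mydb"  # placeholder mount point

statements = [f"CREATE DATABASE IF NOT EXISTS {dst_db} LOCATION '{mount_location}'"]

# In practice: [t.name for t in spark.catalog.listTables(src_db)]
for tbl in ["events", "users"]:
    statements.append(f"CREATE TABLE {dst_db}.{tbl} DEEP CLONE {src_db}.{tbl}")

# In a Databricks notebook / Spark session:
# for s in statements:
#     spark.sql(s)
```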
databrick_comm
by New Contributor II
  • 6812 Views
  • 3 replies
  • 0 kudos

Not able to connect to Denodo VDP from Databricks

I would like to connect to Denodo VDP from a Databricks workspace. I have installed the ODBC client and the Denodo JAR on the cluster, but I do not understand the other steps. Could you please help me?

Latest Reply
User16753724663
Databricks Employee
  • 0 kudos

Hi @sathyanarayan kokku​, are you trying to install the Denodo VDP server in Databricks?

2 More Replies
NAS
by New Contributor III
  • 3065 Views
  • 1 reply
  • 1 kudos

Resolved! "import pandas as pd" => [Errno 5]

When I type import pandas as pd from a notebook in a Repo, I get: AttributeError Traceback (most recent call last) /usr/lib/python3.8/importlib/_boots...

Latest Reply
NAS
New Contributor III
  • 1 kudos

Thanks to Elliott Hertz, I found out that the ML Experiments cannot be stored in the repo. After I moved them to my Workspace everything seems to work.

RohanB
by New Contributor III
  • 8073 Views
  • 8 replies
  • 3 kudos

Resolved! Spark Streaming - Checkpoint State EOF Exception

I have a Spark Structured Streaming job which reads from 2 Delta tables as streams, processes the data, and then writes to a 3rd Delta table. The job is run with the Databricks service on GCP. Sometimes the job fails with the following exception...

Latest Reply
RohanB
New Contributor III
  • 3 kudos

Hi @Jose Gonzalez​, do you require any more information regarding the code? Any idea what could be the cause of the issue? Thanks and regards, Rohan

7 More Replies
SCOR
by New Contributor II
  • 3382 Views
  • 3 replies
  • 4 kudos

SparkJDBC42.jar issue?

Hi there! I am using SparkJDBC42.jar in my Java application to access my Delta Lake tables. The connection is made through a Databricks SQL endpoint, in which I created a database and stored my Delta tables. I have a simple code to open a connection...

Latest Reply
jose_gonzalez
Databricks Employee
  • 4 kudos

Hi @Seifeddine SNOUSSI​, are you still having this issue, or were you able to resolve it? Please let us know.

2 More Replies
Kody_Devl
by New Contributor II
  • 3092 Views
  • 1 reply
  • 0 kudos

HTML Backup Import Into my Account

Hi all, I would like to import my HTML notebook backups into my Databricks account and use them as if they were my master (I am a developer and have many exported HTML backups that I may want to reuse). When you open an .HTML from backup, Databricks has, ...

Latest Reply
jose_gonzalez
Databricks Employee
  • 0 kudos

Hi @Ross Crill​, did @Kaniz Fatma​'s reply help you resolve your question? Please let us know.

Dunken
by New Contributor III
  • 6154 Views
  • 7 replies
  • 3 kudos

Resolved! Databricks and CD4ML

I would like to use Databricks in a CD4ML way (see also https://martinfowler.com/articles/cd4ml.html). Is this possible? I would like to develop and train models in one environment; once qualified, I would like to deploy the model with the application...

Latest Reply
Atanu
Databricks Employee
  • 3 kudos

Is something like the below what you are looking for, @Armin Galliker​?

6 More Replies
brickster_2018
by Databricks Employee
  • 6764 Views
  • 2 replies
  • 2 kudos
Latest Reply
MoJaMa
Databricks Employee
  • 2 kudos

Since Workflows (Multi-Task jobs) is now GA, one way to work around the 1000 concurrent jobs limit is to use tasks within a job. Each job can have 100 tasks, and these tasks do not count toward the concurrent job limit.

1 More Replies
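The task-based workaround described above can be sketched by chunking a batch of work into jobs of at most 100 tasks each, so a large fan-out stays well under the concurrent-jobs limit. The notebook path and cluster id are placeholders.

```python
# Sketch: pack 250 units of work into multi-task jobs of <= 100 tasks each.
# Notebook path and cluster id are placeholders.
MAX_TASKS_PER_JOB = 100
runs = [{"partition": i} for i in range(250)]  # 250 units of work

def chunk(seq, size):
    """Split seq into consecutive slices of at most `size` items."""
    return [seq[i:i + size] for i in range(0, len(seq), size)]

jobs = []
for n, batch in enumerate(chunk(runs, MAX_TASKS_PER_JOB)):
    jobs.append({
        "name": f"batch-{n}",
        "tasks": [
            {
                "task_key": f"part-{r['partition']}",
                "notebook_task": {
                    "notebook_path": "/Repos/etl/process",  # placeholder
                    "base_parameters": {"partition": str(r["partition"])},
                },
                "existing_cluster_id": "1234-567890-abcde123",  # placeholder
            }
            for r in batch
        ],
    })
```

Each dict in `jobs` is shaped like a Jobs API job-settings payload, so the same chunking logic can feed job creation through the CLI or REST API.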
alejandrofm
by Valued Contributor
  • 4372 Views
  • 4 replies
  • 5 kudos

Resolved! Show Vacuum operation result (files deleted) without DRY RUN

Hi, I'm running some scheduled vacuum jobs and would like to know how many files were deleted without doing all the computation twice (with and without DRY RUN). Is there a way to accomplish this? Thanks!

Latest Reply
RKNutalapati
Valued Contributor
  • 5 kudos

We have to enable logging to capture the logs for vacuum:
spark.conf.set("spark.databricks.delta.vacuum.logging.enabled", "true")

3 More Replies
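Building on the reply above: with vacuum logging enabled, VACUUM writes START and END entries into the table's Delta history, and the END entry's operationMetrics carries the deleted-file count. A small sketch follows; the sample history rows are made up for illustration, and the commented lines show where real values would come from.

```python
# In a Databricks notebook / Spark session:
# spark.conf.set("spark.databricks.delta.vacuum.logging.enabled", "true")
# history = [row.asDict() for row in
#            spark.sql("DESCRIBE HISTORY schema.table").collect()]

def deleted_file_counts(history):
    """Return numDeletedFiles from each 'VACUUM END' history entry."""
    return [
        int(row["operationMetrics"]["numDeletedFiles"])
        for row in history
        if row["operation"] == "VACUUM END"
    ]

# Made-up rows shaped like DESCRIBE HISTORY output, for illustration only:
sample = [
    {"operation": "VACUUM START", "operationMetrics": {"numFilesToDelete": "42"}},
    {"operation": "VACUUM END", "operationMetrics": {"numDeletedFiles": "42"}},
]
```

This reads the counts the vacuum already computed, so no second DRY RUN pass is needed.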
Oliver_Floyd
by Contributor
  • 3212 Views
  • 2 replies
  • 3 kudos

Resolved! How to update external metastore cluster configuration on the fly ?

Hello, in my use case, my data is pushed to an ADLS Gen2 container called ingest. After some data processing on a Databricks cluster of the ingest workspace, I declare the associated table in an external metastore for this workspace. At the end of this pr...

Latest Reply
Oliver_Floyd
Contributor
  • 3 kudos

Hello @Atanu Sarkar​, thank you for your answer. I have created a feature request. I hope it will be accepted soon ^^

1 More Replies