Data Engineering

Forum Posts

Sorted by:

by Dean_Lovelace • New Contributor III

04-05-2023 3:22:49 AM

22742 Views
13 replies
2 kudos

How can I deploy workflow jobs to another databricks workspace?

I have created a number of workflows in the Databricks UI. I now need to deploy them to a different workspace.How can I do that?Code can be deployed via Git, but the job definitions are stored in the workspace only.

Data Engineering

22742 Views
13 replies
2 kudos

04-05-2023 3:22:49 AM

View Replies

Latest Reply

Walter_C
Databricks Employee

01-12-2025 4:47:07 AM

2 kudos

@itacdonev great option provided, @Dean_Lovelace you can also select the option View JSON on the Workflow and move to the option create, with this code you can use the API https://docs.databricks.com/api/workspace/jobs/create and create the job in th...

2 kudos

01-12-2025 4:47:07 AM

12 More Replies

by HariharaSam • Contributor

01-12-2022 11:45:58 PM

28340 Views
8 replies
4 kudos

Resolved! To get Number of rows inserted after performing an Insert operation into a table

Consider we have two tables A & B.qry = """INSERT INTO Table ASelect * from Table B where Id is null """spark.sql(qry)I need to get the number of records inserted after running this in databricks.

Data Engineering

28340 Views
8 replies
4 kudos

01-12-2022 11:45:58 PM

View Replies

Latest Reply

GRCL
New Contributor III

06-15-2023 1:27:28 AM

4 kudos

Almost same advice than Hubert, I use the history of the delta table :df_history.select(F.col('operationMetrics')).collect()[0].operationMetrics['numOutputRows']You can find also other 'operationMetrics' values, like 'numTargetRowsDeleted'.

4 kudos

06-15-2023 1:27:28 AM

7 More Replies

by chhavibansal • New Contributor III

01-17-2023 1:22:22 AM

1059 Views
1 replies
0 kudos

What is the upper bound limit for dataSkippingNumIndexedCols, to keeps stats in delta log file?

Is there an upper bound of number that i can assign to delta.dataSkippingNumIndexedCols for computing statistics. Is there some tradeoff benchmark available for increasing this number beyond 32.

Data Engineering

1059 Views
1 replies
0 kudos

01-17-2023 1:22:22 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-08-2023 8:21:43 PM

0 kudos

@Chhavi Bansal :The delta.dataSkippingNumIndexedCols configuration property controls the maximum number of columns that Delta Lake will build statistics on during data skipping. By default, this value is set to 32. There is no hard upper bound on th...

0 kudos

03-08-2023 8:21:43 PM

by youssefmrini • Databricks Employee

03-01-2023 6:26:39 AM

2250 Views
1 replies
4 kudos

Resolved! Can I limit the max number of clusters per user ?

Data Engineering

2250 Views
1 replies
4 kudos

03-01-2023 6:26:39 AM

View Replies

Latest Reply

youssefmrini
Databricks Employee

03-01-2023 6:27:29 AM

4 kudos

You can now use cluster policies to restrict the number of clusters a user can create. For more information https://docs.databricks.com/administration-guide/clusters/policies.html#cluster-limit

4 kudos

03-01-2023 6:27:29 AM

Databricks Community

How can I deploy workflow jobs to another databricks workspace?

Resolved! To get Number of rows inserted after performing an Insert operation into a table

What is the upper bound limit for dataSkippingNumIndexedCols, to keeps stats in delta log file?

Resolved! Can I limit the max number of clusters per user ?