Data Engineering

Forum Posts


by swzzzsw • New Contributor III

01-24-2022 11:17:24 AM

11756 Views
4 replies
9 kudos

"Run now with different parameters" - different parameters not recognized by jobs involving multiple tasks

I'm running a Databricks job involving multiple tasks and would like to run the job with a different set of task parameters. I can achieve that by editing each task and changing the parameter values. However, it gets very manual when I have a lot of tas...


Latest Reply

VijayNakkonda
New Contributor II

03-03-2025 2:13:02 AM

9 kudos

Dear Team, I found a workaround for now: disconnect the bundle source on Databricks, edit the parameters you want to run with, and, after execution, redeploy your code from the repository.
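
For readers who would rather trigger the run programmatically than edit each task, here is a minimal sketch that builds (but does not send) a request to the Jobs API `run-now` endpoint. It assumes every task in the job is a notebook task, so that the values in `notebook_params` are passed to each task; the host, token, job ID, and parameter name are hypothetical placeholders, and this is not the workaround described in the reply above.

```python
import json
import urllib.request


def build_run_now_request(host, token, job_id, notebook_params):
    """Build a POST request for the Jobs API run-now endpoint without sending it."""
    payload = {
        "job_id": job_id,
        # Assumption: the job's tasks are notebook tasks, so these key/value
        # pairs override the notebook parameters for this run only.
        "notebook_params": notebook_params,
    }
    return urllib.request.Request(
        url=f"{host.rstrip('/')}/api/2.1/jobs/run-now",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Hypothetical values for illustration only.
request = build_run_now_request(
    host="https://example.cloud.databricks.com",
    token="<personal-access-token>",
    job_id=123,
    notebook_params={"run_date": "2022-01-24"},
)
print(request.full_url)
```

Passing the request to `urllib.request.urlopen` would submit the run. Whether the overrides reach every task depends on the task types in the job, so treat this as a starting point to check against the API reference.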


by Gopal269673 • Contributor

05-02-2023 7:30:00 PM

2593 Views
2 replies
0 kudos

Calling jobs inside another job

Hi all, I have created 2 job flows, one for the transaction layer and another for the datamart layer. I need to specify a dependency between job1 and job2, triggering job2 after job1 completes, without using any other orchestration tool o...


Latest Reply

Priyag1
Honored Contributor II

05-02-2023 7:35:06 PM

0 kudos

Verify with the documentation.
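
One way the dependency in the question could be expressed, assuming the workspace supports the Run Job task type, is a small wrapper job whose two tasks each trigger an existing job, with the second depending on the first. The sketch below only builds the job specification as a dictionary (the shape expected by the Jobs API `create` endpoint); the job name, task keys, and job IDs are hypothetical.

```python
import json


def build_chained_job_spec(name, first_job_id, second_job_id):
    """Return a job spec in which the second job runs only after the first finishes."""
    return {
        "name": name,
        "tasks": [
            {
                # Triggers the existing transaction-layer job (hypothetical ID).
                "task_key": "transaction_layer",
                "run_job_task": {"job_id": first_job_id},
            },
            {
                # Triggers the datamart-layer job once the task above succeeds.
                "task_key": "datamart_layer",
                "depends_on": [{"task_key": "transaction_layer"}],
                "run_job_task": {"job_id": second_job_id},
            },
        ],
    }


# Hypothetical job IDs for illustration only.
spec = build_chained_job_spec("transaction_then_datamart", 111, 222)
print(json.dumps(spec, indent=2))
```

The resulting dictionary can be sent as the JSON body when creating the wrapper job, which keeps the ordering inside Databricks rather than in an external orchestrator.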


by Tacuma • New Contributor II

01-11-2023 2:04:35 PM

2935 Views
4 replies
1 kudos

Scheduling jobs with Airflow results in each task running multiple jobs.

Hey everyone, I'm experimenting with running containerized PySpark jobs in Databricks and orchestrating them with Airflow. However, I am encountering an issue. When I trigger an Airflow DAG and look at the logs, I see that Airflow is spinni...


Latest Reply

Tacuma
New Contributor II

01-16-2023 4:51:01 AM

1 kudos

Both, I guess? Yes, all jobs share the same config. My question is why there are 3 job runs in the same Airflow task log. I'm hoping there's something in the configs that may give me some kind of clue.


by dbrick • New Contributor II

07-11-2022 5:57:39 AM

1860 Views
1 reply
1 kudos

Multiple Jobs with different resource requirements on the same cluster

I have a large cluster with auto-scaling enabled (min: 1, max: 25). I want to run multiple jobs on that cluster with different values of Spark properties (`--executor-cores` and `--executor-memory`), but I don't see any option to specify the sam...


Latest Reply

Vidula
Honored Contributor

09-03-2022 1:30:17 AM

1 kudos

Hi @Neelesh databricks, hope everything is going great. Just wanted to check in to see whether you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell ...


by pawelmitrus • Contributor

07-24-2022 5:31:17 AM

5998 Views
4 replies
1 kudos

Why Databricks spawns multiple jobs

I have a Delta table `spark101.airlines` (sourced from `/databricks-datasets/airlines/`) partitioned by `Year`. My `spark.sql.shuffle.partitions` is set to the default of 200. I run a simple query: select Origin, count(*) from spark101.airlines group by Origi...


Latest Reply

User16753725469
Databricks Employee

09-01-2022 12:01:18 AM

1 kudos

Could you please paste the query plan here so we can analyse the issue?

