Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

by MartinH (New Contributor II)
  • 7290 Views
  • 7 replies
  • 5 kudos

Resolved! Azure Data Factory and Photon

Hello, we have Databricks Python workbooks accessing Delta tables. These workbooks are scheduled/invoked by Azure Data Factory. How can I enable Photon on the linked services that are used to call Databricks? If I specify a new job cluster, there does n...

Latest Reply
CharlesReily (New Contributor III)
  • 5 kudos

When you create a cluster on Databricks, you can enable Photon by selecting the "Photon" option in the cluster configuration settings. This is typically done when creating a new cluster, and you would find the option in the advanced cluster configura...
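For readers driving this from ADF via the Jobs API, here is a minimal sketch of a job-cluster spec with Photon enabled. The workspace URL, token, notebook path, node type, and runtime version are placeholders; the relevant field is runtime_engine in the Clusters API (older runtimes instead used a "-photon-" spark_version string such as "9.1.x-photon-scala2.12").

import requests

# Placeholders -- replace with your workspace URL, PAT, and notebook path.
HOST = "https://adb-1234567890123456.7.azuredatabricks.net"
TOKEN = "dapi-..."

job_spec = {
    "name": "adf-photon-example",
    "tasks": [
        {
            "task_key": "run_notebook",
            "notebook_task": {"notebook_path": "/Users/someone@example.com/my_notebook"},
            "new_cluster": {
                "spark_version": "13.3.x-scala2.12",
                "node_type_id": "Standard_E8ds_v4",
                "num_workers": 2,
                "runtime_engine": "PHOTON",  # enables Photon on the job cluster
            },
        }
    ],
}

resp = requests.post(
    f"{HOST}/api/2.1/jobs/create",
    headers={"Authorization": f"Bearer {TOKEN}"},
    json=job_spec,
)
resp.raise_for_status()
print(resp.json()["job_id"])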

6 More Replies
by User16826992666 (Valued Contributor)
  • 1281 Views
  • 1 reply
  • 3 kudos

When developing Delta Live Tables, is there a way to see the query history?

I am not sure where I can look currently to see how my DLT queries are performing. How can I investigate the query plan for past DLT runs?

Latest Reply
Priyanka_Biswas (Databricks Employee)
  • 3 kudos

Hello @Trevor Bishop You can check the query plan in the Spark UI, SQL tab. You would need to select the past run from the dropdown and click on Spark UI. Additionally, an event log is created and maintained for every Delta Live Tables pipeline. The event ...
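As a rough sketch of querying that event log, assuming the pipeline uses the default DBFS storage location (swap in your configured storage path or pipeline id):

# The DLT event log is a Delta table kept under <storage>/system/events.
events = spark.read.format("delta").load(
    "dbfs:/pipelines/<pipeline-id>/system/events"
)

# flow_progress events carry per-flow metrics (rows written, backlog, etc.),
# a useful proxy for how individual DLT queries performed across past runs.
(events.filter("event_type = 'flow_progress'")
       .select("timestamp", "origin.flow_name", "details")
       .orderBy("timestamp", ascending=False)
       .show(truncate=False))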

by ricperelli (New Contributor II)
  • 2132 Views
  • 0 replies
  • 1 kudos

How can I save a parquet file using pandas with a Data Factory orchestrated notebook?

Hi guys, this is my first question, feel free to correct me if I'm doing something wrong. Anyway, I'm facing a really strange problem: I have a notebook in which I'm performing some pandas analysis, after that I save the resulting dataframe in a parque...
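Though the post is truncated, a common cause of this symptom is pandas writing to the driver's local disk, which an ADF-launched job cluster discards on termination. A minimal sketch of the two usual workarounds follows; the paths are hypothetical and the /dbfs FUSE mount is assumed to be available on the cluster.

import pandas as pd

pdf = pd.DataFrame({"a": [1, 2, 3]})  # stand-in for the real analysis result

# Option 1: write through the /dbfs FUSE mount so the file lands in DBFS
# rather than on the driver's ephemeral local filesystem.
pdf.to_parquet("/dbfs/tmp/analysis_result.parquet")

# Option 2: hand the frame to Spark and write to any mounted/abfss location.
spark.createDataFrame(pdf).write.mode("overwrite").parquet(
    "dbfs:/tmp/analysis_result_spark"
)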

by irfanaziz (Contributor II)
  • 5966 Views
  • 4 replies
  • 0 kudos

Resolved! If two Data Factory pipelines run at the same time or share a window of execution, do they share the Databricks Spark cluster (if both have the same linked service)? (Job clusters are those that are created on the fly, defined in the linked service.)

Continuing the above case, does that mean that if I have several (say 5) ADF pipelines scheduled regularly at the same time, it's better to use an existing cluster, since all of the ADF pipelines would share the same cluster and hence the cost would be lower?

Latest Reply
Atanu (Databricks Employee)
  • 0 kudos

For ADF or job runs we generally prefer a job cluster, but for streaming you may consider an interactive cluster. Either way, you need to monitor the cluster load; if the load is high, there is a chance of job slowness as well as failures. Also, data siz...
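For context, the two options map to different shapes of the ADF Databricks linked service. A rough sketch of the relevant typeProperties, with placeholder values throughout, shown as Python dicts for readability:

# New job cluster per activity run: each concurrent pipeline run gets its
# own cluster, so runs never contend, but each pays cluster start-up time.
new_job_cluster_props = {
    "domain": "https://adb-<workspace-id>.azuredatabricks.net",
    "newClusterVersion": "13.3.x-scala2.12",
    "newClusterNodeType": "Standard_DS3_v2",
    "newClusterNumOfWorker": "2",
}

# Existing interactive cluster: every pipeline referencing this linked
# service shares one cluster -- cheaper for many small overlapping runs,
# but they also share (and can exhaust) its resources.
existing_cluster_props = {
    "domain": "https://adb-<workspace-id>.azuredatabricks.net",
    "existingClusterId": "<cluster-id>",
}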

3 More Replies