Data Engineering

Forum Posts

Sorted by:

by Hubert-Dudek • Esteemed Contributor III

04-24-2023 7:24:48 AM

3033 Views
2 replies
9 kudos

databricks Photon is a next-generation engine on the Databricks Lakehouse Platform that provides speedy query performance at a low cost.- Its function...

databricks Photon is a next-generation engine on the Databricks Lakehouse Platform that provides speedy query performance at a low cost.- Its function coverage is growing, and UDF under Photon is coming, which can bring significant improvements in us...

Data Engineering

3033 Views
2 replies
9 kudos

04-24-2023 7:24:48 AM

View Replies

Latest Reply

Hubert-Dudek
Esteemed Contributor III

04-25-2023 4:29:14 AM

9 kudos

9 kudos

04-25-2023 4:29:14 AM

1 More Replies

by JLSy • New Contributor III

04-16-2023 7:29:52 PM

14882 Views
5 replies
6 kudos

cannot convert Parquet type INT64 to Photon type string

I am receiving an error similar to the post in this link: https://community.databricks.com/s/question/0D58Y00009d8h4tSAA/cannot-convert-parquet-type-int64-to-photon-type-doubleHowever, instead of type double the error message states that the type can...

Data Engineering

14882 Views
5 replies
6 kudos

04-16-2023 7:29:52 PM

View Replies

Latest Reply

Anonymous
Not applicable

04-18-2023 1:53:12 AM

6 kudos

@John Laurence Sy :It sounds like you are encountering a schema conversion error when trying to read in a Parquet file that contains an INT64 column that cannot be converted to a string type. This error can occur when the Parquet file has a schema t...

6 kudos

04-18-2023 1:53:12 AM

4 More Replies

by cgrant • Databricks Employee

06-09-2021 3:12:47 PM

3997 Views
4 replies
6 kudos

How do I know how much of a query/job used Photon?

I'm trying to use the native execution engine, Photon. How can I tell if a query is using Photon or is falling back to the non-native Spark engine?

Data Engineering

3997 Views
4 replies
6 kudos

06-09-2021 3:12:47 PM

View Replies

Latest Reply

venkat09
New Contributor III

01-21-2023 5:05:52 PM

6 kudos

Typo error in my second point of the previous post. Click the execution plan of your task[this is available under SQL/Dataframe tab in Spark UI]. It explains what operations run in the photon engine and what didn't execute by photon.

6 kudos

01-21-2023 5:05:52 PM

3 More Replies

by auser85 • New Contributor III

12-16-2022 5:30:15 AM

4650 Views
2 replies
2 kudos

cannot convert Parquet type INT64 to Photon type double

I am trying to read in files via the COPY INTO command but I am getting this error lately for a certain subset of the data;`Error while reading file: Schema conversion error: cannot convert Parquet type INT64 to Photon type double`These are my option...

Data Engineering

4650 Views
2 replies
2 kudos

12-16-2022 5:30:15 AM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-16-2022 7:00:59 AM

2 kudos

hey @Andrew Fogarty I also faced the same issue when I moved from the 7.3 LTS version to a higher runtime version so to mitigate this issue you can use the below cluster configuration spark.sql.storeAssignmentPolicy LEGACYspark.sql.parquet.binaryAsS...

2 kudos

12-16-2022 7:00:59 AM

1 More Replies

by lawrence009 • Contributor

11-13-2022 2:59:53 PM

3362 Views
3 replies
7 kudos

Photon does not fully support the query because of dynamic pruning

Does it still make sense to run this job on a cluster with Photon enable when I am receiving the following?This is the code I ran:CREATE OR REPLACE TABLE ${tbl_name}_dups SELECT src.*, ROW_NUMBER() OVER ( PARTITION BY src.id ...

Data Engineering

3362 Views
3 replies
7 kudos

11-13-2022 2:59:53 PM

View Replies

Latest Reply

PriyaAnanthram
Contributor III

11-13-2022 4:04:17 PM

7 kudos

hmm could you show us what your query is

7 kudos

11-13-2022 4:04:17 PM

2 More Replies

by Sajid1 • Contributor

10-10-2022 5:34:52 AM

4086 Views
3 replies
6 kudos

Resolved! Photon Acceleration not getting enabled for ML runtime in Azure

I tried to enable the photon acceleration in ML runtime 9.1 LTS ML (Scala 2.12,Spark 3.1.2) but getting error "selected runtime version does not support photon".I tried for other versions of ML runtime with single and multinode , access mode being s...

Data Engineering

4086 Views
3 replies
6 kudos

10-10-2022 5:34:52 AM

View Replies

Latest Reply

Anonymous
Not applicable

11-16-2022 10:39:35 PM

6 kudos

Hi @Sajid Thavalengal Rahiman Does @Kaniz Fatma response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?We'd love to hear from you.Thanks!

6 kudos

11-16-2022 10:39:35 PM

2 More Replies

by AJMorgan591 • New Contributor II

09-21-2022 1:29:50 PM

3750 Views
4 replies
0 kudos

Temporarily disable Photon

Is it possible to temporarily disable Photon?I have a large workload that greatly benefits from Photon apart from a specific operation therein that is actually slowed by Photon. It's not worth creating a separate cluster for this operation however, s...

Data Engineering

3750 Views
4 replies
0 kudos

09-21-2022 1:29:50 PM

View Replies

Latest Reply

Anonymous
Not applicable

10-13-2022 1:26:24 AM

0 kudos

Hi @Aaron Morgan Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thank...

0 kudos

10-13-2022 1:26:24 AM

3 More Replies

by User16826994223 • Honored Contributor III

06-28-2021 6:26:25 AM

1411 Views
0 replies
0 kudos

Some of the limitation I see In docs of photon until now july 2021 is Works on Delta and Parquet tables only for both read and write.Does not suppor...

Some of the limitation I see In docs of photon until now july 2021 is Works on Delta and Parquet tables only for both read and write.Does not support the following data types:MapArrayDoes not support window and sort operatorsDoes not support Spark S...

Data Engineering

1411 Views
0 replies
0 kudos

06-28-2021 6:26:25 AM

by User16826987838 • Contributor

06-25-2021 12:42:16 PM

1387 Views
1 replies
0 kudos

What type of aws instance and how many are used for an L sized Databricks SQL(SQLA) cluster ?

What type of aws instance and how many are used for an L sized Databricks SQL(SQLA) cluster with Photon enabled

Data Engineering

1387 Views
1 replies
0 kudos

06-25-2021 12:42:16 PM

View Replies

Latest Reply

Taha
Databricks Employee

06-25-2021 4:25:21 PM

0 kudos

L size is 16 workers of i3.8xlarge

0 kudos

06-25-2021 4:25:21 PM

by Srikanth_Gupta_ • Databricks Employee

06-23-2021 8:23:24 AM

962 Views
1 replies
1 kudos

Can we use Photon for batch and streaming process instead of Spark, when will be available for public?

Data Engineering

962 Views
1 replies
1 kudos

06-23-2021 8:23:24 AM

View Replies

Latest Reply

aladda
Databricks Employee

06-23-2021 12:03:30 PM

1 kudos

Photon is supported for batch workloads today and is the standard on Databricks SQL clusters and available as an option for Automated and Interactive clusters. And photon is in public preview today so available as an option for everyone. See this lin...

1 kudos

06-23-2021 12:03:30 PM

by Anonymous • Not applicable

06-02-2021 4:38:45 PM

961 Views
1 replies
0 kudos

Photon usage

How do I know how much of a query/job used Photon?

Data Engineering

961 Views
1 replies
0 kudos

06-02-2021 4:38:45 PM

View Replies

Latest Reply

sajith_appukutt
Honored Contributor II

06-23-2021 12:28:16 AM

0 kudos

If you are using Photon on Databricks SQLClick the Query History icon on the sidebar.Click the line containing the query you’d like to analyze.On the Query Details pop-up, click Execution Details.Look at the Task Time in Photon metric at the bottom.

0 kudos

06-23-2021 12:28:16 AM

by User16826992666 • Valued Contributor

06-22-2021 8:17:55 AM

1494 Views
1 replies
0 kudos

Resolved! In Databricks SQL how can I tell if my query is using Photon?

I have turned Photon on in my endpoint, but I don't know if it's actually being used in my queries. Is there some way I can see this other than manually testing queries with Photon turned on and off?

Data Engineering

1494 Views
1 replies
0 kudos

06-22-2021 8:17:55 AM

View Replies

Latest Reply

Digan_Parikh
Valued Contributor

06-22-2021 10:50:20 AM

0 kudos

@Trevor Bishop If you go to the History tab in DBSQL, click on the specific query and look at the execution details. At the bottom, you will see "Task time in Photon".

0 kudos

06-22-2021 10:50:20 AM

by User16826994223 • Honored Contributor III

06-15-2021 9:10:34 AM

1555 Views
1 replies
0 kudos

What is Photon in DataBricks

Hey I am new to Databricks and heard of photon , which is the fastest engine developed by Databricks , Will it make the query faster , what about Concurrency of the queries , will it increase

Data Engineering

1555 Views
1 replies
0 kudos

06-15-2021 9:10:34 AM

View Replies

Latest Reply

Mooune_DBU
Valued Contributor

06-18-2021 11:31:29 AM

0 kudos

Photon is databrick's brand new native vectorized engine developed in C++ for improved query performance (speed and concurrency). It integrates directly with the Databricks Runtime and Spark, meaning no code changes are required to use Photon. At thi...

0 kudos

06-18-2021 11:31:29 AM

by User16826994223 • Honored Contributor III

06-17-2021 6:15:38 AM

956 Views
1 replies
0 kudos

Start photon cluster

How to start a photon cluster, where I can fins the pricing of photon Cluster

Data Engineering

956 Views
1 replies
0 kudos

06-17-2021 6:15:38 AM

View Replies

Latest Reply

craig_ng
New Contributor III

06-18-2021 9:53:24 AM

0 kudos

As of the time of this message, Photon availability in the Data Science & Engineering workspace in Public Preview on AWS. You can reference our docs for instructions on how to provision a cluster using a Photon-enabled runtime. As for pricing, we tre...

0 kudos

06-18-2021 9:53:24 AM

by MoJaMa • Databricks Employee

06-17-2021 6:10:39 PM

1097 Views
1 replies
0 kudos

What is this Photon Engine I keep hearing about?

Data Engineering

1097 Views
1 replies
0 kudos

06-17-2021 6:10:39 PM

View Replies

Latest Reply

MoJaMa
Databricks Employee

06-17-2021 6:13:51 PM

0 kudos

It's our new high-performance runtime, using a native vectorized engine developed in C++.Please see our blog for a great overview. https://databricks.com/blog/2021/06/17/announcing-photon-public-preview-the-next-generation-query-engine-on-the-databri...

0 kudos

06-17-2021 6:13:51 PM