Data Engineering

Forum Posts

Sorted by:

by Dataengineer_mm • New Contributor

03-13-2023 5:04:23 PM

1784 Views
2 replies
1 kudos

Surrogate key using identity column.

I want to create a surrogate in the delta table And i used the identity column id-Generated as DefaultCan i insert rows into the delta table using only spark.sql like Insert query ? or i can also use write delta format options? If i use the df.write ...

Data Engineering

1784 Views
2 replies
1 kudos

03-13-2023 5:04:23 PM

View Replies

Latest Reply

Kaniz_Fatma
Community Manager

03-18-2023 3:30:09 AM

1 kudos

Hi @Menaka Murugesan(Customer), We haven’t heard from you since the last response from @Nandini N (Customer), and I was checking back to see if her suggestions helped you.Or else, If you have any solution, please share it with the community, as ...

1 kudos

03-18-2023 3:30:09 AM

1 More Replies

by bluesky111 • New Contributor II

03-14-2023 2:49:49 AM

1306 Views
2 replies
3 kudos

Resolved! I Input the wrong schedule time for the exams can it be reschedule ?

Helo today ,i think i was scheduled to do an exams at 2.15 PM but unfortunately i made a mistake put the time to 2.15 AM, could it be rescheduled? i already submit a ticket to https://help.databricks.com/s/contact-us?ReqType=training but no reply yet...

Data Engineering

1306 Views
2 replies
3 kudos

03-14-2023 2:49:49 AM

View Replies

Latest Reply

Kaniz_Fatma
Community Manager

03-18-2023 3:24:26 AM

3 kudos

Hi @heron halim (Customer), We haven't heard from you since the last response from @Akshay Padmanabhan, and I was checking back to see if his suggestions helped you. Or else, If you have any solution, please share it with the community, as it ca...

3 kudos

03-18-2023 3:24:26 AM

1 More Replies

by Philearner • New Contributor II

03-16-2023 8:15:20 PM

1715 Views
3 replies
3 kudos

Unable to find input by typing input in the Multiselect Widget

In the AWS databricks widgets.multiselect, I'm unable to find input by typing input in the mulitselect bar. It was working before. Although I can find the inputs by scrolling down the list, it's annoying if the list is long.Here's my script:measlis...

Data Engineering

1715 Views
3 replies
3 kudos

03-16-2023 8:15:20 PM

View Replies

Latest Reply

Anonymous
Not applicable

03-17-2023 11:14:06 PM

3 kudos

Hi @Philip Teu Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thanks!

3 kudos

03-17-2023 11:14:06 PM

2 More Replies

by Sas • New Contributor II

03-12-2023 5:49:40 PM

2926 Views
3 replies
4 kudos

Resolved! Confusion in string comparison

Hello expertI am new to spark. I am using same price of code but getting different resultsWhen i am using below piece of code, i am getting errorpy4j.Py4JException: Method or([class java.lang.String]) does not existdf.filter(F.col("state").isNull() ...

Data Engineering

2926 Views
3 replies
4 kudos

03-12-2023 5:49:40 PM

View Replies

Latest Reply

Anonymous
Not applicable

03-18-2023 12:35:10 AM

4 kudos

Hi @Saswata Dutta Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.Please help us select the best solution by clicking on "Select As Best" if it does.Your feedbac...

4 kudos

03-18-2023 12:35:10 AM

2 More Replies

by sagiatul • New Contributor II

03-15-2023 4:55:39 AM

3544 Views
2 replies
3 kudos

Databricks driver logs

I am running jobs on databricks clusters. When the cluster is running I am able to find the executor logs by going to Spark Cluster UI Master dropdown, selecting a worker and going through the stderr logs. However, once the job is finished and cluste...

Data Engineering

3544 Views
2 replies
3 kudos

03-15-2023 4:55:39 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-18-2023 12:23:15 AM

3 kudos

Hi @Atul Arora Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.Please help us select the best solution by clicking on "Select As Best" if it does.Your feedback w...

3 kudos

03-18-2023 12:23:15 AM

1 More Replies

by saikrishna3390 • New Contributor II

03-12-2023 11:51:49 AM

5172 Views
2 replies
2 kudos

How do I configure managed identity to databricks cluster and access azure storage using spark config

Partner want to use adf managed identity to connect to my databricks cluster and connect to my azure storage and copy the data from my azure storage to their azure storage storage

Data Engineering

5172 Views
2 replies
2 kudos

03-12-2023 11:51:49 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-18-2023 12:17:55 AM

2 kudos

Hi @SAI PUSALA Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.Please help us select the best solution by clicking on "Select As Best" if it does.Your feedback w...

2 kudos

03-18-2023 12:17:55 AM

1 More Replies

by js54123875 • New Contributor III

03-17-2023 7:54:34 AM

1608 Views
2 replies
2 kudos

Failure to initialize configuration' on SQL Warehouse Tables

Yesterday I had a basic DLT pipeline up and running, and was able to query the hive_metastore tables successfully. The pipeline uses autloader to ingest a few csv files from cloud storage to streaming live bronze and silver tables. Today after star...

Data Engineering

1608 Views
2 replies
2 kudos

03-17-2023 7:54:34 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-17-2023 11:56:39 PM

2 kudos

Hi @Jennette Shepard Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.Please help us select the best solution by clicking on "Select As Best" if it does.Your feed...

2 kudos

03-17-2023 11:56:39 PM

1 More Replies

by User16752244127 • Contributor

03-16-2023 2:24:04 AM

1088 Views
2 replies
4 kudos

Resolved! DLT code examples and notebooks?

we like the examples that you show in webinars especially with DLT and Huggingface or DLT with ingestion from Kafka, are they publicly available?

Data Engineering

1088 Views
2 replies
4 kudos

03-16-2023 2:24:04 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-17-2023 11:25:20 PM

4 kudos

Hi @Frank Munz Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.Please help us select the best solution by clicking on "Select As Best" if it does.Your feedback w...

4 kudos

03-17-2023 11:25:20 PM

1 More Replies

by Vijay_Bhau • New Contributor II

03-12-2023 11:22:30 PM

1790 Views
4 replies
3 kudos

Hello Team, I am not able to find the bookstore dataset in Databricks. Please guide me to how to download this dataset

Data Engineering

1790 Views
4 replies
3 kudos

03-12-2023 11:22:30 PM

View Replies

Latest Reply

Anonymous
Not applicable

03-17-2023 11:24:27 PM

3 kudos

Hi @Vijay Gadhave Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Than...

3 kudos

03-17-2023 11:24:27 PM

3 More Replies

by nirajtanwar • New Contributor

03-16-2023 4:29:00 AM

1232 Views
2 replies
2 kudos

To collect the elements of a SparkDataFrame and coerces them into an R dataframe.

Hello Everyone,I am facing the challenge while collecting a spark dataframe into an R dataframe, this I need to do as I am using TraMineR algorithm whih is implemented in R only and the data pre-processing I have done in pysparkI am trying this:event...

Data Engineering

1232 Views
2 replies
2 kudos

03-16-2023 4:29:00 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-17-2023 11:20:04 PM

2 kudos

Hi @Niraj Tanwar Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thank...

2 kudos

03-17-2023 11:20:04 PM

1 More Replies

by Arunsundar • New Contributor III

03-13-2023 12:59:03 AM

1767 Views
4 replies
3 kudos

Automating the initial configuration of dbx

Hi Team,Good morning.As of now, for the deployment of our code to Databricks, dbx is configured providing the parameters such as cloud provider, git provider, etc., Say, I have a code repository in any one of the git providers. Can this process of co...

Data Engineering

1767 Views
4 replies
3 kudos

03-13-2023 12:59:03 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-17-2023 11:18:23 PM

3 kudos

Hi @Arunsundar Muthumanickam Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear fr...

3 kudos

03-17-2023 11:18:23 PM

3 More Replies

by Mado • Valued Contributor II

03-16-2023 5:02:26 AM

3560 Views
4 replies
1 kudos

Resolved! How to set properties for a delta table when I want to write a DataFrame?

Hi,I have a PySpark DataFrame with 11 million records. I created the DataFrame on a cluster. It is not saved on DBFS or storage account. import pyspark.sql.functions as F from pyspark.sql.functions import col, when, floor, expr, hour, minute, to_time...

Data Engineering

3560 Views
4 replies
1 kudos

03-16-2023 5:02:26 AM

View Replies

Latest Reply

Lakshay
Esteemed Contributor

03-16-2023 5:57:04 AM

1 kudos

Hi @Mohammad Saber , Are you getting the error while writing the file to the table? Or before that?

1 kudos

03-16-2023 5:57:04 AM

3 More Replies

by Andrei_Radulesc • Contributor III

02-22-2023 4:30:47 AM

3834 Views
3 replies
3 kudos

Resolved! FutureWarning: Deprecated in 3.0.0. Use SparkSession.builder.getOrCreate() instead.

I'm trying to get rid of the warning below:/databricks/spark/python/pyspark/sql/context.py:117: FutureWarning: Deprecated in 3.0.0. Use SparkSession.builder.getOrCreate() instead.In my setup, I have a front-end notebook that gets parameters from the ...

Data Engineering

3834 Views
3 replies
3 kudos

02-22-2023 4:30:47 AM

View Replies

Latest Reply

Andrei_Radulesc
Contributor III

03-17-2023 1:26:53 PM

3 kudos

That fixes it. Thanks. I need to do spark = SparkSession.builder.getOrCreate() df = spark.table("prod.some_schema.some_table")instead of sc = SparkSession.builder.getOrCreate() sqlc = SQLContext(sc) df = sqlc.table(f"prod.some_schema.some...

3 kudos

03-17-2023 1:26:53 PM

2 More Replies

by sage5616 • Valued Contributor

03-15-2023 7:40:21 AM

3398 Views
1 replies
3 kudos

Resolved! Set Workflow Job Concurrency Limit

Hi Everyone,I need a job to be triggered every 5 minutes. However, if that job is already running, it must not be triggered again until that run is finished. Hence, I need to set the maximum run concurrency for that job to only one instance at a time...

Data Engineering

3398 Views
1 replies
3 kudos

03-15-2023 7:40:21 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-17-2023 8:48:42 AM

3 kudos

@Michael Okulik :To ensure that a Databricks job is not triggered again until a running instance of the job is completed, you can set the maximum concurrency for the job to 1. Here's how you can configure this in Databricks:Go to the Databricks work...

3 kudos

03-17-2023 8:48:42 AM

by sandeepv • New Contributor II

03-10-2023 6:29:54 AM

1651 Views
3 replies
0 kudos

Databricks Spark certification voucher code expired

Hi Team,I am getting error that voucher code expired when trying to register for "Databricks Certified Associate Developer for Apache Spark 3.0 - Python" certification.Can you please help here

Data Engineering

1651 Views
3 replies
0 kudos

03-10-2023 6:29:54 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-16-2023 7:59:24 PM

0 kudos

Hi @Sandeep Venishetti Hope everything is going great.Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell ...

0 kudos

03-16-2023 7:59:24 PM

2 More Replies

User

Count

1602

738

348

285

247

Databricks Community

Forum Posts

Surrogate key using identity column.

Resolved! I Input the wrong schedule time for the exams can it be reschedule ?

Unable to find input by typing input in the Multiselect Widget

Resolved! Confusion in string comparison

Databricks driver logs

How do I configure managed identity to databricks cluster and access azure storage using spark config

Failure to initialize configuration' on SQL Warehouse Tables

Resolved! DLT code examples and notebooks?

Hello Team, I am not able to find the bookstore dataset in Databricks. Please guide me to how to download this dataset

To collect the elements of a SparkDataFrame and coerces them into an R dataframe.

Automating the initial configuration of dbx

Resolved! How to set properties for a delta table when I want to write a DataFrame?

Resolved! FutureWarning: Deprecated in 3.0.0. Use SparkSession.builder.getOrCreate() instead.

Resolved! Set Workflow Job Concurrency Limit

Databricks Spark certification voucher code expired

when to activate photon and when not to ?

Databricks with Private cloud

Pyspark serialization

Getting com.databricks.client.jdbc.Driver is not f...

Unit Testing DLT Pipelines