Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Forum Posts

jonxu
by New Contributor III
  • 3576 Views
  • 2 replies
  • 1 kudos

Resolved! Streaming vs batch, unbounded vs bounded

Can anyone help me understand why we cannot unify streaming with batch, and unbounded with bounded, if we regard streaming/unbounded as a mini-version of batch/bounded? I.e., if I set one second as the frequency for batch processing, will it...

Latest Reply
jonxu
New Contributor III
  • 1 kudos

Many thanks for the clarification!

1 More Replies
PremPrakash
by New Contributor II
  • 1505 Views
  • 2 replies
  • 1 kudos

Resolved! Using instance profile for sns message publish with PassRole

Hi, I want to attach an instance profile to the compute and publish messages to SNS without using credentials. Is that possible? Has anyone used it? Will Boto3 support it?

Latest Reply
PremPrakash
New Contributor II
  • 1 kudos

Yes, I have tried it and it is working.

1 More Replies
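Since the thread confirms this works, here is a minimal sketch of what the publishing code can look like on a cluster whose instance profile grants sns:Publish. The topic ARN, region, and payload shape below are illustrative, not from the thread:

```python
import json

def build_sns_publish_args(topic_arn: str, payload: dict) -> dict:
    # Build the keyword arguments for the SNS Publish call.
    return {"TopicArn": topic_arn, "Message": json.dumps(payload)}

def publish(topic_arn: str, payload: dict) -> str:
    # boto3 resolves credentials through its default chain; on a cluster
    # with an instance profile attached it picks up the role from the
    # instance metadata service, so no keys appear in code.
    import boto3  # imported lazily so the helper above works offline
    client = boto3.client("sns", region_name="us-east-1")  # region is an assumption
    resp = client.publish(**build_sns_publish_args(topic_arn, payload))
    return resp["MessageId"]
```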
stiaangerber
by Databricks Partner
  • 1389 Views
  • 1 replies
  • 0 kudos

Simba ODBC for ARM-based Linux

Hi, is there an ARM build of the Simba ODBC driver available for Linux? I've seen this thread (for Mac): https://community.databricks.com/t5/data-engineering/problems-connecting-simba-odbc-with-a-m1-macbook-pro/td-p/20566 but it seems that there are only ...

Latest Reply
VZLA
Databricks Employee
  • 0 kudos

@stiaangerber unfortunately not. The Linux ODBC driver should run on any distribution as long as the CPU is amd64/x86_64, but we don't have an ARM build.

sandeephenkel23
by New Contributor III
  • 2726 Views
  • 3 replies
  • 0 kudos

QuantileDiscretizer is not whitelisted error

Dear team, we observed that while attempting to use the following imports: from pyspark.sql import functions as F; from pyspark.ml.feature import QuantileDiscretizer, we are encountering the following error: Py4JSecurityException: QuantileDiscretizer is not ...

Latest Reply
VZLA
Databricks Employee
  • 0 kudos

@sandeephenkel23 I've run the same code on DBR 13.3 LTS, and 1) it imports successfully, 2) I can confirm it is among the whitelisted libraries. Hence I'm wondering if there's anything else particular in your use case triggering this. Is your use case a...

2 More Replies
RoelofvS
by New Contributor III
  • 3241 Views
  • 5 replies
  • 0 kudos

Schema evolution in Autoloader not evolving beyond version 0

I am working through the current version of the standard Auto Loader demo, i.e. dbdemos.install('auto-loader'). That is, data gets read into a dataframe but never written to a target table. The notebook is "01-Auto-loader-schema-evolution-Ingestion". Compute i...

Latest Reply
RoelofvS
New Contributor III
  • 0 kudos

Hello @Brahmareddy, I have tried the above without success. Regarding "enable detailed logging to trace schema evolution steps": can you please guide me with the steps or a URL? We are on AWS. Kind regards, Roelof

4 More Replies
Rishabh-Pandey
by Databricks MVP
  • 2466 Views
  • 1 replies
  • 1 kudos

Enhanced Cost Management for Serverless Compute

Budget policies include tags that are applied to serverless compute activity incurred by assigned users. These tags are recorded in your billing records, allowing you to attribute specific serverless usage to designated budgets. For more information...

Latest Reply
Rafael-Sousa
Contributor II
  • 1 kudos

Thanks for sharing.

AIDENEMAN
by New Contributor
  • 1860 Views
  • 1 replies
  • 0 kudos

Passing values

Hello, how can I pass a parameter value between two jobs that are not nested and have separate Terraform configurations and notebooks?

Latest Reply
Rafael-Sousa
Contributor II
  • 0 kudos

Currently, there is no dedicated parameter store or variables section in Databricks for easily sharing values between jobs. One approach is to save the parameter in the Hive metastore or an S3 bucket, then retrieve it in each job. Alternatively, you ...

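To make the suggested hand-off concrete, here is a minimal sketch using a shared file. A local path stands in for the S3 or /dbfs location the reply mentions; the path and parameter names are assumptions for illustration:

```python
import json

def write_param(path: str, name: str, value) -> None:
    # First job: persist the parameter to a location both jobs can reach
    # (an S3 or /dbfs path in practice).
    with open(path, "w") as f:
        json.dump({name: value}, f)

def read_param(path: str, name: str):
    # Second job: read the parameter back at the start of its run.
    with open(path) as f:
        return json.load(f)[name]
```

Each job stays independent; the separate Terraform definitions only need to agree on the shared path.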
User16826994223
by Databricks Employee
  • 8609 Views
  • 5 replies
  • 7 kudos

How to access a final Delta table from a web application or interface.

I have a final gold-layer Delta table that holds the final aggregated data from the silver layer. I want to access this final layer of data through a web interface. I think I need to write a web script that would run Spark SQL behind the scenes to get the d...

Latest Reply
h_h_ak
Contributor
  • 7 kudos

You can also use direct statement execution from databricks: https://docs.databricks.com/api/workspace/statementexecution

4 More Replies
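As a sketch of the Statement Execution approach, a web backend can query the gold table with nothing but an HTTP client and a SQL warehouse. The host, token, warehouse id, and timeout below are placeholders:

```python
import json
import urllib.request

def build_statement_request(warehouse_id: str, sql: str) -> dict:
    # Payload for POST /api/2.0/sql/statements (Statement Execution API).
    return {"warehouse_id": warehouse_id, "statement": sql, "wait_timeout": "30s"}

def execute_statement(host: str, token: str, warehouse_id: str, sql: str) -> dict:
    # Send the statement; the web app needs no Spark cluster of its own,
    # only the workspace host, a token, and a running SQL warehouse.
    req = urllib.request.Request(
        f"https://{host}/api/2.0/sql/statements",
        data=json.dumps(build_statement_request(warehouse_id, sql)).encode(),
        headers={"Authorization": f"Bearer {token}",
                 "Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```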
olivier-soucy
by Contributor
  • 2950 Views
  • 5 replies
  • 2 kudos

Resolved! Spark Structured Streaming foreachBatch with databricks-connect

Hello! I'm trying to use the foreachBatch method of a Spark Streaming DataFrame with databricks-connect. Given that Spark Connect support was added to `foreachBatch` in 3.5.0, I was expecting this to work. Configuration: - DBR 15.4 (Spark 3.5.0) - dat...

Latest Reply
VZLA
Databricks Employee
  • 2 kudos

Thanks for sharing the solution! Just curious, was the original error message reported in this post in the Driver log as well?

4 More Replies
alvaro_databric
by New Contributor III
  • 3801 Views
  • 2 replies
  • 2 kudos

How to access hard disk attached to cluster?

Hi, I am using the Lasv3 VM family, which incorporates an NVMe SSD. I would like to take advantage of this huge amount of space, but I cannot find where this disk is mounted. Does someone know where this disk is mounted and whether it can be used as a local dri...

Latest Reply
JosiahJohnston
New Contributor III
  • 2 kudos

Great question; I've been trying to hunt that down too. `/local_disk0` looks like a good candidate, but it has restricted access and I can't confirm or use it. Would love to learn a solution someday. This is a big need for hybrid workflows & libraries c...

1 More Replies
Anand4
by New Contributor II
  • 3260 Views
  • 1 replies
  • 2 kudos

Resolved! Delta Table - Partitioning

Created a streaming job with a Delta table as the target. The table did not have a partition when created earlier; however, I would like to add an existing column as a partition column. I am getting the following error: com.databricks.sql.transaction.tahoe...

Latest Reply
Alberto_Umana
Databricks Employee
  • 2 kudos

Hi @Anand4, Delta Lake does not support altering the partitioning of an existing table directly. Therefore, the way forward is to rewrite the entire table with the new partition column.

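A sketch of that rewrite, assuming a PySpark session and a managed Delta table (the table and column names are placeholders; overwriteSchema is needed because the table definition itself changes, and a streaming writer targeting the table should be stopped first):

```python
def rewrite_options() -> dict:
    # Overwriting the schema lets the table definition (including the new
    # partitioning) be replaced along with the data.
    return {"overwriteSchema": "true"}

def rewrite_with_partition(spark, table: str, partition_col: str) -> None:
    # Read the full table and write it back partitioned by the new column.
    df = spark.read.table(table)
    (df.write.format("delta")
       .mode("overwrite")
       .partitionBy(partition_col)
       .options(**rewrite_options())
       .saveAsTable(table))
```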
mvmiller
by New Contributor III
  • 10934 Views
  • 4 replies
  • 2 kudos

Troubleshooting _handle_rpc_error GRPC Error

I am trying to run the following chunk of code in a cell of a Databricks notebook (using Databricks Runtime 14.3 LTS, Apache Spark 3.5.0, Scala 2.12): spark.sql("CREATE OR REPLACE table sample_catalog.sample_schema.sample_table_tmp AS SELECT * FROM...

Latest Reply
kunalmishra9
Contributor
  • 2 kudos

Following. Also having this issue, but within the context of pivoting a DF, then aggregating by *

3 More Replies
ChristianRRL
by Honored Contributor
  • 2806 Views
  • 7 replies
  • 3 kudos

DLT Potential Bug: File Reprocessing Issue with "cloudFiles.allowOverwrites": "true"

Hi there, I ran into a peculiar case and I'm wondering if anyone else has run into this and can offer an explanation. We have a DLT process to pull CSV files from a landing location and insert (append) them into target tables. We have the setting "cl...

Latest Reply
NandiniN
Databricks Employee
  • 3 kudos

Apologies, that could be an internet or networking issue. In DLT you will be able to change the DBR, but you will have to use a custom image, which may be tricky if you have not done it before. By default, Photon is used in serverless. It may be a ...

6 More Replies
FabianGutierrez
by Contributor
  • 4259 Views
  • 3 replies
  • 1 kudos

Issue with DAB (Databricks Asset Bundle) requesting Terraform files

Hi community, since two days ago we have been receiving the following error when validating and deploying our DAB (Databricks Asset Bundle): "Error: error downloading Terraform: Get "https://releases.hashicorp.com/terraform/1.5.5/index.json": ...

Latest Reply
FabianGutierrez
Contributor
  • 1 kudos

An update: we cannot get the firewall cleared in time, so we need to go for the offline option, that is, downloading everything from Terraform and the Databricks templates, but it is not as clear or intuitive as described. Using their container is unfortunately not an option ...

2 More Replies
pjv
by New Contributor III
  • 2320 Views
  • 1 replies
  • 0 kudos

How to ensure pyspark udf execution is distributed across worker nodes

Hi, I have the following Databricks notebook code defined: pyspark_dataframe = create_pyspark_dataframe(some input data); MyUDF = udf(myfunc, StringType()); pyspark_dataframe = pyspark_dataframe.withColumn('UDFOutput', DownloadUDF(input data columns)); outp...

Latest Reply
VZLA
Databricks Employee
  • 0 kudos

@pjv Can you please try the following; you'll basically want to have more than a single partition: from pyspark.sql import SparkSession; from pyspark.sql.functions import udf; from pyspark.sql.types import StringType; # Initialize Spark session (if not...

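The gist of the reply is to repartition before applying the UDF so rows land on every executor instead of a single partition. A sketch of that idea; the sizing heuristic, column name, and wrapper below are illustrative, not the exact code from the reply:

```python
import math

def choose_num_partitions(num_rows: int, rows_per_partition: int = 10_000) -> int:
    # Simple heuristic: enough partitions that each holds a bounded number
    # of rows, so no single executor ends up doing all the per-row work.
    return max(2, math.ceil(num_rows / rows_per_partition))

def apply_udf_distributed(df, input_col: str, fn, num_rows: int):
    # Hypothetical wrapper: repartition first, then apply the UDF, so the
    # per-row work is spread across all worker nodes.
    from pyspark.sql.functions import udf
    from pyspark.sql.types import StringType
    my_udf = udf(fn, StringType())
    n = choose_num_partitions(num_rows)
    return df.repartition(n).withColumn("UDFOutput", my_udf(input_col))
```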