Data Engineering

Forum Posts

by Aviral-Bhardwaj • Esteemed Contributor III

01-07-2023 8:18:54 AM

1046 Views
2 replies
20 kudos

⏩ Understanding Unity Catalog in Databricks ⏮

In Databricks, the Unity Catalog is a data catalog that allows you to store, access, and manage data within your Databricks workspace. It provides a unified interface for working with data across different s...

Latest Reply

Kaniz
Community Manager

01-10-2023 2:21:51 AM

20 kudos

Nice one! Keep sharing such informative posts.

1 More Replies

by tanjil • New Contributor III

01-08-2023 9:50:11 PM

1278 Views
2 replies
2 kudos

print(flush = True) not working

Hello, I have the following minimal working example using multiprocessing: from multiprocessing import Pool files_list = [('bla', 1, 3, 7), ('spam', 12, 4, 8), ('eggs', 17, 1, 3)] def f(t): print('Hello from child process', flush = Tr...

Latest Reply

tanjil
New Contributor III

01-10-2023 1:58:47 AM

2 kudos

No errors are generated. The code executes successfully, but the print statement for "Hello from child process" does not produce any output.
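One possible explanation, offered as an assumption rather than a confirmed diagnosis: notebook environments often capture only the parent process's stdout, so text printed inside pool workers may never reach the cell output, even with flush=True. A minimal sketch of a workaround is to return the message from the worker and print it in the parent. The function name describe and the message format are illustrative and not taken from the original post.

```python
from multiprocessing import Pool

files_list = [('bla', 1, 3, 7), ('spam', 12, 4, 8), ('eggs', 17, 1, 3)]


def describe(t):
    """Build the message in the child process instead of printing it there."""
    name, *numbers = t
    return f"Hello from child process: {name} {sum(numbers)}"


if __name__ == "__main__":
    # The workers only return strings; the parent process does all printing.
    with Pool(processes=2) as pool:
        for message in pool.map(describe, files_list):
            print(message, flush=True)
```

If the messages are needed while the workers are still running, writing them to a log file from the child process is another option worth trying.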

1 More Replies

by Optum • New Contributor III

02-04-2022 12:07:41 PM

5302 Views
10 replies
4 kudos

Resolved! Databricks JDBC & Remote Write

Hello, I'm trying to write to a Delta table in my Databricks instance from a remote Spark session on a different cluster with the Simba Spark driver. I can do reads, but when I attempt to do a write, I get the following error: { df.write.format("jdbc...

Latest Reply

Atanu
Esteemed Contributor

03-15-2022 10:24:51 PM

4 kudos

Could you try setting the flag to ignore transactions? I'm not sure what the exact flag is, but the JDBC manual should have more details on how to do this.
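The flag mentioned above may be the driver property IgnoreTransactions; treat that name and its value as an assumption and check it against the JDBC driver manual for your driver version. A minimal sketch of appending such a property to a connection URL follows. The host, HTTP path, and helper name are hypothetical placeholders, and authentication properties are omitted.

```python
def with_property(jdbc_url: str, key: str, value: str) -> str:
    """Append a `key=value` property to a semicolon-delimited JDBC URL."""
    return f"{jdbc_url.rstrip(';')};{key}={value}"


# Hypothetical placeholders: substitute your own workspace host and HTTP path.
base_url = (
    "jdbc:spark://<workspace-host>:443/default;"
    "transportMode=http;ssl=1;httpPath=<http-path>"
)

# Assumption: IgnoreTransactions=1 makes the driver ignore transaction calls.
url = with_property(base_url, "IgnoreTransactions", "1")
print(url)
```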

9 More Replies

by User16869510359 • Esteemed Contributor

06-25-2021 3:41:25 PM

3946 Views
2 replies
1 kudos

Why do we need CRC files in Delta logs? How do CRC files help with transaction control in Delta?

Latest Reply

User16752240150
New Contributor II

01-09-2023 2:37:28 PM

1 kudos

Every 10 transactions, the JSON files in the _delta_log are checkpointed into a Parquet file. The .crc file is a checksum that guards against a Parquet file being corrupted in flight.
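To make the file types concrete, here is a small sketch that sorts _delta_log file names into commits, checkpoints, and checksums. It assumes the common single-file naming scheme of a 20-digit zero-padded version followed by .json, .checkpoint.parquet, or .crc; the function name and the sample listing are illustrative only.

```python
import re

# Assumed naming scheme: <20-digit version>.<suffix>
LOG_FILE = re.compile(r"^(\d{20})\.(json|checkpoint\.parquet|crc)$")

KINDS = {"json": "commit", "checkpoint.parquet": "checkpoint", "crc": "checksum"}


def classify(file_name):
    """Return (version, kind) for a _delta_log file name, or None if unrecognized."""
    match = LOG_FILE.match(file_name)
    if match is None:
        return None
    return int(match.group(1)), KINDS[match.group(2)]


# Made-up listing around version 10, where a checkpoint would be expected.
sample_listing = [
    "00000000000000000009.json",
    "00000000000000000010.json",
    "00000000000000000010.crc",
    "00000000000000000010.checkpoint.parquet",
]
for name in sample_listing:
    print(name, "->", classify(name))
```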

1 More Replies

by cybersam • New Contributor II

12-12-2022 12:58:33 PM

613 Views
2 replies
0 kudos

How do I find the documentation for a Databricks "platform release"?

My DB portal says the platform version is v3.86 and provides a link to all the releases. But none of those releases state the "platform version", and I can't find "v3.86" by searching the Databricks docs. So, how does one find the documentation fo...

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-20-2022 6:15:40 AM

0 kudos

Hey @Samuel Yang, you will find all the details here: https://docs.databricks.com/release-notes/index.html Thanks, Aviral Bhardwaj

1 More Replies

by cmilligan • Contributor II

01-06-2023 9:15:18 AM

1385 Views
3 replies
4 kudos

Link a Visio diagram in a markdown cell

Is there a way to have Databricks pull a diagram directly from Visio? I've tried to use the embed links from Visio, but the image won't render. I'm trying to avoid loading the image to DBFS, as there may be updates to the image that I want it to g...

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

01-07-2023 8:07:14 AM

4 kudos

Thanks for this.

2 More Replies

by Sharath • New Contributor II

10-30-2022 6:16:58 AM

971 Views
7 replies
0 kudos

Hi Databricks Team, I passed the Associate Data Engineer exam the day before yesterday but still haven't received the certificate on Accredible or on DB Academy. My registered email ID for the exam is sharath.koushik@gmail.com. Could you please help?

Latest Reply

Anonymous
Not applicable

01-08-2023 10:01:34 PM

0 kudos

Hi @Sharath K, hope all is well! Just wanted to check in: were you able to resolve your issue, and would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you. Thanks!

6 More Replies

by shamly • New Contributor III

01-08-2023 8:20:09 AM

1742 Views
3 replies
2 kudos

How to remove extra ENTER line in CSV UTF-16 while reading

Dear Friends, I have a CSV and it looks like this:
‡‡Id‡‡,‡‡Version‡‡,‡‡Questionnaire‡‡,‡‡Date‡‡
‡‡123456‡‡,‡‡Version2‡‡,‡‡All questions have been answered accurately and the guidance in the questionnaire was understood and followed‡‡,‡‡2010-12-16 00:01:...

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

01-08-2023 8:33:55 PM

2 kudos

This is working fine: from pyspark.sql.functions import regexp_replace path="dbfs:/FileStore/df/test.csv" dff = spark.read.option("header", "true").option("inferSchema", "true").option('multiline', 'true').option('encoding', 'UTF-8').option("delimi...
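Independent of Spark, the idea of collapsing line breaks inside quoted fields can be sketched in plain Python. This assumes that ‡‡ acts as the quote marker, that the data contains no literal double-quote characters, and that the extra ENTER is a newline inside a quoted field. The function name and the sample text are made up for illustration and are not the poster's data.

```python
import csv
import io


def parse_dagger_csv(text):
    """Parse CSV text quoted with '‡‡', collapsing line breaks inside fields."""
    # Swap the two-character marker for a standard one-character quote.
    normalized = text.replace("‡‡", '"')
    reader = csv.reader(io.StringIO(normalized, newline=""))
    # split()/join() turns any run of whitespace, newlines included, into one space.
    return [[" ".join(field.split()) for field in row] for row in reader]


# Made-up sample with a line break inside the last quoted field.
sample = "‡‡Id‡‡,‡‡Version‡‡\n‡‡123456‡‡,‡‡Version\n2‡‡\n"
for row in parse_dagger_csv(sample):
    print(row)
```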

2 More Replies

by Paradox_Parijat • New Contributor III

11-29-2022 10:00:14 AM

1112 Views
6 replies
8 kudos

Hello World! This is my first Databricks community post. Looking forward to contributing from my end. Peace out! @Dinesh Mergu

Latest Reply

Kaniz
Community Manager

01-08-2023 10:37:31 PM

8 kudos

Welcome to the community, @Parijat Dhar!

5 More Replies

by 299305 • New Contributor II

10-30-2022 5:26:13 AM

984 Views
5 replies
4 kudos

Where can I review the exam?

Latest Reply

Anonymous
Not applicable

01-08-2023 10:02:35 PM

4 kudos

Hi @A DB, hope all is well! Just wanted to check in: were you able to resolve your issue, and would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you. Thanks!

4 More Replies

by Anatoly • New Contributor III

10-30-2022 3:29:11 AM

2965 Views
9 replies
4 kudos

"Detected schema change" error while reading from a Delta table in streaming after applying "ALTER COLUMN DROP NOT NULL" to more than one column.

Hi! I have a Delta table and a process that reads a stream from this table. I need to drop the NOT NULL constraint from some of the columns of this table. The first drop command does not affect the reading stream. But the second command results in erro...

Latest Reply

Anonymous
Not applicable

01-08-2023 9:59:43 PM

4 kudos

Hi @Anatoly Tikhonov, hope everything is going great. Does @Kaniz Fatma's response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly? We'd love to hear from you. Thanks!

8 More Replies

by gj0904 • New Contributor III

10-30-2022 1:14:48 AM

1758 Views
8 replies
5 kudos

Certificate not received

Hi there, I successfully passed the exam on 27th Oct 2022, but I haven't received the certificate yet.

Latest Reply

Anonymous
Not applicable

01-08-2023 9:47:07 PM

5 kudos

Hi @Gaurav Jhamb, hope all is well! Just wanted to check in: were you able to resolve your issue, and would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you. Thank...

7 More Replies

by Anuroop • New Contributor II

10-29-2022 12:31:02 PM

1656 Views
6 replies
0 kudos

I've successfully passed the Databricks Certified Data Engineer Associate exam but still have not received the certificate. Could you please help with this?

Latest Reply

Anonymous
Not applicable

01-08-2023 9:46:15 PM

0 kudos

Hi @Venkata Sai Anuroop Samudrala, hope all is well! Just wanted to check in: were you able to resolve your issue, and would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to he...

5 More Replies

by Justin_Stuparit • New Contributor II

10-29-2022 10:16:00 AM

867 Views
2 replies
1 kudos

Configure DLT Pipeline to use existing running cluster

How can I configure a DLT pipeline to use an existing running cluster? I don't see where in the settings I can point the pipeline at an existing cluster. Instead, it always wants to stand up a new cluster.

Latest Reply

Anonymous
Not applicable

01-08-2023 9:44:52 PM

1 kudos

Hi @Justin Stuparitz, hope all is well! Just wanted to check in: were you able to resolve your issue, and would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you. T...

1 More Replies

by alejandrofm • Valued Contributor

10-28-2022 8:47:38 PM

1587 Views
3 replies
0 kudos

Cluster not following the zone: auto configuration

Hi, I'm trying some new instance types, so to be sure I get one, I set the availability zone to 'auto'. But this is happening; the image of the error and the JSON are attached. So, the cluster tries three times to upscale, and all three times it is usi...

Latest Reply

Anonymous
Not applicable

01-08-2023 9:17:19 PM

0 kudos

Hi @Alejandro Martinez, hope everything is going great. Does @Sivaprasad C S's response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly? We'd love to hear from you. Thanks!

2 More Replies
