Data Engineering

Forum Posts

Sorted by:

Start a conversation

by 132736 • New Contributor

04-19-2023 8:01:08 AM

2003 Views
1 replies
0 kudos

Can sql result display more than 25 records per page?

Hi! I have a result table with 41 rows. What should I do to make all rows available on the same page?

Data Engineering

2003 Views
1 replies
0 kudos

04-19-2023 8:01:08 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-23-2023 8:27:22 AM

0 kudos

Hi @wenting_deng wenting_deng Hope everything is going great.Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, pleas...

0 kudos

04-23-2023 8:27:22 AM

by alejandrofm • Valued Contributor

04-20-2023 5:44:19 AM

4250 Views
2 replies
2 kudos

Resolved! Lot of write shuffle on optimize + ZORDER, is it normal?

Hi! I'm optimizing several Tb of partitioned data on ZSTD lvl 9.It surprises me the level of shuffle write, it could make sense because of ZORDER but I want to be sure that I'm not missing something, here is some context: Could I be missing something...

Data Engineering

4250 Views
2 replies
2 kudos

04-20-2023 5:44:19 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-23-2023 8:05:20 AM

2 kudos

Hi @Alejandro Martinez Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best an...

2 kudos

04-23-2023 8:05:20 AM

1 More Replies

by AyushModi038 • New Contributor III

04-21-2023 8:27:56 AM

23848 Views
2 replies
1 kudos

Resolved! Upgrade Python version in cluster

Currently I am using the following cluster. It is using the default python version of 3.9.5 and I would like to update it to 3.10.1.0How to achieve this?

Data Engineering

23848 Views
2 replies
1 kudos

04-21-2023 8:27:56 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-23-2023 7:14:23 AM

1 kudos

Hi @Ayush Modi Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers yo...

1 kudos

04-23-2023 7:14:23 AM

1 More Replies

by knawara • Contributor

02-20-2023 5:08:15 AM

17204 Views
4 replies
3 kudos

Relationship between Databricks Account and Azure resources

Hello,What is the relationship between Databricks Account (as described in [1]) and Azure resources? Is Databricks Account created per Azure account? Or per Azure tenant? Or maybe per subscription?[1] https://learn.microsoft.com/en-us/azure/databrick...

Data Engineering

17204 Views
4 replies
3 kudos

02-20-2023 5:08:15 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-21-2023 10:27:20 PM

3 kudos

Hi @Chris Nawara Hope everything is going great.Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us so ...

3 kudos

04-21-2023 10:27:20 PM

3 More Replies

by elgeo • Valued Contributor II

02-21-2023 3:21:41 AM

8825 Views
1 replies
0 kudos

Iteration - Pyspark vs Pandas

Hello. Could someone please explain why iteration over a Pyspark dataframe is way slower than over a Pandas dataframe?Pysparkdf_list = df.collect()for index in range(0, len(df_list )):.....Pandasdf_pnd = df.toPandas() for index, row in df_p...

Data Engineering

8825 Views
1 replies
0 kudos

02-21-2023 3:21:41 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-22-2023 12:11:56 AM

0 kudos

Hi @ELENI GEORGOUSI Hope everything is going great.Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us ...

0 kudos

04-22-2023 12:11:56 AM

by gentresh • New Contributor III

02-21-2023 10:45:53 AM

3341 Views
1 replies
0 kudos

Running terraform plan; databricks throws a "Permission_Denied: Missing required permissions [View] on node with ID "1759335429158542"

To give you a little bit of background:We use terraform to deploy a resource group with multiple Azure services Terraform leverages an Azure Service Principal that has Owner rights to the Azure subscriptionThis way, databricks is also deployed. We al...

Data Engineering

3341 Views
1 replies
0 kudos

02-21-2023 10:45:53 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-21-2023 11:37:58 PM

0 kudos

Hi @Gent Reshtani Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers...

0 kudos

04-21-2023 11:37:58 PM

by Ogawa • New Contributor III

02-21-2023 12:50:10 AM

4975 Views
10 replies
2 kudos

Selfpaced course link for "Apache Spark developer associate "

Please where can i find the the "Apache Spark developer associate " selfpaced course ?Thanks in advance.

Data Engineering

4975 Views
10 replies
2 kudos

02-21-2023 12:50:10 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-21-2023 11:23:11 PM

2 kudos

Hi @youssef ansari Hope everything is going great.Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us s...

2 kudos

04-21-2023 11:23:11 PM

9 More Replies

by ckwan48 • New Contributor III

02-20-2023 7:00:46 PM

2894 Views
3 replies
1 kudos

Create a Dockerfile from Cluster

Is there a way to create a Dockerfile from Workspace A's cluster configurations and deploy that on a different different cluster in Workspace B?

Data Engineering

2894 Views
3 replies
1 kudos

02-20-2023 7:00:46 PM

View Replies

Latest Reply

Anonymous
Not applicable

04-21-2023 11:17:34 PM

1 kudos

Hi @Kevin Kim Hope everything is going great.Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us so we ...

1 kudos

04-21-2023 11:17:34 PM

2 More Replies

by Tannu858 • New Contributor

02-20-2023 10:43:01 PM

1274 Views
1 replies
0 kudos

Why we require Unity Catalog

Data Engineering

1274 Views
1 replies
0 kudos

02-20-2023 10:43:01 PM

View Replies

Latest Reply

Anonymous
Not applicable

04-21-2023 11:02:38 PM

0 kudos

Hi @Bhupesh Aggarwal Hope everything is going great.Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us...

0 kudos

04-21-2023 11:02:38 PM

by ssy • New Contributor II

02-19-2023 7:20:53 PM

1771 Views
1 replies
1 kudos

How to use SparkNLP library and JohnSnowLabs maven coordinates in cluster which is not connected to internet

Hi,I am trying SparkNLP library for the first time. The cluster I'm using is corporate and cannot be connected to internet. I can only download packages that are provided to us or by using a jar file.I've three questions:What jar files do I need to ...

Data Engineering

1771 Views
1 replies
1 kudos

02-19-2023 7:20:53 PM

View Replies

Latest Reply

Anonymous
Not applicable

04-21-2023 10:22:03 PM

1 kudos

Hi @Samy Syed Hope everything is going great.Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us so we ...

1 kudos

04-21-2023 10:22:03 PM

by Dilorom • New Contributor

02-19-2023 2:51:50 PM

6883 Views
3 replies
4 kudos

What is a recommended directory for creating a database with a specified path?

I was going through Data Engineering with Databricks training, and in DE 3.3L - Databases, Tables & Views Lab section, it says "Defining database directories for groups of users can greatly reduce the chances of accidental data exfiltration." I agree...

Data Engineering

6883 Views
3 replies
4 kudos

02-19-2023 2:51:50 PM

View Replies

Latest Reply

Anonymous
Not applicable

04-21-2023 9:47:04 PM

4 kudos

Hi @Dilorom A Hope everything is going great.Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us so we ...

4 kudos

04-21-2023 9:47:04 PM

2 More Replies

by Smitha1 • Valued Contributor II

12-01-2022 12:54:02 PM

2030 Views
3 replies
1 kudos

December exam voucher for Databricks Certified Associate Developer for Apache Spark 3.0 exam

Dear @Jose Gonzalez Hope you're having great day. This is of HIGH priority for me, I've to schedule exam in December before slots are full.I gave Databricks Certified Associate Developer for Apache Spark 3.0 exam on 30th Nov but missed by one perc...

Data Engineering

2030 Views
3 replies
1 kudos

12-01-2022 12:54:02 PM

View Replies

Latest Reply

Anonymous
Not applicable

04-21-2023 1:53:00 AM

1 kudos

Hi @Smitha Nelapati Thank you for reaching out! Please submit a ticket to our Training Team here: https://help.databricks.com/s/contact-us?ReqType=training and our team will get back to you shortly.

1 kudos

04-21-2023 1:53:00 AM

2 More Replies

by Mani1800 • New Contributor

04-20-2023 5:03:55 AM

2051 Views
2 replies
0 kudos

I need to run Sql Update/Delete commands for a AWS RDS system.

I tried 'jdbc' connection to access the data from the RDS. I was able to read the data successfully but I need to do run some update queries. It seems the jdbc won't support update operation. I tried to make connection to my RDS mysql with host, user...

Data Engineering

2051 Views
2 replies
0 kudos

04-20-2023 5:03:55 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-21-2023 3:14:20 AM

0 kudos

Hi @Manikandan Ramachandran Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear fro...

0 kudos

04-21-2023 3:14:20 AM

1 More Replies

by vinaykumar • New Contributor III

02-18-2023 11:43:28 AM

6723 Views
6 replies
0 kudos

Log files are not getting deleted automatically after logRetentionDuration internal

Hi team Log files are not getting deleted automatically after logRetentionDuration internal from delta log folder and after analysis , I see checkpoint files are not getting created after 10 commits . Below table properties using spark.sql( f""" ...

Data Engineering

6723 Views
6 replies
0 kudos

02-18-2023 11:43:28 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-21-2023 2:26:48 AM

0 kudos

Hi @vinay kumar Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thanks...

0 kudos

04-21-2023 2:26:48 AM

5 More Replies

by UR • New Contributor II

04-10-2023 12:00:08 PM

1822 Views
3 replies
1 kudos

Didn't received the certificate for Databricks Certified Data Engineer Associate exam

@Vidula Khanna @Nadia Elsayed Hi,I pass Databricks Certified Data Engineer Associate exam 48 hours ago. But still didn't received the certificate yet. I also created ticket(00312849) 6 hours ago but still no one reach out to me yet regarding this i...

Data Engineering

1822 Views
3 replies
1 kudos

04-10-2023 12:00:08 PM

View Replies

Latest Reply

Anonymous
Not applicable

04-21-2023 1:53:31 AM

1 kudos

Hi @Urvish Patel Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers ...

1 kudos

04-21-2023 1:53:31 AM

2 More Replies

User

Count

1611

768

348

286

252

Databricks Community

Forum Posts

Can sql result display more than 25 records per page?

Resolved! Lot of write shuffle on optimize + ZORDER, is it normal?

Resolved! Upgrade Python version in cluster

Relationship between Databricks Account and Azure resources

Iteration - Pyspark vs Pandas

Running terraform plan; databricks throws a "Permission_Denied: Missing required permissions [View] on node with ID "1759335429158542"

Selfpaced course link for "Apache Spark developer associate "

Create a Dockerfile from Cluster

Why we require Unity Catalog

How to use SparkNLP library and JohnSnowLabs maven coordinates in cluster which is not connected to internet

What is a recommended directory for creating a database with a specified path?

December exam voucher for Databricks Certified Associate Developer for Apache Spark 3.0 exam

I need to run Sql Update/Delete commands for a AWS RDS system.

Log files are not getting deleted automatically after logRetentionDuration internal

Didn't received the certificate for Databricks Certified Data Engineer Associate exam

Join Us as a Local Community Builder!

global temp view issue

Dlt pipeline showing legacy , even though all thin...

SERVERLESS SQL WAREHOUSE

Unity Catalog Table in Databricks Asset Bundle

Databricks data engineer associate exam