Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Hi, I am new to PySpark and am facing an issue while consuming data from Azure Event Hubs. I am unable to deserialize the consumed data: I see only null values after deserializing the data using the schema. Please find below the schema, Event Hub message, ...
Hi there! It seems there are many different ways to store / manage data in Databricks. This is the Data asset in Databricks: However, data can also be stored (hyperlinks included to relevant pages):
in a Lakehouse
in Delta Lake
on Azure Blob storage
in the D...
Azure.gov does not have Unity Catalog (as of July 2024). I think previous responses missed the context of government cloud in OP's question. UC has been open sourced since this question was asked, and is a more comprehensive solution in commercial cl...
Problem: I'm unable to authenticate against the https://accounts.cloud.databricks.com endpoint even though I'm an account admin. I need it to assign account-level groups to workspaces via the workspace assignment API (https://api-docs.databricks.com/re...
From this doc: To automate Databricks account-level functionality, you cannot use Databricks personal access tokens. Instead, you must use either OAuth tokens for Databricks account admin users or service principals. For more information, see: Use a s...
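The client-credentials (OAuth machine-to-machine) flow the doc refers to can be sketched as below, using a service principal instead of a personal access token. The token endpoint path follows the Databricks OAuth documentation; `account_id`, `client_id`, and `client_secret` are placeholders you must supply:

```python
# Hedged sketch of the OAuth client-credentials flow against the Databricks
# account-level token endpoint. Uses only the standard library.
import base64
import json
import urllib.request

ACCOUNTS_HOST = "https://accounts.cloud.databricks.com"

def token_url(account_id: str) -> str:
    # Account-level OAuth token endpoint (per the Databricks OAuth M2M docs).
    return f"{ACCOUNTS_HOST}/oidc/accounts/{account_id}/v1/token"

def get_account_token(account_id: str, client_id: str, client_secret: str) -> str:
    # Basic auth with the service principal's client ID/secret.
    creds = base64.b64encode(f"{client_id}:{client_secret}".encode()).decode()
    req = urllib.request.Request(
        token_url(account_id),
        data=b"grant_type=client_credentials&scope=all-apis",
        headers={
            "Authorization": f"Basic {creds}",
            "Content-Type": "application/x-www-form-urlencoded",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["access_token"]
```

The returned bearer token can then be passed in the `Authorization` header when calling the workspace assignment API.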
Hello, I have two workspaces, each pointing to a VPC in AWS. In one of the accounts we need to remove a subnet; after removing it, we get the InvalidSubnetID.NotFound AWS error when starting the cluster. Checked in the account console, the network is poin...
Hi, I just explored the serverless feature in Databricks and am wondering how I can track the cost associated with it. Is it stored in system tables? If yes, where can I find it? And also, how can I prove that its cost is relatively lower compared to classic ...
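Serverless usage is recorded in the `system.billing.usage` system table (assuming system tables are enabled on the account). A hedged sketch of a query that sums serverless DBUs per SKU per day, wrapped in a small helper so the SQL can be run via `spark.sql(...)` in a notebook; the column names follow the documented `system.billing.usage` schema:

```python
# Sketch: build a Databricks SQL query over system.billing.usage that
# isolates serverless SKUs. Run the result with spark.sql(...) in Databricks.
def serverless_usage_query(days: int = 30) -> str:
    return f"""
        SELECT sku_name, usage_date, SUM(usage_quantity) AS dbus
        FROM system.billing.usage
        WHERE sku_name LIKE '%SERVERLESS%'
          AND usage_date >= current_date() - INTERVAL {days} DAYS
        GROUP BY sku_name, usage_date
        ORDER BY usage_date
    """

# In a Databricks notebook: display(spark.sql(serverless_usage_query()))
```

Comparing these daily DBU totals against the same query filtered to classic compute SKUs is one way to make the serverless-vs-classic cost case.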
Hi, I recently came across File Trigger in Databricks and find it mostly similar to Auto Loader. My first question is: why use a file trigger when we have Auto Loader? In which scenarios should I go with file triggers versus Auto Loader? Can you please differentiate?
Hi Community. Recently I went through the AI Functions and was amazed by the results. I just wanted to know whether we can use our custom endpoints (instead of Databricks foundation models) and leverage these AI Functions (ai_classify, ai_mask, etc.) https://...
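As far as I'm aware, the task-specific functions (`ai_classify`, `ai_mask`, ...) run on Databricks-hosted foundation models, but the general-purpose `ai_query()` function accepts the name of any model serving endpoint, including a custom one. A hedged sketch of a helper that builds such a query for `spark.sql(...)`; the endpoint, column, and table names are illustrative:

```python
# Sketch: ai_query(endpoint_name, request) routes the request to the named
# model serving endpoint, which may be a custom-model endpoint rather than a
# Databricks foundation model.
def ai_query_sql(endpoint: str, prompt_col: str, table: str) -> str:
    return (
        f"SELECT {prompt_col}, ai_query('{endpoint}', {prompt_col}) AS response "
        f"FROM {table}"
    )

# e.g. spark.sql(ai_query_sql("my-custom-endpoint", "review_text", "reviews"))
```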
AWS by the way, if that matters. We have an old production table that has been running in the background for a couple of years, always with auto-optimize and auto-compaction turned off. Since then, it has written many small files (like 10,000 an hour...
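For a backlog like this, a common approach (not stated in the thread, so treat it as a sketch) is to run `OPTIMIZE` incrementally, bounding each pass with a `WHERE` clause over a partition range, and then `VACUUM` once the compacted files are no longer referenced. Table and predicate below are placeholders:

```python
# Hedged sketch: statements for compacting small files in a Delta table,
# intended to be executed via spark.sql(...) on Databricks.
from typing import Optional

def compaction_statements(table: str, predicate: Optional[str] = None) -> list:
    optimize = f"OPTIMIZE {table}"
    if predicate:
        # Restrict each pass to a partition range to bound the work done.
        optimize += f" WHERE {predicate}"
    # VACUUM removes files no longer referenced by the table, after the
    # default 7-day retention window.
    return [optimize, f"VACUUM {table}"]

# e.g. compaction_statements("prod.events", "event_date >= '2024-01-01'")
```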
Sometimes, if we have fewer commit versions for a Delta table, it won't create checkpoint files in the table. The checkpoint file is responsible for triggering the log cleanup activities. In case you observe that there are no checkpoint files available for th...
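By default, Delta writes a checkpoint every 10 commits; the `delta.checkpointInterval` table property controls that spacing, so lowering it makes checkpoints (and therefore log cleanup) kick in sooner on low-traffic tables. A hedged sketch of a helper that builds the `ALTER TABLE` statement to run via `spark.sql(...)`:

```python
# Sketch: set delta.checkpointInterval so checkpoints are written after
# fewer commits on a table that rarely changes.
def set_checkpoint_interval(table: str, every_n_commits: int = 10) -> str:
    return (
        f"ALTER TABLE {table} SET TBLPROPERTIES "
        f"('delta.checkpointInterval' = '{every_n_commits}')"
    )

# e.g. spark.sql(set_checkpoint_interval("my_db.low_traffic_table", 5))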
Is anyone familiar with installing the Datadog agent on clusters? We're not having much luck. We honestly might not be getting the init script to run, since we're not seeing it in the log, but we can get just a generic "hello world" init script to run a...
Responding here with the solution I found. Hopefully it'll help anyone with similar issues. First, the Datadog install script is practically a matryoshka doll: the script creates another script, which creates a YAML file. One of the consequences of that...
Hello, I have a Job A that runs a Job B. Job A defines a globalTempView, and I would like to somehow access it in the child job. Is that in any way possible? Can the same cluster be used for both jobs? If it is not possible, does someone know of a...
Hi @ranged_coop, Yes, we are using the same job compute across different workflows. But I think different tasks are like different Docker containers, so that is why it becomes an issue. It would be nice if you could explain a bit about the approach yo...
Hi, I got this error "com.databricks.WorkflowException: com.databricks.common.client.DatabricksServiceHttpClientException: DEADLINE_EXCEEDED" during the run of a job workflow with an interactive cluster, right at the start of it. It's a job that has been ...
I'm worried about how much the Databricks AI assistant will cost me.I need to understand what I'll be charged for, especially when I give a prompt to the AI Assistant Pane and how it will operate in the background.
I have workflows with multiple tasks, each of which needs 5 different libraries to run. When I have to update those libraries, I have to go in and make the update in each and every task. So for one workflow I have 20 different places where I have to g...
Actually I think I found most of a solution here in one of the replies: https://community.databricks.com/t5/administration-architecture/installing-libraries-on-job-clusters/m-p/37365/highlight/true#M245It seems like I only have to define libs for the...
Hi Mates! I'm trying to get some data from a SQL Server using a query; the query has a WITH statement, but I'm getting the following error: raise convert_exception(pyspark.errors.exceptions.connect.SparkConnectGrpcException: (com.microsoft.sqlserver.jdb...
Hi @Jreco,
You need to use the prepareQuery option and then query like below:

url = "jdbc:sqlserver://server_name:1433;database=db_name"
df = spark.read \
    .format("jdbc") \
    .option("url", url) \
    .option("prepareQuery", "with cte as ( SELECT ...