Data Engineering

Forum Posts

Sorted by:

Start a conversation

by Mahesh777k • New Contributor

01-10-2023 5:08:54 PM

3163 Views
2 replies
2 kudos

How to delete duplicate tables?

Hi Everyone,Accidently imported duplicate tables, guide me how to delete themusing data bricks community edition

Data Engineering

3163 Views
2 replies
2 kudos

01-10-2023 5:08:54 PM

View Replies

Latest Reply

UmaMahesh1
Honored Contributor III

01-11-2023 6:30:06 AM

2 kudos

Hi @Mahesh Babu Uppala You can use the following method to delete only the duplicate tables%scala val tables = spark.sql("""SHOW TABLES""").createOrReplaceTempView("tables") val temp_tables = spark.sql("""select tableName from tables where tableName...

2 kudos

01-11-2023 6:30:06 AM

1 More Replies

by labtech • Valued Contributor II

01-08-2023 2:10:27 AM

6303 Views
4 replies
18 kudos

Resolved! Limit resource when create cluster in Databricks on AWS platform

Hi team,Could you please help check on my case? I always failed at this step Thanks

Data Engineering

6303 Views
4 replies
18 kudos

01-08-2023 2:10:27 AM

View Replies

Latest Reply

labtech
Valued Contributor II

01-11-2023 5:52:01 AM

18 kudos

Thanks all your answer. The problem come from AWS side. Don't know why the first ticket they said that the issue didn't come from AWS

18 kudos

01-11-2023 5:52:01 AM

3 More Replies

by jamesw • New Contributor II

01-10-2023 6:04:31 PM

3432 Views
1 replies
1 kudos

Ganglia not working with custom container services

Setup:custom docker container starting from the "databricksruntime/gpu-conda:cuda11" base image layer10.4 LTS (includes Apache Spark 3.2.1, Scala 2.12)multi-node, p3.8xlarge GPU computeWhen I try to view Ganglia metrics I am met with "502 Bad Gatewa...

Data Engineering

3432 Views
1 replies
1 kudos

01-10-2023 6:04:31 PM

View Replies

Latest Reply

Vivian_Wilfred
Databricks Employee

01-11-2023 4:38:38 AM

1 kudos

Hi @James W , Ganglia is not available for custom docker containers by default. This is a known limitation. However, you can try this experimental support for ganglia in custom DCS:https://github.com/databricks/containers/tree/master/experimental/ub...

1 kudos

01-11-2023 4:38:38 AM

by Dinu2 • New Contributor III

01-10-2023 2:36:20 PM

2850 Views
1 replies
1 kudos

base64 encode is not matching with Oracle's base64 encode

Hi , base64 encode is not matching with Oracle's base64 encode. please see below result. Could anyone help me on this?In Azure Databricks: encoded= base64.b64encode(b'952B8D04E5CFB9BE')output is - b'OTUyQjhEMDRFNUNGQjlCRQ=='In Oracle: select utl_enco...

Data Engineering

2850 Views
1 replies
1 kudos

01-10-2023 2:36:20 PM

View Replies

Latest Reply

daniel_sahal
Esteemed Contributor

01-11-2023 1:11:53 AM

1 kudos

Oracle handles base64 encoding a little bit differently.Please check this link to understand what's the difference:https://dba.stackexchange.com/a/129134

1 kudos

01-11-2023 1:11:53 AM

by preetham333 • New Contributor II

12-28-2022 5:32:29 AM

2245 Views
3 replies
4 kudos

Did not received badge

I have completed my data bricks lakehouse fundamentals but did not received badge. Please help in this issue.

Data Engineering

2245 Views
3 replies
4 kudos

12-28-2022 5:32:29 AM

View Replies

Latest Reply

Anonymous
Not applicable

01-10-2023 10:54:57 PM

4 kudos

Hi @kalle preetham Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Tha...

4 kudos

01-10-2023 10:54:57 PM

2 More Replies

by prashant7sep • New Contributor II

12-02-2022 12:11:51 AM

4986 Views
7 replies
5 kudos

Lakehouse Fundamentals Accreditation badge not received

Lakehouse Fundamentals Accreditation badge not receivedI just passed the Lakehouse Fundamentals Accreditation at https://partner-academy.databricks.com/ and I haven't received my badge yet and cant find the credentials. Please advise.

Data Engineering

4986 Views
7 replies
5 kudos

12-02-2022 12:11:51 AM

View Replies

Latest Reply

Anonymous
Not applicable

01-10-2023 10:46:05 PM

5 kudos

Hi @Prashant Singh Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Tha...

5 kudos

01-10-2023 10:46:05 PM

6 More Replies

by Mado • Valued Contributor II

01-10-2023 4:20:55 PM

4407 Views
0 replies
1 kudos

How to get a snapshot of a streaming delta table as a static table?

Hi,Assume that I have a streaming delta table. Is there any way to get snapshot of the streaming table as a static table?Reason is that I need to join this streaming table with a static table by:output = output.join(country_information, ["Country"], ...

Data Engineering

4407 Views
0 replies
1 kudos

01-10-2023 4:20:55 PM

by Hubert-Dudek • Esteemed Contributor III

03-28-2022 11:29:41 AM

3015 Views
1 replies
22 kudos

How to process files from the internet in databricks? "spark.sparkContext.addFile" download file to HDFS directory. "SparkFiles.get&quo...

How to process files from the internet in databricks?"spark.sparkContext.addFile" download file to HDFS directory. "SparkFiles.get" return the path and the name. However, as Databricks use the DBFS file system, we need to add the "file:///" prefix to...

Data Engineering

3015 Views
1 replies
22 kudos

03-28-2022 11:29:41 AM

View Replies

Latest Reply

Matt101122
Contributor II

01-10-2023 2:24:36 PM

22 kudos

@Hubert Dudek Do you know if addFile should work with abfss:// path? Trying to add a file from azure data lake with external location in unity catalog.

22 kudos

01-10-2023 2:24:36 PM

by jeremy1 • New Contributor II

05-17-2022 12:57:47 PM

18939 Views
9 replies
7 kudos

DLT and Modularity (best practices?)

I have [very] recently started using DLT for the first time. One of the challenges I have run into is how to include other "modules" within my pipelines. I missed the documentation where magic commands (with the exception of %pip) are ignored and was...

Data Engineering

18939 Views
9 replies
7 kudos

05-17-2022 12:57:47 PM

View Replies

Latest Reply

Greg_Galloway
New Contributor III

10-20-2022 11:14:13 AM

7 kudos

I like the approach @Arvind Ravish shared since you can't currently use %run in DLT pipelines. However, it took a little testing to be clear on how exactly to make it work. First, ensure in the Admin Console that the repos feature is configured as f...

7 kudos

10-20-2022 11:14:13 AM

8 More Replies

by databicky • Contributor II

01-10-2023 4:34:21 AM

1734 Views
1 replies
1 kudos

how to add the title excelsheet with python

i want to write title with some combination of rows in pandas df, and write into excel sheet. i tried some method but i could see styler object is not subscriptable

Data Engineering

1734 Views
1 replies
1 kudos

01-10-2023 4:34:21 AM

View Replies

Latest Reply

Chaitanya_Raju
Honored Contributor

01-10-2023 6:01:28 AM

1 kudos

Hi @Mohammed sadamusean ,Can you please share the sample input and sample expected output, so that we can try on our end and can let you know.Happy Learning!!

1 kudos

01-10-2023 6:01:28 AM

by ACP • New Contributor III

01-09-2023 1:43:00 AM

9803 Views
5 replies
0 kudos

Screenshot 2023-01-09 094039

Hey guys,Databricks academy login is not working. I have been trying for the past 1 hour and still doesn't work. It seems to be with the Databricks https certificate being expired but not sure. I'm attaching an image with the error. Any help with thi...

Data Engineering

9803 Views
5 replies
0 kudos

01-09-2023 1:43:00 AM

View Replies

Latest Reply

Chaitanya_Raju
Honored Contributor

01-10-2023 5:51:47 AM

0 kudos

Hi @Andre Paiva ,Can you please try now I can able to load both customer and partner academy websites, I think the Academy team has fixed the issue. Happy Learning!!

0 kudos

01-10-2023 5:51:47 AM

4 More Replies

by databicky • Contributor II

01-07-2023 7:04:08 AM

4066 Views
4 replies
3 kudos

Resolved! How to add a current date as suffix while using copy?

how to add a current date after filename suffix while copy from the dbutils like report20221223.xlsxdbutils.fs.cp('dbfs://temp/balancing/report.xlsx','abfss://con@adls/provsn/result/report.xlsx',True)i need to add the current date in the file like ...

Data Engineering

4066 Views
4 replies
3 kudos

01-07-2023 7:04:08 AM

View Replies

Latest Reply

Chaitanya_Raju
Honored Contributor

01-07-2023 9:03:44 AM

3 kudos

@Mohammed sadamusean hope the below code might help you, from datetime import datetime date_value = datetime.now().strftime("%Y%m%d") src = 'dbfs:/FileStore/Test/File.csv' trgt = f'dbfs:/FileStore/Test/File_{date_value}.csv' dbutils.fs.cp(src,t...

3 kudos

01-07-2023 9:03:44 AM

3 More Replies

by thibault • Contributor III

12-01-2022 6:32:38 AM

7916 Views
6 replies
0 kudos

Resolved! Monaco editor - Toggle line comment not working

I recently tried the new editor, and usual shortcuts like CTRL + / to comment is ineffective. Is this a known issue? It's working fine with the classic editor, so I am switching back to it in the meantime, but it would be great to use this new additi...

Data Engineering

7916 Views
6 replies
0 kudos

12-01-2022 6:32:38 AM

View Replies

Latest Reply

thibault
Contributor III

01-10-2023 3:54:36 AM

0 kudos

It has been fixed now, thanks!

0 kudos

01-10-2023 3:54:36 AM

5 More Replies

by Aviral-Bhardwaj • Esteemed Contributor III

01-07-2023 8:19:37 AM

1745 Views
1 replies
19 kudos

&#xd83d;&#xde00; Deltalake Vs Datalake in Databricks &#xd83d;&#xde00;Delta Lake Databricks Delta Lake is an open-source storage layer that sits on top of existing d...

Deltalake Vs Datalake in Databricks Delta Lake DatabricksDelta Lake is an open-source storage layer that sits on top of existing data lake storage, such as Azure Data Lake Store or Amazon S3. It provides a more robust and scalable alternative to tra...

Data Engineering

1745 Views
1 replies
19 kudos

01-07-2023 8:19:37 AM

View Replies

by rubenteixeira • New Contributor III

01-09-2023 7:16:03 AM

7512 Views
2 replies
0 kudos

Can't parallelize model training with sc.parallelize, even tough I can run the same code without parallelizing

I'm training a NeuralProphet for a time series forecasting problem. I'm trying to parallelize my training, but this error is appearingThe folder lightning_logs has a hparams.yaml but it's empty. Is this related to permissions on the cluster? Thanks i...

Data Engineering

7512 Views
2 replies
0 kudos

01-09-2023 7:16:03 AM

View Replies

Latest Reply

Debayan
Databricks Employee

01-09-2023 2:07:40 PM

0 kudos

Hi,Please let us know if this was checked already:

0 kudos

01-09-2023 2:07:40 PM

1 More Replies

Databricks Community

Forum Posts

How to delete duplicate tables?

Resolved! Limit resource when create cluster in Databricks on AWS platform

Ganglia not working with custom container services

base64 encode is not matching with Oracle's base64 encode

Did not received badge

Lakehouse Fundamentals Accreditation badge not received

How to get a snapshot of a streaming delta table as a static table?

How to process files from the internet in databricks? "spark.sparkContext.addFile" download file to HDFS directory. "SparkFiles.get&quo...

DLT and Modularity (best practices?)

how to add the title excelsheet with python

Screenshot 2023-01-09 094039

Resolved! How to add a current date as suffix while using copy?

Resolved! Monaco editor - Toggle line comment not working

&#xd83d;&#xde00; Deltalake Vs Datalake in Databricks &#xd83d;&#xde00;Delta Lake Databricks Delta Lake is an open-source storage layer that sits on top of existing d...

Can't parallelize model training with sc.parallelize, even tough I can run the same code without parallelizing

Join Us as a Local Community Builder!

Resource Throttling; Large Merge Operation - Recen...

Databricks Asset Bundles - High Level Diagrams Flo...

Delta live table not showing in workspace (Azure d...

Unable to install libraries from requirements.txt ...

Databricks Bundle Validation Error After CLI Upgra...