Data Engineering

Forum Posts

by Vikad • New Contributor II

02-23-2023 1:06:55 PM

1169 Views
5 replies
2 kudos

Databricks certification voucher not received

Hi team, I attended the webinar on 21st Feb 2023 and also earned the Lakehouse Fundamentals badge, yet I have not received any certification voucher from Databricks. Regards, Vikas

Latest Reply

Anonymous
Not applicable

03-16-2023 8:09:15 PM

2 kudos

Hi @Vikas Singh, thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs. Please help us select the best solution by clicking on "Select As Best" if it does. Your feedback ...

4 More Replies

by Hubert-Dudek • Esteemed Contributor III

03-14-2023 6:37:54 AM

973 Views
2 replies
12 kudos

Databricks now supports event-driven workloads, especially for loading cloud files from external locations. This means you can save costs and resources by triggering your Databricks jobs only when new files arrive in your cloud storage instead of mou...
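A minimal sketch of what a file-arrival trigger might look like in a job definition. The field names (`trigger`, `file_arrival`, `url`, `min_time_between_triggers_seconds`) follow my reading of the Jobs API 2.1 and should be checked against the current documentation; the job name, notebook path, storage URL, and cluster ID are hypothetical placeholders.

```python
import json


def file_arrival_job_settings(job_name, notebook_path, storage_url, cluster_id):
    """Build a job settings payload that runs a notebook when new files arrive.

    Assumption: the payload shape mirrors the Jobs API 2.1 "create job" request.
    """
    return {
        "name": job_name,
        "tasks": [
            {
                "task_key": "ingest_new_files",
                "notebook_task": {"notebook_path": notebook_path},
                "existing_cluster_id": cluster_id,
            }
        ],
        "trigger": {
            "pause_status": "UNPAUSED",
            "file_arrival": {
                # External location to watch (hypothetical URL).
                "url": storage_url,
                # Assumed throttle so bursts of files do not start a run each.
                "min_time_between_triggers_seconds": 60,
            },
        },
    }


settings = file_arrival_job_settings(
    job_name="ingest-on-arrival",                     # hypothetical
    notebook_path="/Repos/example/ingest",            # hypothetical
    storage_url="s3://example-bucket/landing/",       # hypothetical
    cluster_id="0000-000000-example",                 # hypothetical
)
print(json.dumps(settings, indent=2))
```

The printed JSON could then be sent to the job creation endpoint or pasted into the job's JSON editor, if the field names match your workspace's API version.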

Latest Reply

Vartika
Moderator

03-23-2023 12:44:25 AM

12 kudos

Hi @Hubert Dudek, we really appreciate you sharing this bit of information. Cheers!

1 More Replies

by Naveen_KumarMad • New Contributor III

02-27-2023 2:23:20 AM

5377 Views
13 replies
14 kudos

Resolved! How to find the last modified date of a notebook?

I would like to find the notebooks that are not required and not being used and then I can review and delete them. If there is a way to find last modified date of a notebook programmatically then I can get a list of notebooks, which I can review and ...

Latest Reply

Amit_352107
New Contributor III

03-23-2023 12:14:50 AM

14 kudos

Hi @Naveen Kumar Madas, you can go through the code block below:

%sh
ls -lt /dbfs/
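As far as I know, `/dbfs` exposes DBFS files rather than workspace notebooks, so listing it may not answer the original question. A rough alternative sketch, assuming the Workspace API list response returns objects with `object_type`, `path`, and a `modified_at` timestamp in epoch milliseconds (worth verifying for your workspace): the function below only filters an already-fetched list, and the sample objects are made up for illustration.

```python
from datetime import datetime, timedelta, timezone


def stale_notebooks(objects, older_than_days, now=None):
    """Return (path, modified) pairs for notebooks not modified recently.

    `objects` is assumed to be the "objects" list from a workspace listing,
    where `modified_at` is epoch milliseconds.
    """
    now = now or datetime.now(timezone.utc)
    cutoff = now - timedelta(days=older_than_days)
    stale = []
    for obj in objects:
        # Skip directories, libraries, and entries without a timestamp.
        if obj.get("object_type") != "NOTEBOOK" or "modified_at" not in obj:
            continue
        modified = datetime.fromtimestamp(obj["modified_at"] / 1000, tz=timezone.utc)
        if modified < cutoff:
            stale.append((obj["path"], modified))
    # Oldest first, so review candidates appear at the top.
    return sorted(stale, key=lambda item: item[1])


def to_ms(moment):
    """Convert a timezone-aware datetime to epoch milliseconds."""
    return int(moment.timestamp() * 1000)


# Hypothetical listing, shaped like the assumed API response.
sample = [
    {"object_type": "NOTEBOOK", "path": "/Users/someone@example.com/old_report",
     "modified_at": to_ms(datetime(2022, 1, 1, tzinfo=timezone.utc))},
    {"object_type": "NOTEBOOK", "path": "/Users/someone@example.com/recent_work",
     "modified_at": to_ms(datetime(2023, 3, 20, tzinfo=timezone.utc))},
    {"object_type": "DIRECTORY", "path": "/Users/someone@example.com/archive"},
]

for path, modified in stale_notebooks(sample, older_than_days=90):
    print(path, modified.date())
```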

12 More Replies

by wyzer • Contributor II

01-11-2022 6:50:05 AM

28567 Views
15 replies
7 kudos

Resolved! What's the equivalent of "DECLARE..." in Databricks SQL ?

Hello everyone, I'm new to Databricks SQL, and I'm coming from SQL Server. I would like to know the equivalent of:

DECLARE @P_Name varchar(50) = 'BackOffice'

It's for use like this:

CREATE DATABASE @P_Name

Thanks.

Latest Reply

Amit_352107
New Contributor III

03-23-2023 12:04:26 AM

7 kudos

Hi @Salah K., you can go through this code block:

%python
P_Name = 'BackOffice'
spark.sql(f""" create database {P_Name} """)
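A small sketch building on the reply above. Because the name is interpolated into the SQL text, it may be worth validating it first. The helper `create_database_sql` and its identifier rule are illustrative assumptions, not part of any Databricks API, and the `spark.sql` call is left as a comment because it needs a notebook session.

```python
import re

# Assumed rule: plain identifiers only (letters, digits, underscore).
_IDENTIFIER = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def create_database_sql(name):
    """Return a CREATE DATABASE statement for a validated identifier."""
    if not _IDENTIFIER.fullmatch(name):
        raise ValueError(f"not a simple identifier: {name!r}")
    # Backticks quote the identifier in Spark SQL.
    return f"CREATE DATABASE IF NOT EXISTS `{name}`"


P_Name = "BackOffice"
statement = create_database_sql(P_Name)
print(statement)

# In a Databricks notebook, where `spark` is predefined:
# spark.sql(statement)
```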

14 More Replies

by StephanieRivera • Valued Contributor II

03-22-2023 7:30:08 AM

2870 Views
4 replies
2 kudos

Resolved! How do I download and unzip datasets from Kaggle into DBFS?

Latest Reply

Debayan
Esteemed Contributor III

03-22-2023 10:43:00 PM

2 kudos

Hi, you can refer to https://docs.databricks.com/files/unzip-files.html. You can curl the file you want, and then it can be unzipped as mentioned in the doc. Please let us know if this helps. Also, please tag @Debayan with your next update which will n...
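For the unzip step, a minimal sketch using only the Python standard library rather than the shell commands in the linked doc. It assumes the archive has already been downloaded to a path the driver can read; the demo builds a tiny archive in a temporary directory so it runs anywhere, and on Databricks the target could presumably be a directory under `/dbfs/` (an assumption about your setup).

```python
import tempfile
import zipfile
from pathlib import Path


def unzip_to(zip_path, target_dir):
    """Extract every member of `zip_path` into `target_dir`; return member names."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target)
        return sorted(archive.namelist())


# Self-contained demo with a throwaway archive (file names are made up).
with tempfile.TemporaryDirectory() as workdir:
    demo_zip = Path(workdir) / "dataset.zip"
    with zipfile.ZipFile(demo_zip, "w") as archive:
        archive.writestr("train.csv", "id,label\n1,0\n")
        archive.writestr("test.csv", "id\n2\n")
    extracted = unzip_to(demo_zip, Path(workdir) / "unzipped")
    print(extracted)
```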

3 More Replies

by RC • Contributor

03-22-2023 6:53:14 AM

717 Views
2 replies
2 kudos

Not able to create a unity metastore in a specified region

Hi Team, I'm not able to create a metastore in a region (us-east-1). It tells me that "This region already contains a metastore. Only a single metastore is allowed per region". But we don't have any metastore. Earlier we had one metastore we had deleted...

Latest Reply

karthik_p
Esteemed Contributor

03-22-2023 9:18:27 AM

2 kudos

@Rajath C, can you please re-check whether it has been properly deleted and whether the old one is still tied to any workspaces? Also try deleting that storage if no data exists, then retry.

1 More Replies

by User16691272604 • New Contributor II

03-20-2023 12:43:29 AM

513 Views
1 replies
2 kudos

Flattening Nested XML in Databricks

Flattening Nested XML - Multiple ways

Latest Reply

Anonymous
Not applicable

03-22-2023 9:49:30 PM

2 kudos

Thanks for this and educating the community members about it.

by repcak • New Contributor III

03-13-2023 9:37:15 AM

1456 Views
1 replies
2 kudos

Init Scripts with mounted azure data lake storage gen2

I'm trying to access an init script that is stored on Azure Data Lake Storage Gen2 mounted to DBFS. I mounted the storage to dbfs:/mnt/storage/container/script.sh, and when I try to access it I get an error: Cluster scoped init script dbfs:/mnt/storage/containe...

Latest Reply

User16752239289
Valued Contributor

03-22-2023 4:01:57 PM

2 kudos

I do not think an init script saved under a mount point works, and we do not suggest that. If you specify abfss, then the cluster needs to be configured so that it can authenticate and access the ADLS Gen2 folder. Otherwise, the cluster will no...

by Erik_L • Contributor II

03-17-2023 11:25:28 AM

3063 Views
1 replies
0 kudos

How to merge parquets with different column types

Problem: I have a directory in S3 with a bunch of data files, like "data-20221101.parquet". They all have the same columns: timestamp, reading_a, reading_b, reading_c. In the earlier files, the readings are floats, but in the later ones they are double...

Latest Reply

mathan_pillai
Valued Contributor

03-22-2023 3:26:15 PM

0 kudos

1) Can you let us know what the error message was when you don't set the schema and use mergeSchema?
2) What happens when you define the schema (with FloatType) and use mergeSchema? What error message do you get?
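While those questions are open, one possible workaround is to read each file separately, cast the reading columns to double, and union the results by column name. This is only a sketch under assumptions: `spark` is an active SparkSession passed in by the caller, the column names come from the question, and reading file by file may be slow for many files. The helper names are made up for illustration.

```python
from functools import reduce

# Column names as described in the question.
READING_COLUMNS = ["reading_a", "reading_b", "reading_c"]


def cast_exprs(reading_columns=READING_COLUMNS):
    """Build selectExpr strings that keep timestamp and widen readings to DOUBLE."""
    return ["`timestamp`"] + [
        f"CAST(`{column}` AS DOUBLE) AS `{column}`" for column in reading_columns
    ]


def read_readings_as_double(spark, paths):
    """Read each Parquet file on its own, normalise types, then union by name.

    `paths` must contain at least one file path.
    """
    frames = [spark.read.parquet(path).selectExpr(*cast_exprs()) for path in paths]
    return reduce(lambda left, right: left.unionByName(right), frames)


print(cast_exprs())

# In a notebook (paths are hypothetical):
# df = read_readings_as_double(spark, ["s3://bucket/data-20221101.parquet"])
```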

by karthik_p • Esteemed Contributor

12-01-2022 8:52:55 PM

1787 Views
9 replies
8 kudos

Tool For Monitoring Security/Health of Databricks Workspace

For about a year we have been looking for a tool to monitor the health of a Databricks workspace in an automated way. We used to monitor the following things in the workspace manually: clusters, jobs, tables, ACL...

Latest Reply

karthik_p
Esteemed Contributor

03-22-2023 9:09:43 AM

8 kudos

@Harish Koduru @Arnold Souza, are you still seeing issues?

8 More Replies

by Sujitha • Community Manager

03-13-2023 7:31:12 PM

410 Views
1 replies
3 kudos

Data + AI Summit Virtual - Register Now! This year’s free virtual experience will include access to live-streamed keynotes, select sessions designed and led by data experts, as well as unlimited access to on-demand sessions soon after the live event....

Latest Reply

jose_gonzalez
Moderator

03-22-2023 2:32:58 PM

3 kudos

Thank you for sharing, @Sujitha Ramamoorthy!

by Hubert-Dudek • Esteemed Contributor III

03-07-2023 3:56:58 AM

738 Views
1 replies
6 kudos

Exciting news for #azure users! The #databricks runtime 12.2 has been officially released as a long-term support (LTS) version, providing a stable and reliable platform for users to build and deploy their applications. As part of this release, the en...

Latest Reply

jose_gonzalez
Moderator

03-22-2023 2:27:00 PM

6 kudos

Thank you for sharing, @Hubert Dudek!

by Hubert-Dudek • Esteemed Contributor III

03-09-2023 2:36:56 PM

753 Views
1 replies
7 kudos

Starting from #databricks 12.2 LTS, the explode function can be used in the FROM statement to manipulate data in new and powerful ways. This function takes an array column as input and returns a new row for each element in the array, offering new pos...

Latest Reply

jose_gonzalez
Moderator

03-22-2023 2:26:25 PM

7 kudos

Thank you for sharing @Hubert Dudek

by oriole • New Contributor III

03-19-2023 12:35:30 PM

2359 Views
5 replies
2 kudos

Resolved! Spark Driver Crash Writing Large Text

I'm working with a large text variable, converting it into single-line JSON that Spark can process beautifully. I'm using a single-node 256 GB, 32-core Standard_E32d_v4 "cluster", which should be plenty of memory for this dataset (haven't seen cluster memory u...

Latest Reply

pvignesh92
Honored Contributor

03-20-2023 8:46:30 AM

2 kudos

@David Toft Hi, the current implementation of dbutils.fs is single-threaded: it performs the initial listing on the driver and subsequently launches a Spark job to perform the per-file operations. So I guess the put operation is running on a single cor...

4 More Replies

by andrew0117 • Contributor

03-21-2023 4:35:19 PM

1036 Views
3 replies
2 kudos

Resolved! Will a table backed by a SQL server database table automatically get updated if the base table in SQL server database is updated?

If I create a table using the code below:

CREATE TABLE IF NOT EXISTS jdbcTable
USING org.apache.spark.sql.jdbc
OPTIONS (
  url "sql_server_url",
  dbtable "sqlserverTable",
  user "username",
  password "password"
)

will jdbcTable always be automatically sync...

Latest Reply

pvignesh92
Honored Contributor

03-22-2023 1:30:19 AM

2 kudos

Hi @andrew li, there is a feature introduced in DBR 11 where you can directly ingest data into a table from a selected list of sources. As you are creating a table, I believe this command will create a managed table by loading the data from the...

2 More Replies

