Data Engineering

Forum Posts

Sorted by:

Start a conversation

by Rags98 • New Contributor II

01-18-2024 1:28:39 PM

1590 Views
1 replies
0 kudos

Undrop a table from built-in catalogs Azure Databricks

How can I undrop a table from a built-in catalog in Azure Databricks

Data Engineering

Undrop

1590 Views
1 replies
0 kudos

01-18-2024 1:28:39 PM

View Replies

Latest Reply

Lakshay
Esteemed Contributor

01-19-2024 4:41:14 AM

0 kudos

If you are using Unity Catalog, you can simply run the UnDrop command. Ref Doc:- https://docs.databricks.com/en/sql/language-manual/sql-ref-syntax-ddl-undrop-table.html

0 kudos

01-19-2024 4:41:14 AM

by SenthilJ • New Contributor III

01-17-2024 5:49:19 AM

2175 Views
3 replies
5 kudos

Resolved! Unity Catalog and Data Accessibility

Hi,I got a few question about the internals of #Unity Catalog in #Databricks1. Understand that we can customize the UC metastore at different levels (catalog/schema). Wondering where is the information about UC permission model stored for every data ...

Data Engineering

Databricks

Unity Catalog

2175 Views
3 replies
5 kudos

01-17-2024 5:49:19 AM

View Replies

Latest Reply

Kaniz
Community Manager

01-19-2024 1:18:19 AM

5 kudos

Hi @SenthilJ, Unity Catalog manages access to data and other objects across workspaces. Access can be granted by either a metastore admin, an object's owner, or the catalog or schema that contains the object. When User Y queries the table “X-DB-Tabl...

5 kudos

01-19-2024 1:18:19 AM

2 More Replies

by Simon_T • New Contributor

01-17-2024 8:02:38 AM

1601 Views
1 replies
0 kudos

CURL API - Error while parsing token: io.jsonwebtoken.ExpiredJwtException: JWT expired

I am running this code:curl -X --request GET -H "Authorization: Bearer <databricks token>" "https://adb-1817728758721967.7.azuredatabricks.net/api/2.0/clusters/list"And I am getting this error:2024-01-17T13:21:41.4245092Z </head>2024-01-17T13:21:41.4...

Data Engineering

1601 Views
1 replies
0 kudos

01-17-2024 8:02:38 AM

View Replies

Latest Reply

Debayan
Esteemed Contributor III

01-18-2024 8:12:58 PM

0 kudos

Hi, Could you please renew the token and confirm?

0 kudos

01-18-2024 8:12:58 PM

by youssefmrini • Honored Contributor III

06-16-2023 4:35:23 AM

730 Views
2 replies
0 kudos

Resolved! Can I undrop a table in Databricks ?

Data Engineering

730 Views
2 replies
0 kudos

06-16-2023 4:35:23 AM

View Replies

Latest Reply

youssefmrini
Honored Contributor III

06-16-2023 4:35:50 AM

0 kudos

yes you can use the undrop command.Watch the video to know more : https://www.youtube.com/watch?v=NZMhG25mg2E&feature=youtu.be

0 kudos

06-16-2023 4:35:50 AM

1 More Replies

by Ambesh • New Contributor III

01-04-2024 2:42:38 PM

2479 Views
6 replies
1 kudos

Reading external Iceberg table

Hi all, I am trying to Read an external Iceberg table. A separate spark sql script creates my Iceberg table and now i need to read the Iceberg tables(created outside of databricks) from my Databricks notebook. Could someone tell me the approach for ...

Data Engineering

2479 Views
6 replies
1 kudos

01-04-2024 2:42:38 PM

View Replies

Latest Reply

Ambesh
New Contributor III

01-18-2024 1:06:49 PM

1 kudos

Hi @Kaniz yes the iceberg table does not exist in the default catalog because its created externally(outside of Databricks) by a separate spark sql script. The catalog it uses is Glue catalog. The ques is how can i access that external iceberg table...

1 kudos

01-18-2024 1:06:49 PM

5 More Replies

by jborn • New Contributor III

01-11-2024 9:14:55 AM

5424 Views
7 replies
1 kudos

Resolved! Connecting an Azure Databricks to Azure Gen 2 storage stuck on "Running Command..."

I recently had an Azure Databricks setup done behind a VPN. I'm trying to connect to my Azure Storage Account Gen 2 Using the following code I haven't been able to connect and keep getting stuck on reading the file. What should I be checking? #i...

Data Engineering

5424 Views
7 replies
1 kudos

01-11-2024 9:14:55 AM

View Replies

Latest Reply

jborn
New Contributor III

01-18-2024 11:45:45 AM

1 kudos

I ended up opening a ticket with Microsoft support about this issue and they walked us through the debugging on the issue. In the end the route table was not attached to the subnet. Once attached everything worked.

1 kudos

01-18-2024 11:45:45 AM

6 More Replies

by VJ3 • New Contributor III

01-17-2024 2:22:37 PM

2921 Views
3 replies
2 kudos

Best Practice to use/implement SQL Persona using Azure Databricks

Hello,I am looking for details of Security Controls to use/implement SQL Persona using Azure Databricks.

Data Engineering

2921 Views
3 replies
2 kudos

01-17-2024 2:22:37 PM

View Replies

Latest Reply

Debayan
Esteemed Contributor III

01-18-2024 11:32:01 AM

2 kudos

Hi, There are several documents for the same and can be followed, let me know if the below helps. https://learn.microsoft.com/en-us/answers/questions/1039176/whitelist-databricks-to-read-and-write-into-azure https://www.databricks.com/blog/2020/03/2...

2 kudos

01-18-2024 11:32:01 AM

2 More Replies

by Twilight • New Contributor III

02-27-2023 9:01:58 AM

2333 Views
5 replies
3 kudos

Resolved! Bug - Databricks requires extra escapes in repl string in regexp_replace (compared to Spark)

In Spark (but not Databricks), these work:regexp_replace('1234567890abc', '^(?<one>\\w)(?<two>\\w)(?<three>\\w)', '$3$2$1') regexp_replace('1234567890abc', '^(?<one>\\w)(?<two>\\w)(?<three>\\w)', '${three}${two}${one}')In Databricks, you have to use ...

Data Engineering

2333 Views
5 replies
3 kudos

02-27-2023 9:01:58 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-13-2023 4:54:53 AM

3 kudos

@Stephen Wilcoxon : No, it is not a bug. Databricks uses a different flavor of regular expression syntax than Apache Spark. In particular, Databricks uses Java's regular expression syntax, whereas Apache Spark uses Scala's regular expression syntax....

3 kudos

03-13-2023 4:54:53 AM

4 More Replies

by ChristianRRL • Contributor

01-18-2024 8:35:56 AM

1441 Views
1 replies
1 kudos

Resolved! DLT Bronze: Incremental File Updates

Hi there, I would like to clarify if there's a way for bronze data to be ingested from "the same" CSV file if the file has been modified (i.e. new file with new records overwriting the old file)? Currently in my setup my bronze table is a `streaming ...

Data Engineering

1441 Views
1 replies
1 kudos

01-18-2024 8:35:56 AM

View Replies

Latest Reply

Lakshay
Esteemed Contributor

01-18-2024 11:08:36 AM

1 kudos

You can use the option "cloudFiles.allowOverwrites" in DLT. This option will allow you to read the same csv file again but you should use it cautiously, as it can lead to duplicate data being loaded.

1 kudos

01-18-2024 11:08:36 AM

by otum • New Contributor II

01-10-2024 9:12:47 PM

1090 Views
6 replies
0 kudos

[Errno 2] No such file or directory

I am reading a Json a file as in below location, using the below code, file_path = "/dbfs/mnt/platform-data/temp/ComplexJSON/sample.json" # replace with the file path f = open(file_path, "r") print(f.read()) but it is failing for no such file...

Data Engineering

1090 Views
6 replies
0 kudos

01-10-2024 9:12:47 PM

View Replies

Latest Reply

Debayan
Esteemed Contributor III

01-18-2024 10:51:00 AM

0 kudos

Hi, As Shan mentioned, could you please cat the file and see if it exists.

0 kudos

01-18-2024 10:51:00 AM

5 More Replies

by MYB24 • New Contributor III

01-10-2024 8:24:06 AM

3666 Views
4 replies
0 kudos

Resolved! Error: cannot create mws credentials: invalid Databricks Account configuration

Good Evening, I am configuring databricks_mws_credentials through Terraform on AWS. I am getting the following error:Error: cannot create mws credentials: invalid Databricks Account configuration││ with module.databricks.databricks_mws_credentials.t...

Data Engineering

AWS

credentials

Databricks

Terraform

3666 Views
4 replies
0 kudos

01-10-2024 8:24:06 AM

View Replies

Latest Reply

MYB24
New Contributor III

01-18-2024 8:49:50 AM

0 kudos

Managed to fix the issue by updating the provider.tf while. Had to create a Service Principle token and add that into my provider.tf file. provider "databricks" {alias = "accounts"host = "https://accounts.cloud.databricks.com"client_id = "service-pri...

0 kudos

01-18-2024 8:49:50 AM

3 More Replies

by dbx_687_3__1b3Q • New Contributor III

10-18-2023 3:28:01 PM

2851 Views
4 replies
3 kudos

Databricks Asset Bundle (DAB) from a Git repo?

My earlier question was about creating a Databricks Asset Bundle (DAB) from an existing workspace. I was able to get that working but after further consideration and some experimenting, I need to alter my question. My question is now "how do I create...

Data Engineering

2851 Views
4 replies
3 kudos

10-18-2023 3:28:01 PM

View Replies

Latest Reply

erima21
New Contributor II

12-01-2023 11:58:34 AM

3 kudos

This is how I solved this:. Hope this works for you.- bundle:- jobs:

3 kudos

12-01-2023 11:58:34 AM

3 More Replies

by ac0 • New Contributor III

01-11-2024 9:32:35 AM

1406 Views
4 replies
0 kudos

Resolved! Setting environment variables to use in a SQL Delta Live Table Pipeline

I'm trying to use the Global Init Scripts in Databricks to set an environment variable to use in a Delta Live Table Pipeline. I want to be able to reference a value passed in as a path versus hard coding it. Here is the code for my pipeline:CREATE ST...

Data Engineering

1406 Views
4 replies
0 kudos

01-11-2024 9:32:35 AM

View Replies

Latest Reply

ac0
New Contributor III

01-18-2024 7:06:11 AM

0 kudos

I was able to accomplish this by creating a Cluster Policy that put in place the scripts, config settings, and environment variables I needed.

0 kudos

01-18-2024 7:06:11 AM

3 More Replies

by ChrisS • New Contributor III

06-17-2023 3:53:43 AM

2083 Views
7 replies
8 kudos

How to get data scraped from the web into your data storage

I learning data bricks for the first time following the book that is copywrited in 2020 so I imagine it might be a little outdated at this point. What I am trying to do is move data from an online source (in this specific case using shell script but ...

Data Engineering

2083 Views
7 replies
8 kudos

06-17-2023 3:53:43 AM

View Replies

Latest Reply

CharlesReily
New Contributor III

01-18-2024 6:53:03 AM

8 kudos

In Databricks, you can install external libraries by going to the Clusters tab, selecting your cluster, and then adding the Maven coordinates for Deequ. This represents the best b2b data enrichment services in Databricks.In your notebook or script, y...

8 kudos

01-18-2024 6:53:03 AM

6 More Replies

by aockenden • New Contributor III

01-10-2024 5:56:36 AM

707 Views
3 replies
0 kudos

Switching SAS Tokens Mid-Script With Spark Dataframes

Hey all, my team has settled on using directory-scoped SAS tokens to provision access to data in our Azure Gen2 Datalakes. However, we have encountered an issue when switching from a first SAS token (which is used to read a first parquet table in the...

Data Engineering

707 Views
3 replies
0 kudos

01-10-2024 5:56:36 AM

View Replies

Latest Reply

Kaniz
Community Manager

01-18-2024 3:23:42 AM

0 kudos

Hi @aockenden, The data in the Data Lake is not actually retrieved into cluster memory by the Spark dataframes until an action (like .show()) is executed. At this point, the fs.azure.sas.fixed.token Spark configuration setting has been switched to a ...

0 kudos

01-18-2024 3:23:42 AM

2 More Replies

User

Count

1603

736

344

284

247

Databricks

Forum Posts

Undrop a table from built-in catalogs Azure Databricks

Resolved! Unity Catalog and Data Accessibility

CURL API - Error while parsing token: io.jsonwebtoken.ExpiredJwtException: JWT expired

Resolved! Can I undrop a table in Databricks ?

Reading external Iceberg table

Resolved! Connecting an Azure Databricks to Azure Gen 2 storage stuck on "Running Command..."

Best Practice to use/implement SQL Persona using Azure Databricks

Resolved! Bug - Databricks requires extra escapes in repl string in regexp_replace (compared to Spark)

Resolved! DLT Bronze: Incremental File Updates

[Errno 2] No such file or directory

Resolved! Error: cannot create mws credentials: invalid Databricks Account configuration

Databricks Asset Bundle (DAB) from a Git repo?

Resolved! Setting environment variables to use in a SQL Delta Live Table Pipeline

How to get data scraped from the web into your data storage

Switching SAS Tokens Mid-Script With Spark Dataframes

DLT table not picked in python notebook

Load multiple delta tables at once from Sql server

Starting Serverless sql cluster on GCP

"Can't login to databricks socket is closed" when ...

Temporary views no longer working for Share Comput...