Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Data + AI Summit 2024 - Data Engineering & Streaming

Forum Posts

missyT
by New Contributor III
  • 1414 Views
  • 3 replies
  • 4 kudos

Resolved! AI assistant and machine Learning

I am looking to create a basic virtual assistant (AI) that implements machine learning mechanisms. I have some basic knowledge of Python, and I have seen some courses on the internet (YouTube in particular) that look very interesting. But for the moment...

Latest Reply
valeryuaba
New Contributor III
  • 4 kudos

Hey everyone! I'm really excited about this topic since I'm a huge fan of AI assistants and machine learning. MissyT, creating a basic virtual assistant with machine learning capabilities is an excellent idea! With your basic knowledge of Python and...

2 More Replies
Data4
by New Contributor II
  • 2309 Views
  • 1 reply
  • 5 kudos

Resolved! Load multiple Delta tables at once from SQL Server

What’s the best way to efficiently move multiple SQL Server tables in parallel into Delta tables?

Latest Reply
Tharun-Kumar
Honored Contributor II
  • 5 kudos

@Data4   To enable parallel read and write operations, the ThreadPool functionality can be leveraged. This process involves specifying a list of tables that need to be read, creating a method for reading these tables from the JDBC source and saving t...

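Tharun-Kumar's ThreadPool approach can be sketched roughly as follows; the table names and the commented-out JDBC/Delta calls are hypothetical placeholders, not the exact code from the truncated reply:

```python
# A minimal sketch of parallel table copies with ThreadPool.
# Table list, JDBC URL, and target names are hypothetical.
from multiprocessing.pool import ThreadPool

tables = ["dbo.customers", "dbo.orders", "dbo.products"]

def copy_table(table):
    # In a Databricks notebook this body would be a JDBC read + Delta write:
    # (spark.read.format("jdbc")
    #      .option("url", jdbc_url)
    #      .option("dbtable", table)
    #      .load()
    #      .write.format("delta")
    #      .mode("overwrite")
    #      .saveAsTable(table.split(".")[-1]))
    return table.split(".")[-1]  # target table name, for illustration

# Threads work well here because the heavy lifting happens on the cluster,
# not in the driver's Python process.
with ThreadPool(processes=4) as pool:
    results = pool.map(copy_table, tables)
```

Sizing the pool to the number of concurrent JDBC connections the source server can comfortably serve is usually the main tuning knob.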
erigaud
by Honored Contributor
  • 4386 Views
  • 5 replies
  • 6 kudos

Resolved! SFTP Autoloader

Hello! I am wondering if it is possible to ingest files from an SFTP server using Auto Loader, or do I have to first copy the files to DBFS and then use Auto Loader on that location? Thank you!

Latest Reply
Anonymous
Not applicable
  • 6 kudos

Hi @erigaud We haven't heard from you since the last response from @BriceBuso, and I was checking back to see if their suggestions helped you. Otherwise, if you have found a solution, please share it with the community, as it can be helpful to others. Al...

4 More Replies
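Auto Loader reads from cloud storage paths, so a common pattern for the question above is a two-step flow: copy files from the SFTP server into a landing location first, then point Auto Loader at it. A minimal sketch, with hypothetical paths and options:

```python
# Hypothetical landing path where an SFTP sync step (e.g. paramiko, ADF, or
# a managed file-transfer service) drops the files.
landing_path = "dbfs:/mnt/landing/sftp"

def autoloader_options(file_format, schema_path):
    # Options typically passed to spark.readStream.format("cloudFiles")
    return {
        "cloudFiles.format": file_format,
        "cloudFiles.schemaLocation": schema_path,
    }

opts = autoloader_options("csv", "dbfs:/mnt/landing/_schema")

# In a notebook:
# df = (spark.readStream.format("cloudFiles")
#           .options(**opts)
#           .load(landing_path))
```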
BriceBuso
by Contributor II
  • 3790 Views
  • 3 replies
  • 3 kudos

Run multiple % commands in the same cell

Hello, is there a way to run multiple % commands in the same cell? I heard that's not possible, but I would like confirmation, and maybe it could be an idea for future updates. Moreover, is there a way to mask the output of cells (especially markdown) w...

Latest Reply
Anonymous
Not applicable
  • 3 kudos

Hi @BriceBuso Hope all is well! Just wanted to check in to see if you were able to resolve your issue. If so, would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you. Thanks...

2 More Replies
bchaubey
by Contributor II
  • 2338 Views
  • 4 replies
  • 3 kudos

Data Pull from S3

I have some files in S3 that I want to process through Databricks. How is this possible? Could you please help me with this?

Latest Reply
dream
Contributor
  • 3 kudos

access_key = dbutils.secrets.get(scope = "aws", key = "aws-access-key")
secret_key = dbutils.secrets.get(scope = "aws", key = "aws-secret-key")
encoded_secret_key = secret_key.replace("/", "%2F")
aws_bucket_name = "<aws-bucket-name>"
mount_name = "<m...

3 More Replies
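The truncated reply above appears to follow the classic S3 mount pattern; here is a sketch of the same idea with hypothetical bucket and mount names (the key-encoding step is why the reply replaces "/" with "%2F"):

```python
from urllib.parse import quote

# Hypothetical values; in Databricks these come from dbutils.secrets.get(...)
access_key = "AKIAEXAMPLE"
secret_key = "abc/def"          # any "/" must be URL-encoded for the mount URI
encoded_secret_key = quote(secret_key, safe="")
aws_bucket_name = "my-bucket"
mount_name = "my-mount"

source = f"s3a://{access_key}:{encoded_secret_key}@{aws_bucket_name}"

# In a notebook:
# dbutils.fs.mount(source, f"/mnt/{mount_name}")
# display(dbutils.fs.ls(f"/mnt/{mount_name}"))
```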
baatchus
by New Contributor III
  • 4318 Views
  • 3 replies
  • 1 kudos

Deduplication, Bronze (raw) or Silver (enriched)

Need some help in choosing where to do deduplication of data. I have sensor data in blob storage that I'm picking up with Databricks Autoloader. The data and files can have duplicates in them. Which of the two options do I choose? Option 1: Cre...

Latest Reply
Tharun-Kumar
Honored Contributor II
  • 1 kudos

@peter_mcnally You can use a watermark to handle late records and send only the latest records to the bronze table. This will ensure that you always have the latest information in your bronze table. This feature is explained in detail here - https://w...

2 More Replies
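The watermark-based deduplication the reply points to looks roughly like the commented PySpark below (column names are hypothetical); the plain-Python function underneath illustrates the same keep-first-per-key idea:

```python
# Streaming form (hypothetical columns), as suggested in the reply:
# deduped = (raw_stream
#     .withWatermark("event_time", "10 minutes")
#     .dropDuplicates(["sensor_id", "event_time"]))

def drop_duplicates(records, keys):
    """Keep the first record seen for each key tuple."""
    seen, out = set(), []
    for rec in records:
        k = tuple(rec[key] for key in keys)
        if k not in seen:
            seen.add(k)
            out.append(rec)
    return out

rows = [
    {"sensor_id": 1, "event_time": "t1", "value": 10},
    {"sensor_id": 1, "event_time": "t1", "value": 10},  # duplicate
    {"sensor_id": 2, "event_time": "t1", "value": 7},
]
deduped = drop_duplicates(rows, ["sensor_id", "event_time"])
```

The watermark bounds how much state Spark keeps for the duplicate check, which is what makes this practical on an unbounded stream.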
anastassia_kor1
by New Contributor
  • 4728 Views
  • 2 replies
  • 1 kudos

Error "Distributed package doesn't have nccl built in" with Transformers Library.

I am trying to run a simple training script using HF's transformers library and am running into the error `Distributed package doesn't have nccl built in`. Runtime: DBR 13.0 ML - Spark 3.4.0 - Scala 2.12. Driver: i3.xlarge - 4 cores. Note: This is a...

Latest Reply
patputnam-db
New Contributor II
  • 1 kudos

Hi @anastassia_kor1, for CPU-only training, TrainingArguments has a no_cuda flag that should be set. For transformers==4.26.1 (MLR 13.0) and transformers==4.28.1 (MLR 13.1), there's an additional xpu_backend argument that needs to be set as well. Try u...

1 More Replies
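Based on the (truncated) reply, a CPU-only configuration might look like the sketch below; the `xpu_backend` value is an assumption, since the reply is cut off before naming one:

```python
# Hedged sketch: arguments intended to keep Trainer off the NCCL backend.
cpu_args = {
    "output_dir": "/tmp/out",   # hypothetical path
    "no_cuda": True,            # per the reply: skip CUDA/NCCL initialization
    "xpu_backend": "gloo",      # assumption; the reply truncates before the value
}

# In a notebook with transformers installed:
# from transformers import TrainingArguments
# training_args = TrainingArguments(**cpu_args)
```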
mshettar
by New Contributor II
  • 1491 Views
  • 2 replies
  • 0 kudos

Databricks CLI's workspace export_dir command adds unnecessary edits despite not making any change in the workspace

The databricks workspace export_dir / export command with the overwrite option enabled introduces spurious changes in the target directory: 1. newline deletions, and 2. additions/deletions of MAGIC comments, despite no meaningful changes having been made in th...

(Screenshot attached: 2023-06-06, 2.44.48 PM)
Latest Reply
RyanHager
Contributor
  • 0 kudos

I am encountering this issue as well and it did not happen previously.  Additionally, you see this pattern if you are using repos internally and make a change to a notebook in another section.

1 More Replies
Chakra
by New Contributor II
  • 1471 Views
  • 1 reply
  • 1 kudos

Create job cluster with a docker image in azure data factory

Is there a way to create a job cluster in Azure Data Factory with a Docker image, either through the API or the UI?

Latest Reply
m_szklarczyk
New Contributor II
  • 1 kudos

Has anyone figured out how to run a custom image as job compute from ADF?

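ADF's Databricks linked service does not expose a Docker image field, but a job cluster spec submitted through the Databricks Jobs API can carry one via Databricks Container Services. A sketch with hypothetical registry and node details:

```python
# Hypothetical job-cluster spec fragment; the docker_image block is what
# Databricks Container Services reads.
new_cluster = {
    "spark_version": "13.3.x-scala2.12",
    "node_type_id": "Standard_DS3_v2",
    "num_workers": 2,
    "docker_image": {
        "url": "myregistry.azurecr.io/my-image:latest",
        "basic_auth": {"username": "<user>", "password": "<token>"},
    },
}
```

One possible workaround from ADF is to call the Jobs API directly (for example, via a Web activity) with a spec like this, instead of relying on the linked service UI.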
Shay83
by New Contributor II
  • 1112 Views
  • 1 reply
  • 3 kudos

Resolved! Stream from a specific time

Hello, how should I start streaming a Delta table from a specific point in time?

Latest Reply
Lakshay
Esteemed Contributor
  • 3 kudos

If you are streaming from a Delta table, you can specify the starting version or timestamp. You can refer to the document for complete details: https://docs.databricks.com/structured-streaming/delta-lake.html#specify-initial-position

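The options Lakshay refers to look like this in practice (the table path and timestamp below are hypothetical):

```python
# Either option pins where the Delta stream starts reading.
opts = {"startingTimestamp": "2023-06-01T00:00:00.000Z"}
# or: opts = {"startingVersion": "5"}

# In a notebook:
# df = (spark.readStream.format("delta")
#           .options(**opts)
#           .load("/delta/events"))
```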
User16826992666
by Valued Contributor
  • 1676 Views
  • 5 replies
  • 1 kudos

When developing a Delta Live Table, can I test my code in the notebook?

Just wondering about the dev flow when building Delta Live Tables. I write my code in the notebook so it would be useful to be able to test it out from within that environment.

Latest Reply
Lakshay
Esteemed Contributor
  • 1 kudos

You need to create a DLT pipeline to test the code. 

4 More Replies
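One way to keep some of the feedback loop in the notebook, even though a pipeline is required to actually run DLT, is to factor the transformation logic into plain functions that can be exercised directly; a sketch with hypothetical names:

```python
def clean_events(rows):
    # Pure transformation that a DLT table function could delegate to,
    # testable in the notebook without a pipeline.
    return [r for r in rows if r.get("value") is not None]

# Inside the pipeline, a table definition would wrap the same logic:
# import dlt
# @dlt.table
# def events_clean():
#     return spark.read.table("events_raw").where("value IS NOT NULL")

sample = [{"value": 1}, {"value": None}]
checked = clean_events(sample)
```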
TonyLe
by New Contributor
  • 1419 Views
  • 2 replies
  • 1 kudos

IllegalArgumentException: requirement failed: Invalid uri

Hi all, I'm trying to connect to MongoDB using the Databricks notebook. I keep getting the error that my MongoDB uri is invalid. The uri works when connecting from my local machine using the Rust driver. I pretty much followed the tutorial that was g...

Latest Reply
-werners-
Esteemed Contributor III
  • 1 kudos

Also take in mind firewall issues in case the mongodb is on-prem or on some other location.

1 More Replies
tirato
by New Contributor II
  • 2075 Views
  • 3 replies
  • 2 kudos

Resolved! Cannot import-dir from AzureDevops, but works fine locally.

Hello, as I'm trying to create a CI/CD pipeline for the project, I'm finding myself stuck. I tried to upload the notebooks from my Azure DevOps release and I'm getting a 403 Forbidden error. I used `cat ~/.databrickscfg` and matched it with the local config that I...

Latest Reply
valeryuaba
New Contributor III
  • 2 kudos

Hey everyone! I can totally relate to the frustration of encountering authentication issues when setting up a CI/CD pipeline. It's great that you're able to import the notebooks locally, but facing difficulties on Azure DevOps can be quite puzzling.F...

2 More Replies
brickster_2018
by Esteemed Contributor
  • 2701 Views
  • 2 replies
  • 1 kudos

Resolved! Cluster Health Dashboard

Is there a cluster health dashboard that has the total number of running interactive clusters and the total number of job clusters? Also, one that flags clusters with issues.

Latest Reply
valeryuaba
New Contributor III
  • 1 kudos

Thanks!

1 More Replies