Community Platform Discussions

by Bhanu1 • New Contributor III

11-01-2023 1:47:58 PM

1551 Views
0 replies
0 kudos

Thoughts on how to improve string search queries

Please see the sample code I am running below. What options can I explore to improve the speed of query execution in such a scenario? The current full code takes about 4 hrs to run on 1.5 billion rows. Thanks! SELECT fullVisitorId, VisitId, EventDate, PagePath, d...

by AJ270990 • Contributor II

10-09-2023 3:09:58 AM

3423 Views
1 replies
0 kudos

API for Databricks code functionality

I have a Databricks notebook for which I want to create an API. From that API I will have to call the notebook and perform certain operations. The result will be sent back to the API. I don't want to do this via Postman, as someone has to install Postman at their ...
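One common route, sketched here as an assumption rather than as this thread's confirmed answer, is to wrap the notebook in a Databricks job and trigger it through the Jobs REST API (`POST /api/2.1/jobs/run-now`). The host, token, job ID, and parameter name below are hypothetical placeholders, and the sketch only builds the request without sending it.

```python
import json
import urllib.request


def build_run_now_request(host, token, job_id, notebook_params):
    """Build (but do not send) a Jobs API run-now request for a notebook job."""
    body = json.dumps({"job_id": job_id, "notebook_params": notebook_params})
    return urllib.request.Request(
        url=f"{host.rstrip('/')}/api/2.1/jobs/run-now",
        data=body.encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Hypothetical values, for illustration only.
request = build_run_now_request(
    host="https://example.cloud.databricks.com",
    token="<personal-access-token>",
    job_id=123,
    notebook_params={"input_date": "2023-10-09"},
)
# Passing `request` to urllib.request.urlopen would submit the run.
```

Any HTTP client can send the same request, so callers would not need Postman installed. The notebook would still need to be configured as a job task first.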

by Anonym • New Contributor II

10-25-2023 5:52:41 AM

1137 Views
1 replies
0 kudos

Error ingesting files with databricks jobs

The source path that I want to ingest files from is: "gs://bucket-name/folder1/folder2/*/*.json". I have a file in this path that ends with ".json.gz", and the Databricks job ingests this file even though it isn't supposed to. How can I fix it? Thanks.
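A minimal sketch of one way to guard against this, assuming the unwanted files can be told apart by their suffix: filter the candidate paths explicitly before reading them. The helper name and sample paths are hypothetical, and this is plain Python rather than a Databricks API. It does not explain why the job picked up the ".json.gz" file in the first place.

```python
def keep_plain_json(paths):
    """Return only paths that end in .json, so .json.gz files are dropped."""
    return [p for p in paths if p.lower().endswith(".json")]


# Hypothetical listing of the source folder.
candidates = [
    "gs://bucket-name/folder1/folder2/a/events.json",
    "gs://bucket-name/folder1/folder2/a/events.json.gz",
    "gs://bucket-name/folder1/folder2/b/users.json",
]
selected = keep_plain_json(candidates)
```

The filtered list could then be handed to whichever reader the job uses, provided that reader accepts an explicit list of paths.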

Latest Reply

Anonym
New Contributor II

11-01-2023 5:35:15 AM

0 kudos

Thanks Kaniz

by Jun_NN • New Contributor

10-31-2023 5:24:03 PM

9333 Views
0 replies
0 kudos

Deleted the s3 bucket associated with metastore

I deleted the AWS S3 bucket for the Databricks metastore by mistake. How can I fix this? Can I re-create the S3 bucket? Or can I delete the metastore (I don't have much data in it) and re-generate one? Thank you!

by Madhawa • New Contributor II

10-30-2023 1:18:17 AM

1690 Views
0 replies
0 kudos

org.apache.spark.SparkException - FileReadException

I sometimes get this kind of error: "org.apache.spark.SparkException: Job aborted due to stage failure: Task 1 in stage 12224.0 failed 4 times, most recent failure: Lost task 1.5 in stage 12224.0 (TID ) (12.xxx.x.xxx executor 1): com.datab...

by adrianhernandez • New Contributor III

10-26-2023 1:12:09 PM

2578 Views
3 replies
1 kudos

Add Oracle Jar to Databricks cluster policy

I created a policy for users to use when they create their own job clusters. When I'm editing the policy, I don't have the UI options for adding a library (I can only see the Definitions and Permissions tabs). I need to add via JSON the option to allows th...

Latest Reply

karthik_p
Esteemed Contributor

10-27-2023 3:52:22 AM

1 kudos

@adrianhernandez are you an admin of the workspace? If not, you might be missing permissions; if you have policies enabled, an admin can allow you. https://docs.databricks.com/en/administration-guide/clusters/policies.html#libraries If your workspace is Unity cat...

by thuovi • New Contributor II

10-27-2023 12:26:52 AM

1928 Views
0 replies
2 kudos

dbutils.fs.ls MAX_LIST_SIZE_EXCEEDED

Hi! I'm experiencing different behaviours between two DBX workspaces when trying to list file contents from an abfss: location. In workspace A, running len(dbutils.fs.ls('abfss://~~@~~~~.dfs.core.windows.net/~~/')) results in "Out[1]: 1551", while runni...

by llvu • New Contributor III

10-05-2023 7:33:23 AM

3116 Views
3 replies
1 kudos

getArgument works fine in interactive cluster 10.4 LTS, raises error in interactive cluster 10.4 LTS

Hello, I am trying to use the getArgument() function in a spark.sql query. It works fine if I run the notebook via an interactive cluster, but gives an error when executed via a job run in an instance pool. Query: OPTIMIZE <table> where date = replace(re...

Latest Reply

llvu
New Contributor III

10-25-2023 6:47:04 AM

1 kudos

Hi @Retired_mod, would you be able to respond to my last comment? I couldn't manage to get it working yet. Thank you in advance.

by AH • New Contributor III

10-24-2023 5:07:11 AM

2253 Views
1 replies
0 kudos

AWS Databricks VS AWS EMR

Hi, which service should I use for a data lake implementation? Is there any cost comparison between Databricks and AWS EMR? Which one is best to choose?

Latest Reply

karthik_p
Esteemed Contributor

10-24-2023 5:27:36 PM

0 kudos

@AH that depends on the use case. If your implementation involves data lake, ML, and data engineering tasks, it is better to go with Databricks, as it has a good UI and there is good governance through Unity Catalog for your data lake, and you have good consumer tool su...

by elgeo • Valued Contributor II

10-23-2023 3:31:33 AM

2215 Views
1 replies
1 kudos

Resolved! System billing usage table - Usage column

Hello experts, could someone please explain what exactly is contained in the usage column of the system.billing.usage table? We ran specific queries in a cluster trying to calculate the cost and we observe that the DBUs shown in the system table are ...

Latest Reply

karthik_p
Esteemed Contributor

10-24-2023 5:21:37 PM

1 kudos

@elgeo both should be the same, unless we somehow fail to pick the proper plan's DBU price. The usage column will have complete information related to SKU name, DBU units, etc. If you use the Azure Databricks calculator and compare, you should see a similar result.

by Siebert_Looije • Contributor

10-24-2023 3:58:35 AM

2263 Views
0 replies
0 kudos

How to search on empty string on text filter with Lakeview Dashboards

Hi, I have created a Lakeview dashboard with a couple of filters and a table. Now I would like to search for rows where a certain filter (column) has an empty string, but if I search for ' ' then it returns 'no data'. I am wondering how I can search for an empty stri...

by RyanHager • Contributor

07-12-2023 11:05:39 AM

3152 Views
4 replies
2 kudos

Roadmap on export menu option for SQL Query and Dashboard Types in Workspace

Are there plans for an export option for SQL Query and SQL Dashboard in the Workspace explorer screen, similar to notebooks? Background: Need a way to export and back up any queries and dashboards to save design work and move from staging environments ...

Latest Reply

Hubert-Dudek
Esteemed Contributor III

07-20-2023 8:17:35 AM

2 kudos

The best option would be to have them just under a Git repo (especially dashboards).

by surya_1527 • New Contributor

10-22-2023 1:00:36 AM

1751 Views
0 replies
0 kudos

DataBricks Certification Exam Got Suspended. Require support for the same.

Hello Team, I had a pathetic experience while attempting my first Databricks certification. Abruptly, the proctor asked me to show my desk; after I showed it, he/she asked multiple more times, wasted my time, and then suspended my exam, saying I have exceeded ...

by kll • New Contributor III

09-18-2023 12:24:54 PM

1283 Views
1 replies
0 kudos

how to save variables in one notebook to be imported into another?

Say, I have a list of values, dictionaries, variable names in `notebook1.ipynb` that I'd like to re-use / import in another `notebook2.ipynb`. For example, in `notebook1.ipynb`, I have the following: var1 = "dallas" var_lst = [100, 200, 300, 400, ...

Latest Reply

Krishnamatta
New Contributor III

10-20-2023 6:09:41 PM

0 kudos

You can use %run ./notebook2 after defining variables in notebook1. So notebook2 will use the variables defined in notebook1.

by elgeo • Valued Contributor II

10-18-2023 5:12:45 AM

1724 Views
1 replies
0 kudos

Retrieve DBU per query executed

Hello experts, do you know how we can retrieve the DBUs consumed for a specific query? Thank you

Latest Reply

elgeo
Valued Contributor II

10-20-2023 1:18:58 AM

0 kudos

I couldn't find a metadata table. However, the workaround is to take the DBU rate of the current cluster (retrieve it either online or, to be more accurate, from the compute page on the right) and multiply it by the time in minutes that the query took ...
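The arithmetic in this workaround can be sketched as follows, assuming the cluster's DBU rate is quoted per hour, so a duration in minutes has to be converted to hours first. The function name and the sample numbers are hypothetical, and the result is only a rough estimate, since the cluster may be running other work at the same time.

```python
def estimate_query_dbus(cluster_dbu_per_hour, query_minutes):
    """Rough DBU estimate: hourly cluster DBU rate times query duration in hours."""
    return cluster_dbu_per_hour * (query_minutes / 60.0)


# Hypothetical example: a cluster rated at 4 DBU/hour and a 15-minute query.
estimate = estimate_query_dbus(4.0, 15)
```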
