Data Engineering

Forum Posts

Sorted by:

by Ravikumashi • Contributor

12-10-2022 11:01:18 AM

660 Views
2 replies
0 kudos

access databricks secretes in int script

we are trying install databricks cli on init scripts and in order to do this we need to autheticate with databricks token but it is not secure as anyone got access to cluster can get hold of this databricks token.we try to inject the secretes into se...

Data Engineering

660 Views
2 replies
0 kudos

12-10-2022 11:01:18 AM

View Replies

Latest Reply

Hubert-Dudek
Esteemed Contributor III

12-10-2022 1:21:57 PM

0 kudos

I think you don't need to install CLI. There is a whole API available via notebook. below is example:import requests ctx = dbutils.notebook.entry_point.getDbutils().notebook().getContext() host_name = ctx.tags().get("browserHostName").get() host_toke...

0 kudos

12-10-2022 1:21:57 PM

1 More Replies

by KVNARK • Honored Contributor II

12-10-2022 10:39:35 AM

1621 Views
4 replies
11 kudos

Resolved! Pyspark learning path

Can anyone suggest to take the best series of courses offered by Databricks to learn pyspark for ETL purpose either in Databricks partner learning portal or Databricks learning portal.

Data Engineering

1621 Views
4 replies
11 kudos

12-10-2022 10:39:35 AM

View Replies

Latest Reply

Hubert-Dudek
Esteemed Contributor III

12-10-2022 1:23:42 PM

11 kudos

To learn Databricks ETL, I highy recommend videos made by Simon on that channel https://www.youtube.com/@AdvancingAnalytics

11 kudos

12-10-2022 1:23:42 PM

3 More Replies

by Harish2122 • Contributor

12-10-2022 3:22:20 AM

4936 Views
2 replies
10 kudos

Databricks SQL string_agg

Migrating some on-premise SQL views to Databricks and struggling to find conversions for some functions. the main one is the string_agg function.string_agg(field_name, ', ')Anyone know how to convert that to Databricks SQL?Thanks in advance.

Data Engineering

4936 Views
2 replies
10 kudos

12-10-2022 3:22:20 AM

View Replies

Latest Reply

Ajay-Pandey
Esteemed Contributor III

12-10-2022 9:40:39 AM

10 kudos

Hi @Harish K you can use the below query in spark SQL-%sql SELECT col1, array_join(collect_set(col2), ',') j FROM tmp GROUP BY col1

10 kudos

12-10-2022 9:40:39 AM

1 More Replies

by boyelana • Contributor III

12-06-2022 7:45:59 AM

1586 Views
9 replies
5 kudos

I am preparing for the data analyst exam and I need as many resources as I can get to fully prepare. Hands-on labs will be welcome as well

Data Engineering

1586 Views
9 replies
5 kudos

12-06-2022 7:45:59 AM

View Replies

Latest Reply

tunstila
Contributor II

12-09-2022 3:37:16 AM

5 kudos

Hi,Kindly refer to the materials below:Videohttps://info.databricks.com/dc/kvtpV3WYob2etSFEoxuDGMYVc6afyrIMgIW50ZzIbvpUgj2uOQyz91VsFjIVPsTMDcYAQ8K0HTbFHGKunTHn_tZmFrrG7SaByl8pfwUNMIZfHhQHiMHwQEKzYSwtM9Vr6hKVl28RlEsSlOluDqaxKqoLcg8-qEwq4xtnrG8zKMEOSpQ...

5 kudos

12-09-2022 3:37:16 AM

8 More Replies

by Searce • New Contributor III

12-08-2022 6:49:23 PM

824 Views
3 replies
5 kudos

Databricks Cross cloud

We have service with AWS Databricks. We are doing the same replica on GCP Databricks. Here we required all the services and functionalities should be run in AWS and AWS Databricks. The only thing data should be stored on the GCP Storage. Simply funct...

Data Engineering

824 Views
3 replies
5 kudos

12-08-2022 6:49:23 PM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-10-2022 7:15:17 AM

5 kudos

no, right now i don't think they are supporting this type of architecture

5 kudos

12-10-2022 7:15:17 AM

2 More Replies

by agnar05 • New Contributor II

12-08-2022 9:03:37 AM

1201 Views
4 replies
4 kudos

Databricks dashboard alias URL

Can we create a alias URL for databricks dashboard ?

Data Engineering

1201 Views
4 replies
4 kudos

12-08-2022 9:03:37 AM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-10-2022 7:13:13 AM

4 kudos

yes it is possible , you have to create DNS for your databricks and map it to your workspace URL

4 kudos

12-10-2022 7:13:13 AM

3 More Replies

by Smitha1 • Valued Contributor II

12-09-2022 1:41:35 PM

826 Views
1 replies
2 kudos

#00244807 and #00245872 Ticket Status - HIGH Priority

Dear @Vidula Khanna Vidula, Databricks team, @Nadia Elsayed @Jose Gonzalez @Aden Jaxson What is the SLA/ETA for normal priority ticket and HIGH priority ticket?I created tickets #00244807 on 7th Dec and #00245872 but haven't received any update ...

Data Engineering

826 Views
1 replies
2 kudos

12-09-2022 1:41:35 PM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-10-2022 7:03:35 AM

2 kudos

you can only create high-priority tasks if you have an enterprise plan.as a normal user you can only create normal tasksif you have enterprise plan then you can escalate case .databricks team will revert you soon there.

2 kudos

12-10-2022 7:03:35 AM

by john_odwyer • New Contributor III

05-21-2021 11:33:17 AM

3133 Views
1 replies
1 kudos

Resolved! Masking A Data Column

Is there a way to mask the data in a column in a table from specific users or user groups?

Data Engineering

3133 Views
1 replies
1 kudos

05-21-2021 11:33:17 AM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-10-2022 6:53:46 AM

1 kudos

yesthis doc will be helpful for you -- https://www.databricks.com/blog/2020/11/20/enforcing-column-level-encryption-and-avoiding-data-duplication-with-pii.html

1 kudos

12-10-2022 6:53:46 AM

by Mahendra1 • New Contributor III

11-30-2022 7:12:59 AM

519 Views
1 replies
0 kudos

Materials for preparing data bricks professional exam.

Hi All, Is there any book / materials for studying for data bricks professional certification ?Thank You !!!

Data Engineering

519 Views
1 replies
0 kudos

11-30-2022 7:12:59 AM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-10-2022 6:50:27 AM

0 kudos

please check databricks academy,there you will find the right courses

0 kudos

12-10-2022 6:50:27 AM

by Kaniz • Community Manager

09-22-2021 1:47:54 PM

465 Views
1 replies
0 kudos

How to read data from hdfs using scala language?

Data Engineering

465 Views
1 replies
0 kudos

09-22-2021 1:47:54 PM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-10-2022 6:48:02 AM

0 kudos

use magic commands

0 kudos

12-10-2022 6:48:02 AM

by 183530 • New Contributor III

12-09-2022 2:07:05 PM

404 Views
2 replies
1 kudos

i need a regex to get whole word with parentheses

SELECT '(CC) ABC' REGEXP '\\b\\(CC\\)\\b' AS TEST1, 'A(CC) ABC' REGEXP '\\b\\(CC\\)\\b' AS TEST2, 'A (CC)A ABC' REGEXP '\\b\\(CC\\)\\b' AS TEST3, 'A (CC) A ABC' REGEXP '\\b\\(CC\\)\\b' AS TEST4, 'A ABC (CC)' REGEXP '\\b\\(CC\\)\\b' AS TES...

Data Engineering

404 Views
2 replies
1 kudos

12-09-2022 2:07:05 PM

View Replies

Latest Reply

183530
New Contributor III

12-10-2022 6:47:37 AM

1 kudos

get whole word "(CC)"I had already written the outputexpected outuput '(CC) ABC' REGEXP <<regex>> = TRUE'A(CC) ABC' REGEXP <<regex>> = FALSE'A (CC)A ABC' REGEXP <<regex>> = FALSE 'A (CC) A ABC' REGEXP <<regex>> = TRUE 'A ABC (CC)' REGEXP <<regex>> = ...

1 kudos

12-10-2022 6:47:37 AM

1 More Replies

by kavs • New Contributor

07-20-2022 11:46:13 PM

417 Views
1 replies
0 kudos

I am reading a online API and creating data frame I want to pass the url value as an argument how to achieve this?

Data Engineering

417 Views
1 replies
0 kudos

07-20-2022 11:46:13 PM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-10-2022 6:46:29 AM

0 kudos

use python formatting

0 kudos

12-10-2022 6:46:29 AM

by Koko • New Contributor II

09-16-2022 8:53:05 AM

663 Views
1 replies
2 kudos

execute sql server agent jobs from Databricks notebook

Is it possible to execute sql server agent job from Databricks notebook?

Data Engineering

663 Views
1 replies
2 kudos

09-16-2022 8:53:05 AM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-10-2022 6:39:00 AM

2 kudos

i dont think this type of feature is available there

2 kudos

12-10-2022 6:39:00 AM

by akshay_1333 • New Contributor II

11-29-2022 8:39:34 AM

421 Views
1 replies
3 kudos

Note book formatting

I am using DBR 10.4 LTS instance can anyone help me formatting the code.I have tried with format python error pop up with upgrade to DBR 11.2 any other alternative to this?

Data Engineering

421 Views
1 replies
3 kudos

11-29-2022 8:39:34 AM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-10-2022 6:37:37 AM

3 kudos

please give us a code by that we can help you

3 kudos

12-10-2022 6:37:37 AM

by Ossian • New Contributor

07-21-2021 12:08:18 AM

1208 Views
1 replies
0 kudos

Driver restarts and job dies after 10-20 hours (Structured Streaming)

I am running a java/jar Structured Streaming job on a single node cluster (Databricks runtime 8.3). The job contains a single query which reads records from multiple Azure Event Hubs using Spark Kafka functionality and outputs results to a mssql dat...

Data Engineering

1208 Views
1 replies
0 kudos

07-21-2021 12:08:18 AM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-10-2022 6:31:30 AM

0 kudos

its seems that when your nodes are increasing it is seeking for init script and it is failing so you can use reserve instances for this activity instead of spot instances it will increase your overall costor alternatively, you can use depended librar...

0 kudos

12-10-2022 6:31:30 AM

User

Count

1601

736

343

284

246

Databricks

Forum Posts

access databricks secretes in int script

Resolved! Pyspark learning path

Databricks SQL string_agg

I am preparing for the data analyst exam and I need as many resources as I can get to fully prepare. Hands-on labs will be welcome as well

Databricks Cross cloud

Databricks dashboard alias URL

#00244807 and #00245872 Ticket Status - HIGH Priority

Resolved! Masking A Data Column

Materials for preparing data bricks professional exam.

How to read data from hdfs using scala language?

i need a regex to get whole word with parentheses

I am reading a online API and creating data frame I want to pass the url value as an argument how to achieve this?

execute sql server agent jobs from Databricks notebook

Note book formatting

Driver restarts and job dies after 10-20 hours (Structured Streaming)

DELTA_EXCEED_CHAR_VARCHAR_LIMIT

Not able to set run_as service_principal_name

Pyspark operations slowness in CLuster 14.3LTS as ...

[Databricks Assets Bundles] Workflow trigger on fi...

Addressing Pipeline Error Handling in Databricks b...