Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

c038644
by New Contributor II
  • 1982 Views
  • 3 replies
  • 3 kudos

Use of venv pack

Hi, I'm very new so this probably sounds stupid... I'm following the blog on How to Manage Python Dependencies in PySpark: https://www.databricks.com/blog/2020/12/22/how-to-manage-python-dependencies-in-pyspark.html ...but when I try, the packing works fin...

Latest Reply
Debayan
Databricks Employee
  • 3 kudos

Can you try using an absolute path instead of a relative path? For reference: https://stackoverflow.com/questions/38661464/filenotfounderror-winerror-3
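For reference, a minimal sketch of the flow from that blog post, with the archive written to an absolute path (the filename and locations are placeholders, not the poster's actual paths):

    import os
    import venv_pack  # pip install venv-pack
    from pyspark.sql import SparkSession

    # Pack the active virtual environment to an ABSOLUTE path; relative
    # paths are a common cause of FileNotFoundError / WinError 3 here.
    venv_pack.pack(output='/tmp/pyspark_venv.tar.gz')  # placeholder path

    # Ship the archive to the executors; '#environment' is the unpack dir.
    os.environ['PYSPARK_PYTHON'] = './environment/bin/python'
    spark = SparkSession.builder.config(
        'spark.archives', '/tmp/pyspark_venv.tar.gz#environment'
    ).getOrCreate()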

2 More Replies
AnandR
by New Contributor
  • 978 Views
  • 1 reply
  • 1 kudos

I have 2 roles created for my Databricks account on AWS. Want to know which role Databricks will use for AWS resources (e.g. cluster creation)

I have 1 role with the AWS root account and 1 role with an AWS non-root account. How do I tell Databricks to use a specific role for cluster creation? Please guide me here; any documentation will also suffice. Thanks.

Latest Reply
AmanSehgal
Honored Contributor III
  • 1 kudos

Go to Settings > Admin Console. Under the Instance Profiles tab you can add an instance profile, which is a container for an IAM role. Using this you can let the EC2 instances know which S3 buckets they can access. Under the Users tab you can manage users who have access...
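To make the role choice explicit per cluster, the Clusters API also accepts an instance profile ARN in aws_attributes. A hedged sketch (workspace URL, token, and ARN are placeholders; the profile must already be registered under Admin Console > Instance Profiles):

    import requests

    HOST = 'https://<your-workspace>.cloud.databricks.com'  # placeholder
    TOKEN = '<personal-access-token>'                       # placeholder

    resp = requests.post(
        f'{HOST}/api/2.0/clusters/create',
        headers={'Authorization': f'Bearer {TOKEN}'},
        json={
            'cluster_name': 'etl-cluster',
            'spark_version': '11.3.x-scala2.12',
            'node_type_id': 'i3.xlarge',
            'num_workers': 2,
            # Pin the cluster to a specific IAM role via its instance profile.
            'aws_attributes': {
                'instance_profile_arn':
                    'arn:aws:iam::<account-id>:instance-profile/<profile-name>'
            },
        },
    )
    resp.raise_for_status()
    print(resp.json()['cluster_id'])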

TT1
by New Contributor III
  • 2185 Views
  • 2 replies
  • 8 kudos
Latest Reply
AmanSehgal
Honored Contributor III
  • 8 kudos

Notebooks are auto-saved, and you can track changes by clicking Revision History in the top-right corner of the notebook. You can also link a Git repo to your notebook to track changes.

1 More Reply
zyang
by Contributor
  • 1635 Views
  • 1 reply
  • 4 kudos

pyspark delta table schema evolution

I am using schema evolution on a Delta table, and the code is written in a Databricks notebook. df.write.format("delta").mode("append").option("mergeSchema", "true").partitionBy("date").save(path) But I ...

Latest Reply
Noopur_Nigam
Databricks Employee
  • 4 kudos

Hi @z yang, please provide the df creation code as well, so we can understand the complete exception and scenario.
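For anyone hitting the same question, a self-contained sketch of the pattern from the post (the table path is hypothetical; `spark` is predefined in a Databricks notebook). The second append adds a column, which mergeSchema lets through instead of failing with an AnalysisException:

    path = '/tmp/delta/events'  # hypothetical path

    # First write establishes the schema (id, date).
    df1 = spark.createDataFrame([(1, '2023-01-01')], ['id', 'date'])
    df1.write.format('delta').mode('append').partitionBy('date').save(path)

    # Second write adds a new column; mergeSchema evolves the table schema.
    df2 = spark.createDataFrame([(2, '2023-01-02', 'click')],
                                ['id', 'date', 'event'])
    (df2.write
        .format('delta')
        .mode('append')
        .option('mergeSchema', 'true')
        .partitionBy('date')
        .save(path))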

j02424
by New Contributor
  • 2861 Views
  • 1 reply
  • 4 kudos

Best practice to delete /dbfs/tmp ?

What is best practice regarding the tmp folder? We have a very large amount of data in that folder and are not sure whether to delete it, back it up, etc.

Latest Reply
Debayan
Databricks Employee
  • 4 kudos

/dbfs/tmp can contain a lot of files, including temporary system files used for intermediate calculations, as well as subdirectories that can contain packages from user-defined installations. It is always better to back up the files first.
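A hedged sketch of that back-up-first approach from a notebook (`dbutils` is predefined there; the backup location is a placeholder):

    backup_root = 'dbfs:/mnt/backups/tmp-archive'  # placeholder location

    # Inspect what is actually in dbfs:/tmp before touching anything.
    for f in dbutils.fs.ls('dbfs:/tmp'):
        print(f.path, f.size)

    # Recursive copy; delete only once the backup is verified.
    dbutils.fs.cp('dbfs:/tmp', backup_root, recurse=True)
    # dbutils.fs.rm('dbfs:/tmp/<subdir>', recurse=True)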

Akshith_Rajesh
by New Contributor III
  • 4520 Views
  • 3 replies
  • 6 kudos

Unable to write Data frame to Azure Synapse Table

When I am trying to insert records into the Azure Synapse table using JDBC, it throws the error below: com.microsoft.sqlserver.jdbc.SQLServerException: The statement failed. Column 'COMPANY_ADDRESS_STATE' has a data type that cannot participate ...

Latest Reply
Hubert-Dudek
Esteemed Contributor III
  • 6 kudos

Columns that use any of the following data types cannot be included in a columnstore index: nvarchar(max), varchar(max), and varbinary(max) (applies to SQL Server 2016 and prior versions, and nonclustered columnstore indexes), so the issue is on the Azu...
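One workaround, assuming the write goes through the Azure Synapse connector: bound the string columns with maxStrLength so they map to NVARCHAR(n) instead of NVARCHAR(MAX), or create the table as a heap. A sketch with placeholder connection values:

    (df.write
        .format('com.databricks.spark.sqldw')
        .option('url', 'jdbc:sqlserver://<server>.database.windows.net:1433;'
                       'database=<db>')                     # placeholder
        .option('tempDir', 'abfss://<container>@<account>'
                           '.dfs.core.windows.net/tmp')     # placeholder
        .option('forwardSparkAzureStorageCredentials', 'true')
        .option('dbTable', 'dbo.company')                   # placeholder
        .option('maxStrLength', '4000')   # NVARCHAR(4000), columnstore-safe
        # .option('tableOptions', 'heap') # alternative: skip the columnstore
        .mode('append')
        .save())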

2 More Replies
him
by New Contributor III
  • 1181 Views
  • 1 reply
  • 3 kudos
Latest Reply
Debayan
Databricks Employee
  • 3 kudos

You can try referring to the example below: https://docs.databricks.com/dev-tools/api/latest/examples.html#upload-a-big-file-into-dbfs
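The gist of that docs example, as a hedged Python sketch (host and token are placeholders): the file is streamed up in base64 chunks through the dbfs/create, dbfs/add-block, and dbfs/close endpoints.

    import base64
    import requests

    HOST = 'https://<your-workspace>.cloud.databricks.com'  # placeholder
    HEADERS = {'Authorization': 'Bearer <token>'}           # placeholder

    def dbfs_upload(local_path, dbfs_path, chunk=1024 * 1024):
        # Open a streaming handle, append 1 MB base64 blocks, then close.
        r = requests.post(f'{HOST}/api/2.0/dbfs/create', headers=HEADERS,
                          json={'path': dbfs_path, 'overwrite': True})
        handle = r.json()['handle']
        with open(local_path, 'rb') as f:
            while block := f.read(chunk):
                requests.post(f'{HOST}/api/2.0/dbfs/add-block', headers=HEADERS,
                              json={'handle': handle,
                                    'data': base64.b64encode(block).decode()})
        requests.post(f'{HOST}/api/2.0/dbfs/close', headers=HEADERS,
                      json={'handle': handle})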

Bharath_1610
by New Contributor
  • 1716 Views
  • 2 replies
  • 1 kudos

Resolved! Check Existence of table

Hi Team, how do we check the existence of a table in an ADF container using a SQL query in Databricks? Thanks in advance.

Latest Reply
Noopur_Nigam
Databricks Employee
  • 1 kudos

Hi, please elaborate on the issue so we can help you resolve it.
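If the question is simply how to test whether a table exists from a notebook, two common options (database and table names are placeholders; tableExists needs PySpark 3.3+ / a recent runtime):

    # Catalog API.
    exists = spark.catalog.tableExists('mydb.mytable')

    # Pure SQL alternative: an empty result means the table is absent.
    exists_sql = spark.sql(
        "SHOW TABLES IN mydb LIKE 'mytable'"
    ).count() > 0

    print(exists, exists_sql)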

1 More Reply
Mr__E
by Contributor II
  • 1285 Views
  • 1 reply
  • 3 kudos

Sync prod WS DBs to dev WS DBs

We have a couple of sources we'd already set up to stream to prod using a 3p system. Is there a way to sync this directly to our dev workspace to build pipelines? E.g. directly connecting to a cluster in prod and pulling with a job cluster, dumping to S3 and u...

Latest Reply
Debayan
Databricks Employee
  • 3 kudos

DBFS can be used in many ways. Please refer below:
  • Allows you to interact with object storage using directory and file semantics instead of cloud-specific API commands.
  • Allows you to mount cloud object storage locations so that you can map storage cre...
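As one concrete variant of the mount option, a hedged sketch for making the prod landing bucket visible from the dev workspace (bucket and mount point are placeholders; credentials are omitted, and with an instance profile attached extra configs may not be needed):

    # Mount the prod bucket in the dev workspace, then read it like a path.
    dbutils.fs.mount(
        source='s3a://prod-stream-landing',   # placeholder bucket
        mount_point='/mnt/prod-landing',
    )
    display(dbutils.fs.ls('/mnt/prod-landing'))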

parthibsg
by New Contributor II
  • 1331 Views
  • 1 reply
  • 2 kudos

When to use Dataframes API over Spark SQL

Hello Experts, I am new to Databricks. Building data pipelines, I have both batch and streaming data. Should I use the DataFrames API to read CSV files, convert to Parquet format, and then do the transformations? Or write to a table using CSV and then use Spark SQL...

Latest Reply
Debayan
Databricks Employee
  • 2 kudos

Hi Rathinam, it would be better to understand the pipeline more in this situation. Writing to a table using CSV and then using Spark SQL will be faster in a few cases than the other approach.
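Worth noting that both styles compile down to the same query plan, so the choice is largely about readability. A sketch with a hypothetical input file:

    from pyspark.sql import functions as F

    df = (spark.read.option('header', 'true').option('inferSchema', 'true')
          .csv('/tmp/input.csv'))  # hypothetical path

    # DataFrame API version.
    out_df = df.filter(F.col('amount') > 100).groupBy('country').count()

    # Equivalent Spark SQL version over the same data.
    df.createOrReplaceTempView('sales')
    out_sql = spark.sql('''
        SELECT country, count(*) AS count
        FROM sales WHERE amount > 100 GROUP BY country
    ''')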

lokeshr
by New Contributor
  • 1170 Views
  • 2 replies
  • 1 kudos

Clarity on usage STREAM while defining DLT tables

Hi, I am currently trying to learn Databricks and going through tutorials and learning materials. I came across this link: https://databricks.com/discover/pages/getting-started-with-delta-live-tables While I get most of what is described on the page, I fin...

Latest Reply
jose_gonzalez
Databricks Employee
  • 1 kudos

Hi @Lokesh Raju, just a friendly follow-up. Did Tomasz's response help you resolve your question? If it did, please mark it as best.
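For later readers: the Python counterpart of SQL's STREAM() is dlt.read_stream(). dlt.read() recomputes the whole table on each update, while read_stream processes only new records incrementally. A hedged sketch (paths are placeholders):

    import dlt

    @dlt.table
    def raw_events():
        return (spark.readStream.format('cloudFiles')
                .option('cloudFiles.format', 'json')
                .load('/mnt/landing/events'))  # placeholder path

    @dlt.table
    def clean_events():
        # Incremental read of the table above, like STREAM(LIVE.raw_events).
        return dlt.read_stream('raw_events').where('event_id IS NOT NULL')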

1 More Reply
yatharthmahesh
by New Contributor III
  • 2662 Views
  • 3 replies
  • 6 kudos

ENABLE CHANGE DATA FEED FOR EXISTING DELTA-TABLE

I have a Delta table already created, and now I want to enable the change data feed. I read that I have to set the delta.enableChangeDataFeed property to true. However, this cannot be done using the Scala API. I tried using this but it didn't work. I am ...

Latest Reply
Hubert-Dudek
Esteemed Contributor III
  • 6 kudos

'delta.enableChangeDataFeed' has to be without quotes: spark.sql("ALTER TABLE delta_training.onaudience_dpm SET TBLPROPERTIES (delta.enableChangeDataFeed = true)").show()
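Once the property is set, the feed can be read back. A hedged sketch using the table name from the thread (the starting version is illustrative, and CDF only covers commits made after it was enabled):

    spark.sql("""
        ALTER TABLE delta_training.onaudience_dpm
        SET TBLPROPERTIES (delta.enableChangeDataFeed = true)
    """)

    changes = (spark.read.format('delta')
               .option('readChangeFeed', 'true')
               .option('startingVersion', 5)   # illustrative version
               .table('delta_training.onaudience_dpm'))
    changes.select('_change_type', '_commit_version').show()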

2 More Replies
KumarShiv
by New Contributor III
  • 1846 Views
  • 2 replies
  • 2 kudos

Resolved! Databricks Spark SQL function "PERCENTILE_DISC()" output not accurate.

I am trying to get the percentile values on different splits, but I found that the result of the Databricks PERCENTILE_DISC() function is not accurate. I have run the same query on MS SQL but get a different result set. Here are both result sets for PySpark ...

Latest Reply
artsheiko
Databricks Employee
  • 2 kudos

The reason might be that in MS SQL, PERCENTILE_DISC is nondeterministic.
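A small check that makes the semantics visible, runnable on a recent runtime (Spark 3.3+ supports WITHIN GROUP): PERCENTILE_DISC returns an actual value from the column, the smallest one whose cumulative distribution reaches the fraction, and tie ordering is engine-specific, so exact matches with MS SQL are not guaranteed.

    spark.createDataFrame(
        [('a', 10), ('a', 20), ('a', 30), ('a', 40)], ['grp', 'v']
    ).createOrReplaceTempView('t')

    spark.sql("""
        SELECT grp, percentile_disc(0.5) WITHIN GROUP (ORDER BY v) AS p50
        FROM t GROUP BY grp
    """).show()
    # p50 = 20: the first value whose cumulative distribution >= 0.5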

1 More Reply
Trung
by Contributor
  • 3135 Views
  • 5 replies
  • 5 kudos

Job fail due to Access Denied

Please help me solve the problem that my Databricks account cannot start the job, whether triggered manually or on a schedule, although I can run the script without error.

Latest Reply
Vivian_Wilfred
Databricks Employee
  • 5 kudos

Hi @trung nguyen, please check if you have the necessary instance profile attached to the job cluster. You are definitely missing something related to IAM.

4 More Replies
Anonymous
by Not applicable
  • 1447 Views
  • 4 replies
  • 4 kudos

Invalid shard address

I'm running pyspark through databricks-connect and getting an error saying: ```ERROR SparkClientManager: Fail to get the SparkClient java.util.concurrent.ExecutionException: com.databricks.service.SparkServiceConnectionException: Invalid shard address:`...

Latest Reply
Prabakar
Databricks Employee
  • 4 kudos

Hi @Marco Wong, was this working before and failing now? Are you behind a VPN or firewall? If so, can you check by disabling it? Enable traces in Wireshark and collect a dump to check whether there is traffic going to the workspace. Check if you can get curl wor...
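A quick reachability check before packet traces, as a hedged sketch: hit a simple REST endpoint with the same host that databricks-connect has configured as its shard address (host and token are placeholders).

    import requests

    # Must match the shard address in your databricks-connect config,
    # including the https:// scheme.
    HOST = 'https://<your-workspace>.cloud.databricks.com'  # placeholder

    r = requests.get(f'{HOST}/api/2.0/clusters/list',
                     headers={'Authorization': 'Bearer <token>'},
                     timeout=10)
    print(r.status_code)  # 200 means the workspace answers with this token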

3 More Replies

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group