Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

AndLuffman
by New Contributor II
  • 2431 Views
  • 2 replies
  • 1 kudos

QRY Results incorrect but Exported data is OK

I ran a query "Select * from fact_Orders". This presented a lot of garbage: the correct column headers, but the contents were extremely random, e.g. blanks in the key column, VAT rates of 12282384234E-45. When I export to CSV, it presents fi...

romangehrn
by New Contributor II
  • 968 Views
  • 0 replies
  • 0 kudos

Speed issue with DBR 13+ for R

I got a notebook running on DBR 12.2 with the following R code: install.packages("microbenchmark") install.packages("furrr") library(microbenchmark) library(tidyverse) # example tibble df_test <- tibble(id = 1:100000, street_raw = rep("Bahnhofs...

Data Engineering
DBR 13
performance slow
R
speed error
sparkrookie
by New Contributor II
  • 2085 Views
  • 1 reply
  • 0 kudos

Structured Streaming Delta Table - Reading and writing from same table

Hi, I have a structured streaming job that reads from a delta table "A" and pushes to another delta table "B". A schema: group_key, id, timestamp, value. B schema: group_key, watermark_timestamp, derived_value. One requirement is that I need to get the m...

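
As a rough illustration of the pattern in this post, here is a minimal PySpark sketch that streams from a Delta table "A" and appends derived rows to a Delta table "B". The aggregation, checkpoint path, and output table name are placeholders, since the post's exact requirement is truncated.

from pyspark.sql import functions as F

# Stream from source table A (group_key, id, timestamp, value).
stream_a = spark.readStream.format("delta").table("A")

# Placeholder derivation: max value per group per window.
derived = (
    stream_a
    .withWatermark("timestamp", "10 minutes")
    .groupBy("group_key", F.window("timestamp", "10 minutes"))
    .agg(F.max("value").alias("derived_value"))
    .select("group_key",
            F.col("window.end").alias("watermark_timestamp"),
            "derived_value")
)

# Append to sink table B (group_key, watermark_timestamp, derived_value).
(derived.writeStream
    .format("delta")
    .outputMode("append")                                     # rows emitted once the watermark closes a window
    .option("checkpointLocation", "/tmp/checkpoints/a_to_b")  # placeholder path
    .toTable("B"))
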
shraddharane
by New Contributor
  • 38721 Views
  • 1 reply
  • 1 kudos

Migrating a legacy SSAS cube to Databricks

We have a SQL database designed in a star schema. We are migrating data from SQL to Databricks. There are cubes designed using SSAS, and these cubes are used by end users in Excel for analysis purposes. We are now looking for a solution for: 1) Can...

Latest Reply
-werners-
Esteemed Contributor III
  • 1 kudos

Databricks itself does not deliver semantic models like SSAS cubes, so Databricks cannot migrate them because there is nothing to migrate to. However, there are some options: use PowerBI instead of SSAS (there might even be a migrate option?). W...

140015
by New Contributor III
  • 2857 Views
  • 3 replies
  • 1 kudos

Resolved! Using DLT pipeline with non-incremental data

Hi, I would like to know what you think about using Delta Live Tables when the source for this pipeline is not incremental. What I mean by that is: suppose the data provider creates a new folder with files for me each time it has an update to the...

Latest Reply
Joe_Suarez
New Contributor III
  • 1 kudos

When dealing with B2B data building, the process of updating and managing your data can present unique challenges. Since your data updates involve new folders with files and you need to process the entire new folder, the concept of incremental proces...

2 More Replies
GNarain
by New Contributor II
  • 7450 Views
  • 7 replies
  • 4 kudos

Resolved! Is there an API call to set the "Table access control" workspace config?

Is there an API call to set the "Table access control" workspace config?

Latest Reply
SvenPeeters
New Contributor III
  • 4 kudos

Facing the same issue; I tried to fetch the current value via /api/2.0/workspace-conf?keys=enableTableAccessControl. Unfortunately this returns a 400: { "error_code": "BAD_REQUEST", "message": "Invalid keys: [\"enableTableAccessControl\"]" }

6 More Replies
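
For reference, a hedged sketch of the workspace-conf call discussed in this thread; the host and token are placeholders, and, as the reply above shows, the enableTableAccessControl key may be rejected with a 400 on some workspaces.

import requests

HOST = "https://<your-workspace>.cloud.databricks.com"   # placeholder
TOKEN = "<personal-access-token>"                         # placeholder
headers = {"Authorization": f"Bearer {TOKEN}"}

# Read the current value of the key.
resp = requests.get(f"{HOST}/api/2.0/workspace-conf",
                    headers=headers,
                    params={"keys": "enableTableAccessControl"})
print(resp.status_code, resp.text)

# Attempt to set it (PATCH takes a JSON map of key -> string value).
resp = requests.patch(f"{HOST}/api/2.0/workspace-conf",
                      headers=headers,
                      json={"enableTableAccessControl": "true"})
print(resp.status_code, resp.text)
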
Eldar_Dragomir
by New Contributor II
  • 2465 Views
  • 1 reply
  • 2 kudos

Resolved! Reprocessing the data with Auto Loader

Could you please give me an idea of how I can start reprocessing my data? Imagine I have a folder in ADLS Gen2, "/test", with binaryFiles. They have already been processed with the current pipeline. I want to reprocess the data and continue receiving new data. What t...

Latest Reply
Tharun-Kumar
Databricks Employee
  • 2 kudos

@Eldar_Dragomir In order to re-process the data, we have to change the checkpoint directory. This will start processing the files from the beginning. You can use cloudFiles.maxFilesPerTrigger to limit the number of files getting processed per micro-...

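
A minimal Auto Loader sketch of the approach in the reply: point the stream at a new checkpoint directory so files are picked up from the beginning, and cap files per micro-batch. The paths, target table, and option values are placeholders.

# New checkpoint directory => Auto Loader treats all files in /test as unseen.
df = (spark.readStream
      .format("cloudFiles")
      .option("cloudFiles.format", "binaryFile")
      .option("cloudFiles.maxFilesPerTrigger", 100)   # throttle the reprocessing
      .load("abfss://<container>@<account>.dfs.core.windows.net/test"))  # placeholder ADLS path

(df.writeStream
   .format("delta")
   .option("checkpointLocation", "/checkpoints/test_v2")  # fresh checkpoint, placeholder path
   .toTable("bronze_test_files"))                         # placeholder target table
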
anarad429
by New Contributor
  • 1797 Views
  • 1 reply
  • 1 kudos

Resolved! Unity Catalog + Reading variable from external notebook

I am trying to run a notebook which reads some of its variables from an external notebook (I used the %run command for that purpose), but it keeps giving me an error that these variables are not defined. These sequences of notebooks run perfectly fine on a...

Latest Reply
Atanu
Databricks Employee
  • 1 kudos

I think the issue here is the variable is not created until a value is assigned to it. So, you may need to assign a value to get_sql_schema

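
A small illustration of the %run pattern in question, assuming a variable named get_sql_schema (the name mentioned in the reply) and a placeholder notebook path: the variable must be assigned in the included notebook before the calling notebook can reference it.

# Included notebook, e.g. ./config_vars -- the variable has to be assigned here:
get_sql_schema = "SELECT column_name, data_type FROM information_schema.columns"  # placeholder value

# Calling notebook -- %run must be alone in its own cell, after which the name is defined:
# %run ./config_vars
print(get_sql_schema)
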
NathanLaw
by New Contributor III
  • 1202 Views
  • 1 reply
  • 0 kudos

CPU and GPU Elapsed Runtimes

I have 2 questions about elapsed job runtimes. The same Scoring notebook is run 3 times as 3 jobs. The jobs are identical, with the same PetaStorm code, CPU cluster config (not a Spot cluster), and data, but have varying elapsed runtimes. Elapsed runtimes...

Latest Reply
shyam_9
Databricks Employee
  • 0 kudos

Hi @NathanLaw, could you please confirm whether you have set any parameters for the best model? Does this stop after running some epochs if there is no improvement in the model performance?

Sanjay_AMP
by New Contributor II
  • 1297 Views
  • 1 reply
  • 1 kudos

Deployment-ready sample source-code for Delta Live Table & Autoloader

Hi all, we are planning to develop an Autoloader-based DLT pipeline that needs to be deployable via a CI/CD pipeline and observable. Can somebody please point me to source code that we can start with as a firm foundation, instead of falling into a newbie pattern ...

Latest Reply
Priyanka_Biswas
Databricks Employee
  • 1 kudos

Hi @Sanjay_AMP, Delta Live Tables and Auto Loader can be used together to incrementally ingest data from cloud object storage. Python code example: define a table called "customers" that reads data from a CSV file in cloud object storage; define a...

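
A hedged sketch of the kind of Python example the reply outlines: a Delta Live Tables table called "customers" ingesting CSV files from cloud object storage with Auto Loader. The landing path and options are placeholders, and the code is meant to run inside a DLT pipeline.

import dlt

@dlt.table(comment="Raw customers ingested incrementally from CSV files")
def customers():
    return (spark.readStream
            .format("cloudFiles")
            .option("cloudFiles.format", "csv")
            .option("cloudFiles.inferColumnTypes", "true")
            .load("s3://<bucket>/landing/customers/"))   # placeholder landing path
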
wojciech_jakubo
by New Contributor III
  • 13934 Views
  • 7 replies
  • 3 kudos

Question about monitoring driver memory utilization

Hi Databricks/Spark experts! I have a piece of pandas-based 3rd-party code that I need to execute as part of a bigger Spark pipeline. By nature, pandas-based code is executed on the driver node. I ran into out-of-memory problems and started exploring th...

(Attached image: "Driver memory cycles_ Busy cluster")
Latest Reply
Tharun-Kumar
Databricks Employee
  • 3 kudos

Hi @wojciech_jakubo, 1. JVM memory will not be utilized for Python-related activities. 2. In the image we can only see the storage memory. We also have execution memory, which would be the same. Hence I came up with the executor memory to be of ...

6 More Replies
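
Since the thread is about watching driver memory while pandas code runs on the driver, here is a small sketch of checking the Python process's resident memory with psutil; this is separate from the JVM heap shown in the Spark UI, and psutil's availability on the cluster is an assumption.

import os
import psutil

proc = psutil.Process(os.getpid())
rss_gb = proc.memory_info().rss / 1024 ** 3           # resident memory of the driver's Python process
total_gb = psutil.virtual_memory().total / 1024 ** 3  # total memory on the driver node
print(f"Python driver process RSS: {rss_gb:.2f} GiB of {total_gb:.2f} GiB")
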
kg6ka
by New Contributor
  • 2751 Views
  • 1 reply
  • 1 kudos

Is it possible to do this without the GitHub token and integration?

Hey guys, I have a question. I have Databricks jobs in a workflow that are linked to my Databricks repo, which contains the necessary scripts for one job or another. That is, the job is linked to the Databricks repo. The main code is developed in gi...

Latest Reply
User16752239289
Databricks Employee
  • 1 kudos

Does the user the API token was generated from have the git credential configured for the git repo? If not, you can follow the steps here: https://docs.databricks.com/en/repos/get-access-tokens-from-git-provider.html

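
In addition to the UI steps in the linked doc, a git credential can be registered for the job owner's user through the Git Credentials REST API; a hedged sketch, with host, tokens, and provider name as placeholders.

import requests

HOST = "https://<your-workspace>.cloud.databricks.com"  # placeholder
TOKEN = "<databricks-personal-access-token>"             # placeholder

resp = requests.post(f"{HOST}/api/2.0/git-credentials",
                     headers={"Authorization": f"Bearer {TOKEN}"},
                     json={"git_provider": "gitHub",            # e.g. gitHub, gitLab, azureDevOpsServices
                           "git_username": "<git-username>",
                           "personal_access_token": "<git-pat>"})
print(resp.status_code, resp.text)
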
Thor
by New Contributor III
  • 6978 Views
  • 1 reply
  • 2 kudos

Resolved! Dynamically change spark.task.cpus

Hello, I'm facing a problem with big tarballs to decompress, and to fit them in memory I had to limit Spark from processing too many files at the same time, so I changed the following property on my 8-core VM cluster: spark.task.cpus 4. This setting is the thresh...

Latest Reply
jose_gonzalez
Databricks Employee
  • 2 kudos

Hi @Thor, Spark does not offer the capability to dynamically modify configuration settings, such as spark.task.cpus, for individual stages or transformations while the application is running. Once a configuration property is set for a Spark applicati...

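
To make the reply concrete: spark.task.cpus is a static configuration, so it has to be set in the cluster's Spark config (for example "spark.task.cpus 4", as in the question) and the cluster restarted; it cannot be toggled per stage at runtime. A small sketch of inspecting the effective values in a running session, with placeholder defaults.

# spark.task.cpus is fixed for the lifetime of the application; you can read it
# at runtime, but changing it means editing the cluster Spark config and restarting.
task_cpus = int(spark.conf.get("spark.task.cpus", "1"))            # defaults to 1 if unset
executor_cores = int(spark.conf.get("spark.executor.cores", "8"))  # placeholder default for an 8-core VM
print(f"Concurrent task slots per executor: {executor_cores // task_cpus}")
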
