Data Engineering

Forum Posts

Sorted by:

by HamidHamid_Mora • New Contributor II

04-14-2023 12:53:03 AM

3374 Views
4 replies
3 kudos

ganglia is unavailable on DBR 13.0

We created a library in databricks to ingest ganglia metrics for all jobs in our delta tables;However end point 8652 is no more available on DBR 13.0is there any other endpoint available ? since we need to log all metrics for all executed jobs not on...

Data Engineering

3374 Views
4 replies
3 kudos

04-14-2023 12:53:03 AM

View Replies

Latest Reply

h_h_ak
Contributor

11-07-2024 1:45:21 PM

3 kudos

You should have a look here: https://community.databricks.com/t5/data-engineering/azure-databricks-metrics-to-prometheus/td-p/71569

3 kudos

11-07-2024 1:45:21 PM

3 More Replies

by jfarmer • New Contributor II

01-21-2023 11:18:59 AM

6318 Views
3 replies
1 kudos

PermissionError / Operation not Permitted with Files-in-Repos

I've been running a notebook using files-in-repo. Previously this has worked fine. I'm unsure what's changed (I was testing integration with DCS on older runtimes, but don't think I made any persistent changes)--but now it's throwing an error (always...

Data Engineering

6318 Views
3 replies
1 kudos

01-21-2023 11:18:59 AM

View Replies

Latest Reply

_carleto_
New Contributor II

10-17-2023 7:14:16 AM

1 kudos

Hi @jfarmer , did you solved this issue? I'm having exactly the same challenge.Thanks!

1 kudos

10-17-2023 7:14:16 AM

2 More Replies

by parthsalvi • Contributor

09-16-2022 5:26:08 AM

3248 Views
1 replies
2 kudos

Amazon SES : boto3 credentials not found. DBR 11.2 Shared mode

We're trying to send email using Amazon SES using boto3.client in python. We've added SES Full access in clusters IAM Role.We were able to send email in "No isolation shared" mode in DBR 11.2 using ses = boto3.client('ses', region_name='us-****-2') s...

Data Engineering

3248 Views
1 replies
2 kudos

09-16-2022 5:26:08 AM

View Replies

Latest Reply

JameDavi_51481
Contributor

08-11-2023 5:30:31 AM

2 kudos

This appears to be an intentional design choice to prevent users from using the credentials of the host machine to carry out arbitrary AWS API calls. I really wish there was a workaround or setting to disable this behavior because we put a lot of wor...

2 kudos

08-11-2023 5:30:31 AM

by Hubert-Dudek • Esteemed Contributor III

10-19-2022 6:22:45 AM

9698 Views
3 replies
25 kudos

Bamboolib with databricks, low-code programming is now available on #databricks Now you can prepare your databricks code without ... coding. Low code ...

Bamboolib with databricks, low-code programming is now available on #databricksNow you can prepare your databricks code without ... coding. Low code solution is now available on Databricks. Install and import bamboolib to start (require a version of ...

Data Engineering

9698 Views
3 replies
25 kudos

10-19-2022 6:22:45 AM

View Replies

Latest Reply

Palkers
New Contributor III

06-29-2023 3:52:12 AM

25 kudos

I have tried to load parquet file using bamboolib menu, and getting below error that path does not existI can load the same file without no problem using spark or pandas using following pathciti_pdf = pd.read_parquet(f'/dbfs/mnt/orbify-sales-raw/Wide...

25 kudos

06-29-2023 3:52:12 AM

2 More Replies

by parthsalvi • Contributor

09-16-2022 5:11:03 AM

5522 Views
2 replies
2 kudos

getContext() in dbutils.notebook not working in DBR 11.2 10.4 LTS Shared Mode It's also working in no isolation Mode in DBR 11.2

We are trying to fetch notebook context in our Job logging workflow.current_context = dbutils.notebook.entry_point.getDbutils().notebook().getContext().toJson()We were able to access this in DBR 10.4 custom mode but in DBR 10.4 & 11.2 (Shared Mode) w...

Data Engineering

5522 Views
2 replies
2 kudos

09-16-2022 5:11:03 AM

View Replies

Latest Reply

Tjomme
New Contributor III

06-22-2023 8:15:20 AM

2 kudos

See also: https://community.databricks.com/s/question/0D58Y00009t95NHSAY/unity-catalog-shared-access-mode-dbutilsnotebookentrypointgetcontext-not-whitelisted

2 kudos

06-22-2023 8:15:20 AM

1 More Replies

by Gilg • Contributor II

05-18-2023 9:54:02 PM

1539 Views
0 replies
0 kudos

Databricks Runtime 12.1 spins VM in Ubuntu 18.04 LTS

Hi Team,Our cluster is currently in DBR 12.1 but it spins up a VMs with Ubuntu 18.04 LTS. 18.04 will be EOL soon. According to this https://learn.microsoft.com/en-us/azure/databricks/release-notes/runtime/12.1 OS version should be 20.04 and now a bit...

Data Engineering

1539 Views
0 replies
0 kudos

05-18-2023 9:54:02 PM

by ivanychev • Contributor II

03-07-2023 11:23:40 AM

5974 Views
7 replies
5 kudos

DBR 12.2: DeltaOptimizedWriter: Resolved attribute(s) missing from in operator

After upgrading from DBR 11.3 LTS to DBR 12.2 LTS we started to observe the following error during "read from parquet and write to delta" piece of logic.AnalysisException: Resolved attribute(s) group_id#72,display_name#73,parent_id#74,path#75,path_li...

Data Engineering

5974 Views
7 replies
5 kudos

03-07-2023 11:23:40 AM

View Replies

Latest Reply

Valtor
New Contributor II

04-27-2023 8:08:16 AM

5 kudos

I can confirm that this issue is resolved for us as well in the latest 12.2 release.

5 kudos

04-27-2023 8:08:16 AM

6 More Replies

by gud4eve • New Contributor III

04-10-2023 12:07:10 AM

2888 Views
1 replies
0 kudos

Resolved! Scala app getting NullPointerException while migrating from DBR 7.3 to 9.1 (and above)

We are migrating our Scala jobs from AWS EMR (6.2.1 and Spark version - 3.0.1) to Lakehouse and few of our jobs are failing due to NullPointerException. We tried in Databricks Runtime 7.3 LTS, it is working fine. Because it had same spark version 3.0...

Data Engineering

2888 Views
1 replies
0 kudos

04-10-2023 12:07:10 AM

View Replies

Latest Reply

gud4eve
New Contributor III

04-10-2023 11:33:40 PM

0 kudos

In one of my code statements, I updated scala Boolean to java.lang.Boolean and this is working fine now. May be in new newer Spark versions, null in scala Boolean isn't supported.

0 kudos

04-10-2023 11:33:40 PM

by kll • New Contributor III

04-05-2023 5:12:49 PM

2545 Views
2 replies
0 kudos

Unable to render widget to display map within Jupyter notebook output cell

I am attempting to render a map within jupyter notebook and keep bumping into output limit. Below is my code: import pydeck as pdk import pandas as pd COLOR_BREWER_BLUE_SCALE = [ [240, 249, 232], [204, 235, 197], [168, 221, 181], ...

Data Engineering

2545 Views
2 replies
0 kudos

04-05-2023 5:12:49 PM

View Replies

Latest Reply

Anonymous
Not applicable

04-07-2023 11:43:34 PM

0 kudos

Hi @Keval Shah Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.Please help us select the best solution by clicking on "Select As Best" if it does.Your feedback w...

0 kudos

04-07-2023 11:43:34 PM

1 More Replies

by JordiDekker • New Contributor III

03-27-2023 2:26:24 AM

9510 Views
5 replies
6 kudos

StreamCorruptedException, databricks-connect 9.1

Last week, around the 21st of march, we started having issues with databricks-connect (DBR 9.1 LTS). "databricks-connect test" works, but the following code snippet:from pyspark.sql import SparkSession spark = SparkSession.builder.getOrCreate() s...

Data Engineering

9510 Views
5 replies
6 kudos

03-27-2023 2:26:24 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-30-2023 12:50:33 AM

6 kudos

Hi @Jordi Dekker Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers ...

6 kudos

03-30-2023 12:50:33 AM

4 More Replies

by matthewe97 • New Contributor

03-21-2023 3:03:32 AM

5580 Views
3 replies
2 kudos

Resolved! Are window functions more performant than self joins?

I have a table with data for each month end and want to know the LEAD and LAG data points either side of each month. For example:SELECT month_date, LEAD(month_date) OVER (PARTITION BY id ORDER BY month_date) next_month_date, LAG(month_date) OVER (PA...

Data Engineering

5580 Views
3 replies
2 kudos

03-21-2023 3:03:32 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-29-2023 10:48:42 PM

2 kudos

Hi @Matthew Elsham Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answer...

2 kudos

03-29-2023 10:48:42 PM

2 More Replies

by ABVectr • New Contributor III

02-08-2023 4:10:16 AM

3852 Views
6 replies
1 kudos

Resolved! Maven Package install failing on DBR 11.3 LTS

Hi Databricks Community,I ran into the following issue when setting up a new cluster with the latest LTS Databricks runtime (11.3). When trying to install the package with the coordinates com.microsoft.azure.kusto:kusto-spark_3.0_2.12:3.1.4 from Mave...

Data Engineering

3852 Views
6 replies
1 kudos

02-08-2023 4:10:16 AM

View Replies

Latest Reply

Anonymous
Not applicable

02-15-2023 10:04:18 PM

1 kudos

Hi @Andrei Bondarenko Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you....

1 kudos

02-15-2023 10:04:18 PM

5 More Replies

by SS0201 • New Contributor II

02-09-2023 9:36:56 PM

1558 Views
1 replies
0 kudos

Unable to connect to Azure Cosmos DB Cassandra API table using Azure databricks job

Getting below error:Query [id = , runId = ] terminated with exception: Failed to open native connection to Cassandra at {<name>.cassandra.cosmosdb.azure.com:10350} :: Method com/microsoft/azure/cosmosdb/cassandra/CosmosDbConnectionFactory$.createSess...

Data Engineering

1558 Views
1 replies
0 kudos

02-09-2023 9:36:56 PM

View Replies

Latest Reply

Debayan
Databricks Employee

02-12-2023 9:00:50 PM

0 kudos

Hi, The error looks like there is an issue connecting to csms-ws-ddicsg-dev-001.cassandra.cosmosdb.azure.com:10350. Could you please reverify this in networking config? Also, it will be helpful if you raise an Azure case simultaneously to check the n...

0 kudos

02-12-2023 9:00:50 PM

by Databricks_-Dat • New Contributor II

02-09-2023 2:11:34 AM

2565 Views
2 replies
4 kudos

what is the supported mssql connector for Databricks runtime 11.3LTS Scala 2.12 Spark 3.3.0?

We were using mssql connector -com.microsoft.azure:spark-mssql-connector_2.12_3.0:1.0.0-alpha with 10.3LTS DBR. As we need to upgrade to higher version of DBR to make use of new functions like unpivot/melt in the notebooks. -com.microsoft.azure:spark...

Data Engineering

2565 Views
2 replies
4 kudos

02-09-2023 2:11:34 AM

View Replies

Latest Reply

ranged_coop
Valued Contributor II

02-10-2023 3:45:52 AM

4 kudos

Is the spark 3.3 series even supported by the connector yet ?As per the [github link](https://github.com/microsoft/sql-spark-connector#current-releases) - assuming this is the library you are trying to use ?The latest Spark 2.4.x compatible connector...

4 kudos

02-10-2023 3:45:52 AM

1 More Replies

by yousry • New Contributor II

01-18-2023 5:06:39 AM

5966 Views
2 replies
2 kudos

Resolved! What is the best way to find deltalake version on OSS and Databricks at runtime?

To identify certain deltalake features available on a certain installation, it is important to have a robust way to identify deltalake version. For OSS, I found that the below Scala snippet will do the job.import io.delta println(io.delta.VERSION)Not...

Data Engineering

5966 Views
2 replies
2 kudos

01-18-2023 5:06:39 AM

View Replies

Latest Reply

shan_chandra
Databricks Employee

02-02-2023 10:09:25 AM

2 kudos

@Yousry Mohamed - could you please check the DBR runtime release notes for the Delta lake API compatibility matrix section ( DBR version vs Delta lake compatible version) for the mapping.Reference: https://docs.databricks.com/release-notes/runtime/r...

2 kudos

02-02-2023 10:09:25 AM

1 More Replies