Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Data + AI Summit 2024 - Data Engineering & Streaming

Forum Posts

DB_developer
by New Contributor III
  • 4820 Views
  • 3 replies
  • 7 kudos

Resolved! How are nulls stored in Delta Lake and Databricks?

In my findings, a lot of the Delta tables in the lakehouse are sparse, so I am wondering how much space the data lake takes to store null data; any suggestions for handling sparse tables in the lakehouse would also be appreciated. I also want to o...

Latest Reply
Hubert-Dudek
Esteemed Contributor III
  • 7 kudos

As Delta uses Parquet files to store data internally: "Nullity is encoded in the definition levels (which is run-length encoded). NULL values are not encoded in the data. For example, in a non-nested schema, a column with 1000 NULLs would be encoded...
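
A quick way to see this on your own data, as a minimal sketch (the table name sparse_demo is made up):

# Write a mostly-null column as Delta and inspect its size; the run-length-encoded
# definition levels mean the nulls themselves add almost no storage.
from pyspark.sql import functions as F

df = (spark.range(1000000)
      .withColumn("sparse_col", F.when(F.col("id") % 1000 == 0, F.col("id"))))  # ~99.9% nulls

df.write.format("delta").mode("overwrite").saveAsTable("sparse_demo")

spark.sql("DESCRIBE DETAIL sparse_demo").select("sizeInBytes", "numFiles").show()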

2 More Replies
Philblakeman
by New Contributor III
  • 4970 Views
  • 4 replies
  • 5 kudos

How to %run a list of notebooks in Databricks

I'd like to %run a list of notebooks from another Databricks notebook.
my_notebooks = ["./setup", "./do_the_main_thing", "./check_results"]
for notebook in my_notebooks:
    %run notebook
This doesn't work, of course. I don't want to use dbutils.notebook....

Latest Reply
Ajay-Pandey
Esteemed Contributor III
  • 5 kudos

Please refer to the code below:
import scala.concurrent.{Future, Await}
import scala.concurrent.duration._
import scala.util.control.NonFatal

case class NotebookData(path: String, timeout: Int, parameters: Map[String, String] = Map.empty[String, String])
...
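
If Python is an option, a minimal sketch of the same idea with dbutils.notebook.run (the notebook paths are the asker's; %run cannot be driven from a loop, so some form of dbutils.notebook.run is the usual workaround, with the caveat that each child notebook runs separately and its variables are not imported into the caller):

# Run each notebook in sequence; dbutils.notebook.run returns the exit value
# the child notebook passes to dbutils.notebook.exit (or an empty string).
my_notebooks = ["./setup", "./do_the_main_thing", "./check_results"]

for nb in my_notebooks:
    result = dbutils.notebook.run(nb, 600)  # 600-second timeout per notebook
    print(nb, "->", result)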

3 More Replies
brickster_2018
by Databricks Employee
  • 7371 Views
  • 2 replies
  • 2 kudos

Resolved! How to get the count of files/partition for a Delta table?

I have a Delta table and I run the OPTIMIZE command regularly. However, I still see a large number of files in the table. I wanted to get a breakdown of the files in each partition and identify which partition has more files. What is the easiest way to ge...

Latest Reply
brickster_2018
Databricks Employee
  • 2 kudos

The below code snippet will give details about the file count per partition:
import com.databricks.sql.transaction.tahoe.DeltaLog
import org.apache.hadoop.fs.Path

val deltaPath = "<table_path>"
val deltaLog = DeltaLog(spark, new Path(deltaPath + "/_d...
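
For a similar breakdown without touching the internal DeltaLog API, a minimal sketch in Python (assumes the table is named my_delta_table and partitioned by a column named date; adjust to your schema):

# Count distinct underlying files per partition value.
from pyspark.sql import functions as F

(spark.table("my_delta_table")
 .withColumn("file_path", F.input_file_name())
 .groupBy("date")
 .agg(F.countDistinct("file_path").alias("num_files"))
 .orderBy(F.desc("num_files"))
 .show())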

1 More Replies
Senthil1
by Contributor
  • 1101 Views
  • 1 reply
  • 0 kudos
Latest Reply
Ajay-Pandey
Esteemed Contributor III
  • 0 kudos

Hi @SENTHIL KUMARR MALLI SUDARSAN, the link below might help you: Link

164079
by Contributor II
  • 6056 Views
  • 14 replies
  • 2 kudos

Resolved! Terraform keeps showing changes for databricks_sql_permissions on plan and apply

Hi team, a very weird behaviour when using databricks_sql_permissions with Terraform: the changes keep showing up on plan and apply. They reappear even after I apply the changes... Please advise.

Latest Reply
Pat
Honored Contributor III
  • 2 kudos

Hi @Avi Edri, I can see from the screenshot that you are using id = "any file/"; it seems to be related to the import: https://registry.terraform.io/providers/databricks/databricks/0.5.3/docs/resources/sql_permissions#import
Can you try the below:
resourc...

13 More Replies
THIAM_HUATTAN
by Valued Contributor
  • 4926 Views
  • 6 replies
  • 5 kudos

Error in Databricks code?

https://www.databricks.com/notebooks/recitibikenycdraft/data-preparation.html
Could someone help to look at Step 3: Prepare Calendar Info?
# derive complete list of dates between first and last dates
dates = (spark.range(0, days_between).withCol...

Latest Reply
UmaMahesh1
Honored Contributor III
  • 5 kudos

Hi @THIAM HUAT TAN, in your notebook you are creating an integer days_between with the code
days_between = (last_date - first_date).days + 10
Logically speaking, what the notebook is trying to do is fetch all the dates between two dates to do a foreca...
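
For context, a self-contained sketch of the date-generation pattern that notebook uses (the first and last dates here are made up; the original adds extra days for its forecast horizon):

from datetime import date
from pyspark.sql import functions as F

first_date, last_date = date(2020, 1, 1), date(2020, 3, 1)
days_between = (last_date - first_date).days + 1  # +1 so the last date is included

# spark.range yields offsets 0..days_between-1; date_add shifts the start date by each offset
dates = (spark.range(0, days_between)
         .withColumn("date", F.expr(f"date_add(DATE'{first_date}', cast(id AS int))"))
         .drop("id"))
dates.show()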

5 More Replies
vr
by Contributor
  • 3779 Views
  • 7 replies
  • 7 kudos

Where can I report a problem on community.databricks.com?

I tried the contact details at the bottom, but they seem to be generic Databricks contact and support links. The issue I faced was this: I think this word made its way to the stop list by mistake.

[image: wrong stop word]
Latest Reply
Vartika
Databricks Employee
  • 7 kudos

Hey @Vladimir Ryabtsev and @Hubert Dudek, thank you for highlighting this. It seems the words were added to the block list in combination with other words. We will have this fixed as soon as possible. It's always great to have help from our community members....

6 More Replies
Rishabh-Pandey
by Esteemed Contributor
  • 2513 Views
  • 6 replies
  • 6 kudos

Delta Live Tables

If I have two stages, bronze and silver, and when I create Delta Live Tables we need to give the target schema to store the results, but I need to store tables in two databases, bronze AND silver. For this I need to create two different Delta Live tab...

Latest Reply
Geeta1
Valued Contributor
  • 6 kudos

Hi @Rishabh Pandey, yes, you have to create 2 DLT tables
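
For reference, a minimal sketch of what each pipeline's source notebook might contain (the table name and source path are made up; the bronze or silver target schema itself is set in each DLT pipeline's configuration, not in the code):

import dlt

@dlt.table(comment="Raw events landing in the bronze database")
def events_raw():
    # Hypothetical Auto Loader source
    return (spark.readStream.format("cloudFiles")
            .option("cloudFiles.format", "json")
            .load("/mnt/raw/events"))

The second pipeline would define its silver tables the same way, with its target set to the silver database.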

5 More Replies
LavaLiah_85929
by New Contributor II
  • 2053 Views
  • 2 replies
  • 1 kudos

Resolved! Log has failed integrity check error when altering a table property

Below is the integrity check error we are getting when trying to set the deletedRetentionFileDuration table property to 10 days. Observation: The table data is sitting in S3. The size of all the files in S3 is in TB. There are millions of files for t...

[image: integrity check error screenshot]
Latest Reply
Hubert-Dudek
Esteemed Contributor III
  • 1 kudos

Please back up your table, then run the file repair:
FSCK REPAIR TABLE table_name
You can also do a dry run first:
FSCK REPAIR TABLE table_name DRY RUN
If the data is partitioned, it can be helpful to refresh the metastore:
MSCK REPAIR TABLE mytable
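
As a minimal sketch, the same sequence from a notebook cell (table_name is the asker's placeholder):

# Dry run first: lists file entries missing from storage without changing the log.
spark.sql("FSCK REPAIR TABLE table_name DRY RUN").show(truncate=False)

# Then remove the dangling entries from the Delta transaction log.
spark.sql("FSCK REPAIR TABLE table_name")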

1 More Replies
Sreekanth1
by New Contributor II
  • 1337 Views
  • 2 replies
  • 0 kudos

How to pass job task parameters to another task in Scala

Hi Team, I have a requirement in a workflow job. The job has two tasks: one is a Python task and the other is a Scala task (each running on its own cluster). I have set a value with dbutils.jobs.taskValues in Python, but it cannot be read in Scala because o...

Latest Reply
Ajay-Pandey
Esteemed Contributor III
  • 0 kudos

Hi @Sreekanth Nallapa, please refer to this link; it might help you with this.
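
For anyone landing here later, a minimal sketch of the task-values API on the Python side (task and key names are made up; reading these values from a Scala task is exactly the limitation the question runs into):

# In the upstream Python task: publish a value for downstream tasks.
dbutils.jobs.taskValues.set(key="record_count", value=42)

# In a downstream Python task: read it back, addressed by the upstream task's name.
count = dbutils.jobs.taskValues.get(taskKey="python_task", key="record_count", default=0)
print(count)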

1 More Replies
ridrasura
by New Contributor III
  • 2402 Views
  • 1 reply
  • 5 kudos

Optimal Batch Size for Batch Insert Queries using JDBC for Delta Tables

Hi, I am currently experimenting with databricks-jdbc 2.6.29 and trying to execute batch insert queries. What is the optimal batch size recommended by Databricks for performing batch insert queries? Currently it seems that values are inserted row by r...

Latest Reply
ridrasura
New Contributor III
  • 5 kudos

Just an observation: by using the auto optimize table-level property, I was able to see batch inserts writing records into a single file. https://docs.databricks.com/optimizations/auto-optimize.html
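
For reference, a minimal sketch of enabling that behaviour on an existing table (my_table is a placeholder; the properties are the documented auto optimize settings):

# Optimized writes coalesce inserts into fewer, larger files;
# auto compaction rewrites small files after writes.
spark.sql("""
    ALTER TABLE my_table SET TBLPROPERTIES (
        'delta.autoOptimize.optimizeWrite' = 'true',
        'delta.autoOptimize.autoCompact' = 'true'
    )
""")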

BkP
by Contributor
  • 7028 Views
  • 14 replies
  • 9 kudos

Suggestion Needed for an Orchestrator/Scheduler to schedule and execute Jobs in an automated way

Hello Friends, we have an application which extracts data from various tables in Azure Databricks into Postgres tables (Postgres installed on top of Azure VMs). After extraction we apply transformations to those datasets in Postgres tabl...

Latest Reply
VaibB
Contributor
  • 9 kudos

You can leverage Airflow, which provides a connector for the Databricks Jobs API, or you can use Databricks Workflows to orchestrate your jobs, defining several tasks and setting dependencies accordingly.
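
A minimal sketch of the Airflow route (requires the apache-airflow-providers-databricks package; the DAG name, connection id, and job id are placeholders):

from datetime import datetime
from airflow import DAG
from airflow.providers.databricks.operators.databricks import DatabricksRunNowOperator

with DAG(
    dag_id="databricks_extract",
    start_date=datetime(2023, 1, 1),
    schedule_interval="@daily",  # "schedule" on newer Airflow versions
    catchup=False,
) as dag:
    # Triggers an existing Databricks job through the Jobs API.
    run_extract = DatabricksRunNowOperator(
        task_id="run_extract",
        databricks_conn_id="databricks_default",
        job_id=12345,
    )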

13 More Replies
nk76
by New Contributor III
  • 7396 Views
  • 7 replies
  • 5 kudos

Resolved! Custom library import fails randomly with error: not found: value it

Hello, I have an issue with the import of a custom library in Azure Databricks. Roughly 95% of the time it works fine, but sometimes it fails. I have searched the internet and this community with no luck so far. It is a Scala library in a Scala notebook,...

Latest Reply
Naskar
New Contributor II
  • 5 kudos

I also encountered the same error. While importing a file, I get the error "Import failed with error: Could not deserialize: Exceeded 16777216 bytes (current = 16778609)".

6 More Replies

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group