Topics with Label: Databricks Cluster

Forum Posts

Sorted by:

by pokus • New Contributor III

03-21-2023 2:23:27 AM

9546 Views
3 replies
2 kudos

Resolved! use DeltaLog class in databricks cluster

I need to use DeltaLog class in the code to get the AddFiles dataset. I have to keep the implemented code in a repo and run it in databricks cluster. Some docs say to use org.apache.spark.sql.delta.DeltaLog class, but it seems databricks gets rid of ...

Data Engineering

9546 Views
3 replies
2 kudos

03-21-2023 2:23:27 AM

View Replies

Latest Reply

NandiniN
Databricks Employee

a month ago

2 kudos

Hi @pokus , You don't need to access via reflection. You can Access DeltaLog with spark._jvm:Unity Catalog and DeltaLake tables expose their metadata and transaction log via the JVM backend. Using spark._jvm, you can interact with DeltaLog Thanks!

2 kudos

a month ago

2 More Replies

by Direo • Contributor II

03-22-2023 1:35:47 AM

35647 Views
6 replies
1 kudos

Resolved! Importing CA certificate into a Databricks cluster

Hi!I was following guide outlined here:https://kb.databricks.com/en_US/python/import-custom-ca-cert(also tried this: https://stackoverflow.com/questions/73043589/configuring-tls-ca-on-databricks)to add ca root certificate into Databricks cluster, but...

Data Engineering

35647 Views
6 replies
1 kudos

03-22-2023 1:35:47 AM

View Replies

Latest Reply

jash281098
New Contributor II

05-11-2025 9:31:47 AM

1 kudos

@Debayan One question - Will same approach work for JKS file containing private key certificate for X.509 authentication to Mongo Atlas database.Usual way of adding below spark config's is not working. spark.driver.extraJavaOptions -Djavax.net.ssl.ke...

1 kudos

05-11-2025 9:31:47 AM

5 More Replies

by vinaykumar • New Contributor III

05-09-2023 5:23:39 AM

10686 Views
7 replies
6 kudos

Reading Iceberg table present in S3 from databricks console using spark given none error .

Hi Team , I am facing issue while reading iceberg table from S3 and getting none error when read the data . below steps I followed .Added Iceberg Spark connector library to your Databricks cluster. 2. Cluster Configuration to Enable Iceberg ...

Data Engineering

10686 Views
7 replies
6 kudos

05-09-2023 5:23:39 AM

View Replies

Latest Reply

jayKumar
New Contributor II

04-07-2025 1:28:56 PM

6 kudos

Did anyone find solution to use unity catalog and Iceberg table in databricks?

6 kudos

04-07-2025 1:28:56 PM

6 More Replies

by negrinij • New Contributor II

05-21-2023 8:39:13 PM

38133 Views
4 replies
2 kudos

Understanding Used Memory in Databricks Cluster

Hello, I wonder if anyone could give me any insights regarding used memory and how could I change my code to "release" some memory as the code runs. I am using a Databricks Notebook.Basically, what we need to do is perform a query, create a spark sql...

Data Engineering

38133 Views
4 replies
2 kudos

05-21-2023 8:39:13 PM

View Replies

Latest Reply

loic
Contributor

02-04-2025 2:13:11 AM

2 kudos

I have exactly the same kind of problem.I really do not understand why my driver goes out of memory meanwhile I do not cache anything in Spark.Since I don't cache anything, I expect references to objects that are not used anymore to be freed.Even a s...

2 kudos

02-04-2025 2:13:11 AM

3 More Replies

by michaelh • New Contributor III

03-10-2022 3:28:55 AM

5369 Views
5 replies
4 kudos

Resolved! AWS Databricks Cluster terminated.Reason:Container launch failure

We're developing custom runtime for databricks cluster. We need to version and archive our clusters for client. We made it run successfully in our own environment but we're not able to make it work in client's environment. It's large corporation with...

Data Engineering

5369 Views
5 replies
4 kudos

03-10-2022 3:28:55 AM

View Replies

Latest Reply

NandiniN
Databricks Employee

01-18-2025 9:52:50 PM

4 kudos

This appears to be an issue with the security group. Kindly review security group inbound/outbound rules.

4 kudos

01-18-2025 9:52:50 PM

4 More Replies

by naveenreddy1 • New Contributor II

11-21-2019 8:40:58 PM

19715 Views
4 replies
0 kudos

Reason: Remote RPC client disassociated. Likely due to containers exceeding thresholds, or network issues. Check driver logs for WARN messages. Driver stacktrace

We are using the databricks 3 node cluster with 32 GB memory. It is working fine but some times it automatically throwing the error: Remote RPC client disassociated. Likely due to containers exceeding thresholds, or network issues.

Data Engineering

19715 Views
4 replies
0 kudos

11-21-2019 8:40:58 PM

View Replies

Latest Reply

RodrigoDe_Freit
New Contributor II

12-10-2019 11:55:58 AM

0 kudos

If your job fails follow this:According to https://docs.databricks.com/jobs.html#jar-job-tips: "Job output, such as log output emitted to stdout, is subject to a 20MB size limit. If the total output has a larger size, the run will be canceled and ma...

0 kudos

12-10-2019 11:55:58 AM

3 More Replies

by Confused • New Contributor III

04-04-2022 3:57:50 AM

54798 Views
6 replies
3 kudos

Resolved! Configuring pip index-url and using artifacts-keyring

Hi I would like to use the azure artifact feed as my default index-url when doing a pip install on a Databricks cluster. I understand I can achieve this by updating the pip.conf file with my artifact feed as the index-url. Does anyone know where i...

Data Engineering

54798 Views
6 replies
3 kudos

04-04-2022 3:57:50 AM

View Replies

Latest Reply

murtazahzaveri
New Contributor II

10-24-2024 2:28:17 PM

3 kudos

For Authentication you can provide below config on cluster's Spark Environment Variables,PIP_EXTRA_INDEX_URL=https://username:password@pkgs.sample.com/sample/_packaging/artifactory_name/pypi/simple/.Also, you can store the value in Databricks secret

3 kudos

10-24-2024 2:28:17 PM

5 More Replies

by ae20cg • New Contributor III

10-19-2022 2:59:24 PM

5765 Views
5 replies
9 kudos

Databricks Cluster Web terminal different permissions with tmux and xterm.

I am launching web terminal on my databricks cluster and when I am using the ephemeral xterm instance I am easily able to navigate to desired directory in `Workspace` and run anything... for example `ls ./` When I switch to tmux so that I can preserv...

Data Engineering

5765 Views
5 replies
9 kudos

10-19-2022 2:59:24 PM

View Replies

Latest Reply

alenka
New Contributor III

07-31-2023 7:56:28 AM

9 kudos

Hey there, fellow data explorer pals! I totally get your excitement when launching that web terminal on your Databricks cluster and feeling the power of running commands like 'ls ./' in the ephemeral xterm instance. It's like traversing the vast univ...

9 kudos

07-31-2023 7:56:28 AM

4 More Replies

by RajeshRK • Contributor II

01-06-2022 8:14:40 AM

10272 Views
3 replies
0 kudos

Need help to analyze databricks logs for a long-running job.

Hi Team,We have a job it completes in 3 minutes in one Databricks cluster, if we run the same job in another databricks cluster it is taking 3 hours to complete.I am quite new to Databricks and need your guidance on how to find out where databricks s...

Data Engineering

10272 Views
3 replies
0 kudos

01-06-2022 8:14:40 AM

View Replies

Latest Reply

AmitKP
New Contributor II

04-07-2024 12:53:50 PM

0 kudos

Hi @Retired_mod ,I am saving logs of my databricks Job Compute From ADF, How can i open those files that present in dbfs location.

0 kudos

04-07-2024 12:53:50 PM

2 More Replies

by Rajaniesh • New Contributor III

06-22-2023 9:45:40 PM

3578 Views
2 replies
1 kudos

URGENT HELP NEEDED: Python functions deployed in the cluster throwing the error

Hi,I have created a python wheel with the following code. And the package name is rule_engine"""The entry point of the Python Wheel"""import sysfrom pyspark.sql.functions import expr, coldef get_rules(tag): """ loads data quality rules from a table ...

Data Engineering

3578 Views
2 replies
1 kudos

06-22-2023 9:45:40 PM

View Replies

Latest Reply

jose_gonzalez
Databricks Employee

12-12-2023 11:25:09 AM

1 kudos

You can find more details and examples here https://docs.databricks.com/en/workflows/jobs/how-to/use-python-wheels-in-workflows.html#use-a-python-wheel-in-a-databricks-job

1 kudos

12-12-2023 11:25:09 AM

1 More Replies

by dukebaslangic • New Contributor II

06-15-2023 11:07:24 PM

2945 Views
3 replies
3 kudos

Resolved! Databricks performance related documentation/books

Hi,Do you know any good resources about Databricks performance improvements(like improving query performances, monitoring/resolving performance bottlenecks etc)?Thanks

Data Engineering

2945 Views
3 replies
3 kudos

06-15-2023 11:07:24 PM

View Replies

Latest Reply

Anonymous
Not applicable

06-17-2023 12:18:26 AM

3 kudos

Hi @Ömer Özsakarya We haven't heard from you since the last response from @Lakshay Goel , and I was checking back to see if her suggestions helped you.Or else, If you have any solution, please share it with the community, as it can be helpful to ...

3 kudos

06-17-2023 12:18:26 AM

2 More Replies

by Kaijser • New Contributor II

06-14-2023 2:52:03 AM

2687 Views
1 replies
2 kudos

Installing private python Azure DevOps repository without revealing personal access token in pyproject.toml

I want to install a .whl file on my Databricks cluster which includes a private Azure DevOps repository as a dependency in its pyproject.toml file, i.e.:[project] name = "test" description = "test_description." version = "0.1.0" authors = [ { name ...

Data Engineering

2687 Views
1 replies
2 kudos

06-14-2023 2:52:03 AM

View Replies

Latest Reply

Anonymous
Not applicable

06-15-2023 11:50:22 PM

2 kudos

Hi @Aaron Kaijser Great to meet you, and thanks for your question! Let's see if your peers in the community have an answer to your question. Thanks.

2 kudos

06-15-2023 11:50:22 PM

by Phani1 • Valued Contributor II

04-14-2023 1:14:27 AM

3935 Views
2 replies
1 kudos

Integration Dolly with Databricks

Hi Databricks Team,Could you please share any links /docs/Sample notebooks to integrate Dolly with Databricks, our aim is to generate SQL queries based on the free text and execute it via databricks cluster/SQL warehouse.

Data Engineering

3935 Views
2 replies
1 kudos

04-14-2023 1:14:27 AM

View Replies

Latest Reply

sean_owen
Databricks Employee

06-02-2023 11:48:09 AM

1 kudos

https://www.dbdemos.ai/demo.html?demoName=llm-dolly-chatbot is a good demonstration of Dolly (or really any LLM) for question answering. LLMs like this are not for SQL generation, but other LLMs are, like starcoderbase

1 kudos

06-02-2023 11:48:09 AM

1 More Replies

by ros • New Contributor III

05-15-2023 11:39:42 PM

3533 Views
2 replies
3 kudos

Apache Hudi Table creation using hudi maven library

I installed hudi maven library org.apache.hudi:hudi-spark3.3-bundle_2.12:0.13.0 in Dbricks Runtime Ver : 12.2 LTS (includes Apache Spark 3.3.2, Scala 2.12) with spark config :spark.sql.catalog.spark_catalog org.apache.spark.sql.hudi.catalog.HoodieCat...

Data Engineering

3533 Views
2 replies
3 kudos

05-15-2023 11:39:42 PM

View Replies

Latest Reply

ros
New Contributor III

05-31-2023 12:51:14 AM

3 kudos

@Shanmugavel Chandrakasu %sql create table hudi_cow_pt_tbl ( id bigint, name string, ts bigint, dt string, hh string ) using hudi tblproperties ( type = 'cow', primaryKey = 'id', preCombineField = 'ts' ) partitioned by (dt, hh) location '/mnt/data/h...

3 kudos

05-31-2023 12:51:14 AM

1 More Replies

by de-hru • New Contributor III

05-03-2023 5:26:09 AM

29474 Views
4 replies
1 kudos

Resolved! How to add pre-commit hook to the Git Client on Databricks Cluster?

I'd like to add a Git pre-commit hook to the Databricks Cluster.This pre-commit hook should be executed when pushing to GitHub.Why would I need a pre-commit hook on a Databricks Cluster?My goal is to run blackbricks and format all notebooks automatic...

Data Engineering

29474 Views
4 replies
1 kudos

05-03-2023 5:26:09 AM

View Replies

Latest Reply

Anonymous
Not applicable

05-19-2023 12:45:40 AM

1 kudos

Hi @Dejan Hrubenja Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Tha...

1 kudos

05-19-2023 12:45:40 AM

3 More Replies