Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

Serhii
by Contributor
  • 1777 Views
  • 3 replies
  • 1 kudos

Could not launch jobs due to node_type_id (instance) unavailability

I am running an hourly job on a cluster using a p3.2xlarge GPU instance, but sometimes the cluster can't start due to instance unavailability. I wonder if there is any fallback mechanism to, for example, try a different instance type if one is not availabl...

Latest Reply
abagshaw
New Contributor III
  • 1 kudos

(AWS only) For anyone experiencing capacity-related cluster launch failures on non-GPU instance types, AWS Fleet instance types are now GA and available for clusters and instance pools. They help improve the chance of a successful cluster launch by allowi...

2 More Replies
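Since the reply above notes that Fleet instance types cover only non-GPU nodes, a do-it-yourself fallback for GPU jobs is to retry cluster creation with the next instance type in a preferred list. The Python sketch below illustrates that idea against the standard Clusters REST API; it is untested, and the workspace host/token environment variables, the instance-type list, and the spark_version string are assumptions for illustration rather than a built-in Databricks feature.

import os
import time
from typing import Optional

import requests

HOST = os.environ["DATABRICKS_HOST"]    # e.g. "https://<workspace>.cloud.databricks.com"
TOKEN = os.environ["DATABRICKS_TOKEN"]  # a personal access token
HEADERS = {"Authorization": f"Bearer {TOKEN}"}

# Instance types to try, in order of preference (illustrative values).
FALLBACK_NODE_TYPES = ["p3.2xlarge", "g5.2xlarge", "g4dn.2xlarge"]

def try_launch(node_type_id: str) -> Optional[str]:
    """Create a cluster with the given node type; return its ID once RUNNING, or None on failure."""
    spec = {
        "cluster_name": "hourly-gpu-job-cluster",
        "spark_version": "13.3.x-gpu-ml-scala2.12",  # assumed GPU ML runtime string
        "node_type_id": node_type_id,
        "num_workers": 1,
        "autotermination_minutes": 60,
    }
    resp = requests.post(f"{HOST}/api/2.0/clusters/create", headers=HEADERS, json=spec)
    resp.raise_for_status()
    cluster_id = resp.json()["cluster_id"]
    while True:
        state = requests.get(
            f"{HOST}/api/2.0/clusters/get",
            headers=HEADERS,
            params={"cluster_id": cluster_id},
        ).json()["state"]
        if state == "RUNNING":
            return cluster_id
        if state in ("TERMINATED", "TERMINATING", "ERROR"):
            return None  # launch failed, e.g. due to instance unavailability
        time.sleep(30)

def launch_with_fallback() -> str:
    """Walk the fallback list until one instance type launches successfully."""
    for node_type in FALLBACK_NODE_TYPES:
        cluster_id = try_launch(node_type)
        if cluster_id is not None:
            print(f"Cluster {cluster_id} is running on {node_type}")
            return cluster_id
        print(f"{node_type} could not be launched, trying the next instance type...")
    raise RuntimeError("None of the fallback instance types could be launched")

The hourly job could then be submitted against the returned cluster_id, or the same spec could be embedded as the new_cluster block of a job definition with the retry handled by an external scheduler.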
darkraisisi
by New Contributor
  • 886 Views
  • 0 replies
  • 0 kudos

Is there a way to manually update the required CUDA files in the DB runtime?

Is there a way to manually update the required CUDA files in the DB runtime? There are some rather annoying bugs still in TF 2.11 that have been fixed in TF 2.12. Sadly, the latest DB runtime 13.1 (beta) only supports the older TF 2.11 even though 2.12 was ...

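No one has replied here, but a common first attempt (which does not change the CUDA/cuDNN libraries baked into the runtime image) is to install the newer TensorFlow wheel as a notebook-scoped library and check whether the runtime's existing CUDA stack satisfies it. A rough sketch, with the pinned version chosen only for illustration:

# Cell 1: install a newer TensorFlow on top of the runtime (notebook-scoped).
%pip install tensorflow==2.12.0

# Cell 2: restart Python so the new wheel is picked up.
dbutils.library.restartPython()

# Cell 3: verify the version and whether the GPU is still visible to TF.
import tensorflow as tf
print(tf.__version__)
print(tf.config.list_physical_devices("GPU"))  # an empty list suggests a CUDA/cuDNN mismatch

If the GPU list comes back empty, the bundled CUDA/cuDNN versions likely do not meet TF 2.12's requirements, and a custom Docker container via Databricks Container Services may be the more reliable route.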
zzy
by New Contributor III
  • 2050 Views
  • 3 replies
  • 2 kudos

Why is the PyTorch CUDA total memory not aligned with the memory size of the GPU cluster I created?

No matter what size of GPU cluster I create, the CUDA total capacity is always ~16 GB. Does anyone know what the issue is? The code I use to get the total capacity: torch.cuda.get_device_properties(0).total_memory

Latest Reply
Anonymous
Not applicable
  • 2 kudos

Hi @Simon Zhang, hope everything is going great. Just wanted to check in to see if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us so w...

2 More Replies
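A plausible explanation for the ~16 GB figure (offered as a guess, since the resolution above is truncated): p3-family instances use 16 GB V100 GPUs, and torch.cuda.get_device_properties(0).total_memory reports only device 0 on the node running the code, so larger p3 sizes add more 16 GB devices and more workers rather than one bigger device. A quick sketch to inspect every GPU visible to the current node:

import torch

# List every CUDA device on this node, not just device 0.
for i in range(torch.cuda.device_count()):
    props = torch.cuda.get_device_properties(i)
    print(f"GPU {i}: {props.name}, {props.total_memory / 1024**3:.1f} GiB")

# Aggregate memory across the devices on this node.
total = sum(
    torch.cuda.get_device_properties(i).total_memory
    for i in range(torch.cuda.device_count())
)
print(f"Total across {torch.cuda.device_count()} device(s): {total / 1024**3:.1f} GiB")

On a multi-node cluster, each worker has its own GPUs, so the driver-side total will never reflect the whole cluster's GPU memory.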
jamesw
by New Contributor II
  • 2442 Views
  • 1 reply
  • 1 kudos

Ganglia not working with custom container services

Setup: custom Docker container starting from the "databricksruntime/gpu-conda:cuda11" base image layer; 10.4 LTS (includes Apache Spark 3.2.1, Scala 2.12); multi-node, p3.8xlarge GPU compute. When I try to view Ganglia metrics I am met with "502 Bad Gatewa...

Latest Reply
Vivian_Wilfred
Databricks Employee
  • 1 kudos

Hi @James W, Ganglia is not available for custom Docker containers by default. This is a known limitation. However, you can try this experimental support for Ganglia in custom DCS: https://github.com/databricks/containers/tree/master/experimental/ub...

vishallakha
by New Contributor II
  • 1160 Views
  • 1 reply
  • 2 kudos

How to Enable Files in Repos in DBR 7.3 LTS ML?

We need a custom version of a GPU cluster with the following requirements for a certain project: Ubuntu 18.04; CUDA 10.1; Tesla T4 GPU; availability of the /Workspace/Repos folder. All of these requirements are available with DBR ML 7.3 LTS. But one critical compo...

Latest Reply
Debayan
Databricks Employee
  • 2 kudos

Hi, to work with non-notebook files in Databricks Repos, you must be running Databricks Runtime 8.4 or above: https://docs.databricks.com/files/workspace.html#configure-support-for-workspace-files

VictorP
by New Contributor
  • 1690 Views
  • 1 reply
  • 3 kudos

Resolved! Does Databricks run on GPU?

Does Databricks run on GPU?

Latest Reply
ron_defreitas
Contributor
  • 3 kudos

There is support for running on GPU, which will be beneficial to certain ML workloads. Clusters are configured to run on CPU by default, but you can choose GPU-based nodes during creation.

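To make the reply above a little more concrete, this is roughly what choosing GPU-based nodes looks like when a cluster is defined programmatically. The runtime string and instance types are illustrative assumptions (AWS naming); in the UI the equivalent choice is a GPU-enabled ML runtime plus GPU worker and driver types.

# Illustrative cluster definition with GPU nodes (example values, not recommendations).
gpu_cluster_spec = {
    "cluster_name": "gpu-ml-cluster",
    "spark_version": "13.3.x-gpu-ml-scala2.12",  # a GPU-enabled ML runtime (assumed version)
    "node_type_id": "g4dn.xlarge",               # GPU worker instance type (AWS example)
    "driver_node_type_id": "g4dn.xlarge",        # GPU driver instance type
    "num_workers": 2,
    "autotermination_minutes": 60,
}

# This dict can be POSTed to /api/2.0/clusters/create or used as the
# new_cluster block of a job definition.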
Anonymous
by Not applicable
  • 3028 Views
  • 4 replies
  • 2 kudos

Resolved! Anyone using RAPIDS and cuGraph on a current runtime?

We're in the process of migrating a large graph computation workload to NVIDIA RAPIDS + cuGraph for GPU acceleration. The package isn't part of the base runtime and is available through conda package management only, so it can't be installed via init sc...

Latest Reply
Anonymous
Not applicable
  • 2 kudos

Thanks @Prabakar Ammeappin, we're looking at this. Strangely, the last commit removed the RAPIDS libraries from the base cuda-images. We're adding them back in.

3 More Replies