Machine Learning

Forum Posts

by sanjay • Valued Contributor II

02-09-2023 7:25:49 AM

36945 Views
2 replies
1 kudos

Resolved! torch.cuda.OutOfMemoryError: CUDA out of memory

Hi, I am using the pynote/whisper large model and trying to process data using a Spark UDF, and I am getting the following error: torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 172.00 MiB (GPU 0; 14.76 GiB total capacity; 6.07 GiB already allocated...

Latest Reply

JMTech18
New Contributor II

09-23-2024 4:34:40 AM

1 kudos

Try running this code:

import torch
torch.cuda.empty_cache()

Also make sure to find the optimal batch size; otherwise the error can occur again.
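The batch-size advice in this reply can be sketched as a retry loop that halves the batch whenever memory runs out. This is only an illustration under stated assumptions: `process_in_batches`, `process_batch`, and `OOM_ERRORS` are hypothetical names, not part of PyTorch or of the original poster's code, and `MemoryError` is included only so the sketch also runs on a machine without a GPU.

```python
# Sketch: halve the batch size whenever a batch runs out of memory.
# All function and variable names here are hypothetical.
try:
    import torch
    # Newer PyTorch releases expose torch.cuda.OutOfMemoryError;
    # fall back to RuntimeError if the attribute is missing.
    OOM_ERRORS = (getattr(torch.cuda, "OutOfMemoryError", RuntimeError), MemoryError)
except ImportError:  # lets the sketch run where PyTorch is not installed
    torch = None
    OOM_ERRORS = (MemoryError,)


def process_in_batches(items, process_batch, batch_size):
    """Apply process_batch to items in slices, shrinking the slice on OOM.

    Returns the collected results and the batch size that finally worked.
    """
    results = []
    i = 0
    while i < len(items):
        batch = items[i:i + batch_size]
        try:
            results.extend(process_batch(batch))
            i += len(batch)
        except OOM_ERRORS:
            # Release cached GPU blocks before retrying with a smaller batch.
            if torch is not None and torch.cuda.is_available():
                torch.cuda.empty_cache()
            if batch_size == 1:
                raise  # even a single item does not fit; give up
            batch_size = max(1, batch_size // 2)
    return results, batch_size
```

In a real job, `process_batch` would wrap the model call. The batch size returned by a trial run can then be used as the fixed starting value.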

by elgeo • Valued Contributor II

11-15-2022 3:44:52 AM

3362 Views
3 replies
2 kudos

Table name as a parameter in SQL UDF

Hello experts, We would like to create a UDF that takes a table name as an input parameter. Please check the simple example below: CREATE OR REPLACE FUNCTION F_NAME(v_table_name STRING, v_w...

Latest Reply

alm
New Contributor III

05-06-2024 4:42:39 AM

2 kudos

Did you find a solution? I'm having the same problem.

by Hubert-Dudek • Esteemed Contributor III

02-27-2023 4:54:51 AM

1305 Views
1 reply
5 kudos

Databricks has introduced new functionality for serving machine learning models through a serverless REST API, enabling the consumption of models outs...

Databricks has introduced new functionality for serving machine learning models through a serverless REST API, enabling the consumption of models outside of Databricks. While serving the model via REST API is ideal for external use cases, it is recom...

Latest Reply

jose_gonzalez
Databricks Employee

03-07-2023 10:25:05 AM

5 kudos

Thank you for sharing this!!!

by anvil • New Contributor II

01-24-2023 1:14:46 PM

3065 Views
3 replies
4 kudos

Are UDFs necessary for applying models from ML libraries at scale?

Hello, I recently finished the "Scalable Machine Learning with Apache Spark" course and saw that scikit-learn models could be applied faster in a distributed manner when used in pandas UDFs or with the mapInPandas() method. Spark MLlib models don't need this k...

Latest Reply

Manoj12421
Valued Contributor II

02-08-2023 11:17:49 AM

4 kudos

MLlib is in maintenance mode, and in most cases a UDF is not needed to create a model.

by Vicky1215 • New Contributor II

01-13-2023 7:13:59 AM

6757 Views
6 replies
2 kudos

Solution for - "PythonException: 'ModuleNotFoundError: No module named 'spacy'

I am trying to extract the adjective and noun phrases from the text column of a Spark DataFrame. I have written a UDF for this and am applying it to the cleaned text column. However, I am getting this error.

from pyspark.sql.functions import udf
from pys...

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

01-13-2023 11:44:04 PM

2 kudos

Only an init script will work here.
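A ModuleNotFoundError raised inside a PythonException means the Python process that runs the UDF cannot import the package, even if the notebook can. Before changing the cluster setup, it may help to confirm which packages are actually missing. The following is a minimal sketch using only the standard library; `missing_modules` is a hypothetical helper name, and it is meant for top-level package names only.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be imported in this process."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Example: check on the driver. To check the workers instead, the same call
# could be made from inside the function that Spark executes (an assumption
# about how the check would be wired in, not something shown in the thread).
print(missing_modules(["json", "spacy"]))
```

If the list is empty on the driver but the UDF still fails, the package is likely installed only on the driver, which is consistent with the suggestion to install it through an init script.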

by Dan_Z • Databricks Employee

10-22-2021 9:06:35 AM

731 Views
0 replies
0 kudos

spark.apache.org

mapInPandas is one of the most powerful Spark functions. It uses an Arrow-like in-memory data structure to split Spark DataFrames into chunks and feed them to a function that takes a pandas DataFrame as input and returns one as output. Check it out here: https://spa...
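The chunk-by-chunk behaviour described above can be sketched with plain pandas. The function below has the shape that mapInPandas expects: it receives an iterator of pandas DataFrames and yields pandas DataFrames. The column names `text` and `text_length` and the function name `add_text_length` are assumptions made for illustration.

```python
from typing import Iterator

import pandas as pd


def add_text_length(batches: Iterator[pd.DataFrame]) -> Iterator[pd.DataFrame]:
    """Process one pandas chunk at a time and yield the transformed chunk."""
    for pdf in batches:
        out = pdf.copy()
        # Any pandas logic can go here, for example a model's predict() call.
        out["text_length"] = out["text"].str.len()
        yield out


# With a Spark DataFrame `df` that has a string column `text` (assumed),
# the call would look like this:
# result = df.mapInPandas(add_text_length, schema="text string, text_length long")
```

Because the function only depends on pandas, it can be tried locally on a list of small DataFrames before it is handed to Spark.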

Databricks Community
