cancel
Showing results for 
Search instead for 
Did you mean: 
Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
cancel
Showing results for 
Search instead for 
Did you mean: 

Performance issue while calling Sagemaker Endpoint in pyspark udf

sanjay
Valued Contributor II

Hi,

I have pyspark dataframe which calls pyspark udf which in turn calls sagemaker endpoint. But when dataframe has more rows, endpoint start failing. Also it takes longer to process.

Please suggest how to call sagemaker endpoint from pyspark.

Regards,

Sanjay

1 REPLY 1

sanjay
Valued Contributor II

Thank you @Retired_mod for prompt response. 

"send batch to sagemaker", you mean sending multiple data records in every sagemaker call? As sagemaker has 60 second timeout, will this not timeout the request if there are multiple records in single call

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group