03-13-2023 10:45 AM
Hi,
I have a UDF that runs on each row of a Spark DataFrame, does some complex processing, and returns a string. It takes a very long time even though the data is only 15,000 rows. I have configured the cluster with autoscaling, but it is not spinning up more servers.
Please suggest how to make the UDF faster, or point me to any reference implementations.
Regards,
Sanjay
Labels: PySpark UDF