04-18-2024 04:09 AM
Hello:)
As part of migrating an app that previously ran directly on EMR to Databricks, we are running experiments on the 9.1 LTS runtime and getting the following error:
PythonException: An exception was thrown from a UDF: 'pyspark.serializers.SerializationError: Caused by Traceback (most recent call last):
File "/databricks/spark/python/pyspark/serializers.py", line 165, in _read_with_length
return self.loads(obj)
File "/databricks/spark/python/pyspark/serializers.py", line 469, in loads
return pickle.loads(obj, encoding=encoding)
File "<frozen importlib._bootstrap>", line 991, in _find_and_load
File "<frozen importlib._bootstrap>", line 975, in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 671, in _load_unlocked
File "<frozen importlib._bootstrap_external>", line 844, in exec_module
File "<frozen importlib._bootstrap_external>", line 980, in get_code
File "<frozen importlib._bootstrap_external>", line 1038, in get_data
OSError: [Errno 78] Remote address changed'. Full traceback below:
Traceback (most recent call last):
File "/databricks/spark/python/pyspark/serializers.py", line 165, in _read_with_length
return self.loads(obj)
File "/databricks/spark/python/pyspark/serializers.py", line 469, in loads
return pickle.loads(obj, encoding=encoding)
File "<frozen importlib._bootstrap>", line 991, in _find_and_load
File "<frozen importlib._bootstrap>", line 975, in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 671, in _load_unlocked
File "<frozen importlib._bootstrap_external>", line 844, in exec_module
File "<frozen importlib._bootstrap_external>", line 980, in get_code
File "<frozen importlib._bootstrap_external>", line 1038, in get_data
OSError: [Errno 78] Remote address changed
The same UDF works on EMR and is part of the codebase that Databricks uses as its Git source.
Has anyone encountered this error?
We would appreciate any advice on troubleshooting it. Thanks!
04-18-2024 03:46 PM
@liormayn - could you please let us know if you had a chance to run it on DBR 10.4 LTS?
04-21-2024 12:24 AM
Yes :) @shan_chandra, this is what happened when I tried 10.4 LTS:
https://community.databricks.com/t5/community-discussions/error-while-encoding-java-lang-runtimeexce...
05-01-2024 01:40 AM
Hi @liormayn ,
Are you still facing the issue? It was first seen in mid-March and has since been fixed. It can happen with some pip installs when the libraries are in the Workspace. If you are still facing it, I would suggest creating a support ticket, as it may need a more involved investigation.
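For anyone who wants to check where a module is actually being loaded from, here is a minimal sketch in plain Python. It is only a diagnostic idea, not an official procedure: the helper name `describe_module` is hypothetical, and the `/Workspace` prefix is an assumption about where Workspace files are mounted. The traceback in the original post fails while reading a module's source file during import, so the sketch simply resolves the module and tries to read its file.

```python
import importlib.util


def describe_module(name, workspace_prefix="/Workspace"):
    """Report where `name` would be imported from and whether its file is readable.

    `workspace_prefix` is an assumed mount point for Workspace files.
    """
    try:
        spec = importlib.util.find_spec(name)
    except (ImportError, ValueError):
        # Raised e.g. when a parent package of a dotted name is missing.
        spec = None

    origin = spec.origin if spec is not None else None
    has_file = origin is not None and origin not in ("built-in", "frozen")

    readable = False
    if has_file:
        try:
            # Reading one byte is enough to surface an OSError from the filesystem.
            with open(origin, "rb") as fh:
                fh.read(1)
            readable = True
        except OSError:
            readable = False

    return {
        "module": name,
        "origin": origin,
        "in_workspace": bool(has_file and origin.startswith(workspace_prefix)),
        "readable": readable,
    }


if __name__ == "__main__":
    # Replace "json" with the module your UDF imports.
    print(describe_module("json"))
```

Run on the driver, this only describes the driver's view. Calling the same function from inside a UDF would, in principle, show what the executors see, which is where the error in this thread was raised.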
05-02-2024 06:33 AM
Hey @NandiniN 🙂 The error has stopped happening for now, but we are not feeling "safe" yet.
Could you tell me when the fix was published, so we can try to pinpoint whether the fix is what solved it?
05-05-2024 10:15 AM
Hi @liormayn ,
I understand. I see the fix went out on 20 March 2024; you would have to restart the clusters.
Thanks!