02-22-2024 11:38 AM
I am trying to run the following chunk of code in a single cell of a Databricks notebook (Databricks Runtime 14.3 LTS, Apache Spark 3.5.0, Scala 2.12):
spark.sql("CREATE OR REPLACE table sample_catalog.sample_schema.sample_table_tmp AS SELECT * FROM sample_catalog.sample_schema.sample_table")
df = spark.sql("SELECT * FROM sample_catalog.sample_schema.sample_table_tmp")
if df is not None:
if not df.isEmpty():
display(df)
del(df)
spark.sql("DROP TABLE IF EXISTS sample_catalog.sample_schema.sample_table_tmp")
The display() call runs successfully, but the code then errors out on the last line, when it tries to drop the table, with the following message:
2024-02-22 17:12:10,508 34507 ERROR _handle_rpc_error GRPC Error received
Traceback (most recent call last):
File "/databricks/spark/python/pyspark/sql/connect/client/core.py", line 1312, in _analyze
resp = self._stub.AnalyzePlan(req, metadata=self._builder.metadata())
File "/databricks/python/lib/python3.10/site-packages/grpc/_channel.py", line 946, in __call__
return _end_unary_response_blocking(state, call, False, None)
File "/databricks/python/lib/python3.10/site-packages/grpc/_channel.py", line 849, in _end_unary_response_blocking
raise _InactiveRpcError(state)
grpc._channel._InactiveRpcError: <_InactiveRpcError of RPC that terminated with:
status = StatusCode.INTERNAL
details = "[TABLE_OR_VIEW_NOT_FOUND] The table or view `sample_catalog`.`sample_schema`.`sample_table_tmp` cannot be found. Verify the spelling and correctness of the schema and catalog.
If you did not qualify the name with a schema, verify the current_schema() output, or qualify the name with the correct schema and catalog.
To tolerate the error on drop use DROP VIEW IF EXISTS or DROP TABLE IF EXISTS. SQLSTATE: 42P01; line 1 pos 14;
'Project [*]
+- 'UnresolvedRelation [sample_catalog, sample_schema, sample_table_tmp], [], false
"
debug_error_string = "UNKNOWN:Error received from peer unix:/databricks/sparkconnect/grpc.sock {created_time:"2024-02-22T17:12:10.508374255+00:00", grpc_status:13, grpc_message:"[TABLE_OR_VIEW_NOT_FOUND] The table or view `sample_catalog`.`sample_schema`.`sample_table_tmp` cannot be found. Verify the spelling and correctness of the schema and catalog.\nIf you did not qualify the name with a schema, verify the current_schema() output, or qualify the name with the correct schema and catalog.\nTo tolerate the error on drop use DROP VIEW IF EXISTS or DROP TABLE IF EXISTS. SQLSTATE: 42P01; line 1 pos 14;\n\'Project [*]\n+- \'UnresolvedRelation [sample_catalog, sample_schema, sample_table_tmp], [], false\n"}"
It's not clear to me that the error message accurately describes the problem here:
1) The table in question does not appear to be missing: the display() call immediately before it worked fine, so the table was clearly visible;
2) the last statement uses "DROP TABLE IF EXISTS", not a bare "DROP TABLE", which is exactly the form the error message itself recommends for tolerating a missing table; and
3) the exact same code works fine when I move the last line (the spark.sql("DROP TABLE ...") call) into a subsequent cell and run the two cells consecutively.
I'm wondering if something happening behind the scenes with data distribution is complicating things. Could anyone please explain what is causing this error?
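In case it is relevant, one workaround I could try (an untested sketch, assuming the drop is somehow racing the still-lazy SELECT under Spark Connect) would be to pull the result fully down to the client before issuing the drop:

# Untested sketch: fully materialize the result on the client first, so
# nothing should still reference the temp table when it is dropped.
df = spark.sql("SELECT * FROM sample_catalog.sample_schema.sample_table_tmp")
pdf = df.toPandas()   # forces complete evaluation, result now lives client-side
if not pdf.empty:
    display(pdf)      # display() accepts pandas DataFrames on Databricks
del df, pdf
spark.sql("DROP TABLE IF EXISTS sample_catalog.sample_schema.sample_table_tmp")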
03-11-2024 02:28 PM
Per correspondence with Databricks, this appears to have been a bug that they are in the process of resolving. One additional observation: in the code above, the table does appear to get deleted, in spite of the error message.
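In the meantime, a defensive pattern (a sketch only, relying on the observation above that the DROP actually succeeds server-side despite the client error) is to catch the error and re-raise only if the table genuinely still exists:

try:
    spark.sql("DROP TABLE IF EXISTS sample_catalog.sample_schema.sample_table_tmp")
except Exception:
    # The drop appears to succeed despite the client-side error, so only
    # treat it as fatal if the table is really still there.
    if spark.catalog.tableExists("sample_catalog.sample_schema.sample_table_tmp"):
        raise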
03-20-2024 12:20 AM
Hi, any update on the issue? I am facing the same error while writing a stream from Auto Loader connected to Azure Blob Storage.
2024-03-20 02:12:11,971 1542 ERROR _handle_rpc_error GRPC Error received
Traceback (most recent call last):
  File "/databricks/spark/python/pyspark/sql/connect/client/core.py", line 1469, in _execute_and_fetch_as_iterator
    for b in generator:
  File "/usr/lib/python3.10/_collections_abc.py", line 330, in __next__
    return self.send(None)
  File "/databricks/spark/python/pyspark/sql/connect/client/reattach.py", line 132, in send
    if not self._has_next():
  File "/databricks/spark/python/pyspark/sql/connect/client/reattach.py", line 193, in _has_next
    raise e
  File "/databricks/spark/python/pyspark/sql/connect/client/reattach.py", line 165, in _has_next
    self._current = self._call_iter(
  File "/databricks/spark/python/pyspark/sql/connect/client/reattach.py", line 280, in _call_iter
    raise e
  File "/databricks/spark/python/pyspark/sql/connect/client/reattach.py", line 262, in _call_iter
    return iter_fun()
  File "/databricks/spark/python/pyspark/sql/connect/client/reattach.py", line 166, in <lambda>
    lambda: next(self._iterator)  # type: ignore[arg-type]
  File "/databricks/python/lib/python3.10/site-packages/grpc/_channel.py", line 426, in __next__
    return self._next()
  File "/databricks/python/lib/python3.10/site-packages/grpc/_channel.py", line 826, in _next
    raise self
grpc._channel._MultiThreadedRendezvous: <_MultiThreadedRendezvous of RPC that terminated with:
status = StatusCode.INTERNAL
details = ""
debug_error_string = "UNKNOWN:Error received from peer unix:/databricks/sparkconnect/grpc.sock {created_time:"2024-03-20T02:12:11.970061559+00:00", grpc_status:13, grpc_message:""}"
>
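For context, the stream is set up along these lines (a simplified sketch of the pattern; the storage paths, file format, and table name here are placeholders, not the real job):

# Simplified Auto Loader pattern; all paths and names are placeholders.
df = (spark.readStream
        .format("cloudFiles")
        .option("cloudFiles.format", "json")
        .option("cloudFiles.schemaLocation", "abfss://container@account.dfs.core.windows.net/_schemas/")
        .load("abfss://container@account.dfs.core.windows.net/landing/"))

(df.writeStream
   .option("checkpointLocation", "abfss://container@account.dfs.core.windows.net/_checkpoints/")
   .trigger(availableNow=True)
   .toTable("sample_catalog.sample_schema.sample_stream_table"))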
09-23-2024 02:07 PM
Hello, any updates on this issue? I am also facing a similar error while joining data in Unity Catalog:
Error received from peer unix:/databricks/sparkconnect/grpc.sock {created_time:"2024-09-23T17:50:35.235314263+00:00", grpc_status:2, grpc_message:"" Exception: [RETRIES_EXCEEDED] The maximum number of retries has been exceeded.
3 weeks ago
Following. Also having this issue, but in the context of pivoting a DataFrame and then aggregating across all of the resulting columns.
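Roughly this shape (a minimal sketch; id, category, and value are placeholder column names, not the real schema):

from pyspark.sql import functions as F

# Placeholder columns; the real job pivots, then aggregates across all of
# the resulting pivot columns.
pivoted = df.groupBy("id").pivot("category").agg(F.sum("value"))
result = pivoted.agg(*[F.sum(c).alias(c) for c in pivoted.columns if c != "id"])
display(result)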