The package com.crealytics:spark-excel_2.12:3.5.0_0.20.3 is intermittently unavailable: most of the time the Excel import works, but occasionally it fails with an exception (org.apache.spark.SparkClassNotFoundException).
I have installed the Maven package com.crealytics:spark-excel_2.12:3.5.0_0.20.3 on a Databricks cluster (14.2) with spark_version: "14.2.x-scala2.12" and "effective_spark_version": "14.2.x-photon-scala2.12". I'm using the databricks-connect Python library from another application to import files from Azure Blob Storage. Any help is appreciated.
ERROR _handle_rpc_error GRPC Error received
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/pyspark/sql/connect/client/core.py", line 1235, in _analyze
    resp = self._stub.AnalyzePlan(req, metadata=self._builder.metadata())
  File "/usr/local/lib/python3.10/site-packages/grpc/_channel.py", line 1030, in __call__
    return _end_unary_response_blocking(state, call, False, None)
  File "/usr/local/lib/python3.10/site-packages/grpc/_channel.py", line 910, in _end_unary_response_blocking
    raise _InactiveRpcError(state)  # pytype: disable=not-instantiable
grpc._channel._InactiveRpcError: <_InactiveRpcError of RPC that terminated with:
    status = StatusCode.INTERNAL
    details = "[DATA_SOURCE_NOT_FOUND] Failed to find data source: com.crealytics.spark.excel. Please find packages at `https://spark.apache.org/third-party-projects.html`. SQLSTATE: 42K02"
    debug_error_string = "UNKNOWN:Error received from peer {grpc_message:"[DATA_SOURCE_NOT_FOUND] Failed to find data source: com.crealytics.spark.excel. Please find packages at `https://spark.apache.org/third-party-projects.html`. SQLSTATE: 42K02", grpc_status:13, created_time:"2024-01-19T06:45:01.99118045+00:00"}"
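
Since the failure is intermittent, one possible stopgap until the root cause is found is to retry the read when this specific error appears. The sketch below is only an illustration, not a fix: the helper name `read_excel_with_retry`, the `header` option value, and the retry counts are all assumptions, and `spark` is expected to be an existing databricks-connect session. It touches `df.schema` on purpose, because the traceback shows the error surfacing during plan analysis rather than at `load()`.

```python
import time

# Data source name taken from the error message above.
EXCEL_FORMAT = "com.crealytics.spark.excel"


def read_excel_with_retry(spark, path, attempts=3, delay_seconds=2.0):
    """Hypothetical helper: retry an Excel read on DATA_SOURCE_NOT_FOUND.

    `spark` is an existing session (for example one created through
    databricks-connect); `path` is the file location to read.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")

    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            df = (
                spark.read.format(EXCEL_FORMAT)
                .option("header", "true")  # assumption: first row holds column names
                .load(path)
            )
            # Accessing the schema forces plan analysis, which is where the
            # traceback shows the error being raised.
            df.schema
            return df
        except Exception as exc:
            # Only retry the intermittent "data source not found" failure;
            # anything else is re-raised immediately.
            if "DATA_SOURCE_NOT_FOUND" not in str(exc):
                raise
            last_error = exc
            if attempt < attempts:
                time.sleep(delay_seconds)

    # Every attempt hit the same error, so surface the last one.
    raise last_error
```

If the retries succeed only some of the time, that would at least suggest the library is not consistently loaded on the cluster side, which may help narrow down the cause.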