I am reading a 83MB json file using " spark.read.json(storage_path)", when I display the data is seems displaying fine, but when I try command line count, it complains about file size , being more than 400MB, which is not true.
Photon JSON reader error: JSON document exceeded maximum allowed size 400.0 MiB. Any single document must fit within this memory budget.
at 0x4fe8a91 <photon>.UnrecoverableError(external/workspace_spark_3_3/photon/io/json/simd-json-util.h:33)
at 0x4fe8974 <photon>.TryLoadDocumentsFromStream(external/workspace_spark_3_3/photon/io/json/simd-json-reader.cc:286)
at 0x4fe907c <photon>.TryLoadDocumentsFromStream(external/workspace_spark_3_3/photon/io/json/simd-json-reader.cc:287)
at 0x4fe907c <photon>.TryLoadDocumentsFromStream(external/workspace_spark_3_3/photon/io/json/simd-json-reader.cc:287)
at 0x4fe907c <photon>.TryLoadDocumentsFromStream(external/workspace_spark_3_3/photon/io/json/simd-json-reader.cc:287)
at 0x4fe907c <photon>.TryLoadDocumentsFromStream(external/workspace_spark_3_3/photon/io/json/simd-json-reader.cc:287)
at 0x4fe907c <photon>.TryLoadDocumentsFromStream(external/workspace_spark_3_3/photon/io/json/simd-json-reader.cc:287)
at 0x4fe907c <photon>.TryLoadDocumentsFromStream(external/workspace_spark_3_3/photon/io/json/simd-json-reader.cc:287)
at 0x4fe907c <photon>.TryLoadDocumentsFromStream(external/workspace_spark_3_3/photon/io/json/simd-json-reader.cc:287)
at 0x4fe907c <photon>.TryLoadDocumentsFromStream(external/workspace_spark_3_3/photon/io/json/simd-json-reader.cc:287)
at 0x4fe929f <photon>.NextInput(external/workspace_spark_3_3/photon/io/json/simd-json-reader.cc:308)
at 0x4ac26de <photon>.OpenFileForReading(external/workspace_spark_3_3/photon/exec-nodes/json-file-scan-node.cc:501)
at 0x4ac1058 <photon>.OpenImpl(external/workspace_spark_3_3/photon/exec-nodes/json-file-scan-node.cc:402)
at 0x49cc47c <photon>.OpenImpl(external/workspace_spark_3_3/photon/exec-nodes/grouping-agg-node.cc:92)
at 0x49cc47c <photon>.OpenImpl(external/workspace_spark_3_3/photon/exec-nodes/shuffle-sink-node.cc:146)
at com.databricks.photon.JniApiImpl.open(Native Method)
at com.databricks.photon.JniApi.open(JniApi.scala)
at com.databricks.photon.JniExecNode.open(JniExecNode.java:64)
at com.databricks.photon.PhotonShuffleMapStageExec.$anonfun$preShuffleRDDInternal$9(PhotonExec.scala:809)
at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:23)
at com.databricks.photon.PhotonExec.timeit(PhotonExec.scala:344)
What can cause this error ? is it a bug ? same works fine when I switch off the photon engine.