Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-09-2022 05:42 AM
Can you confirm this is a known issue?
Running into same issue, example to test in 1 cell.
# using Arrow fails on HighConcurrency-cluster with PassThrough in runtime 10.4 (and 10.5 and 11.0)
spark.conf.set("spark.sql.execution.arrow.pyspark.enabled", "true") # toggle to see difference
df = spark.createDataFrame(sc.parallelize(range(0, 100)), schema="int")
df.toPandas() # << error here
# Msg: arrow is not supported when using file-based collectIt does work on a Personal cluster (Standard / SingleNode) with PassthroughAuth.