I'm working through the code at the following, but getting an error:
https://github.com/PacktPublishing/Essential-PySpark-for-Scalable-Data-Analytics/blob/main/Chapter01...
Code:
%sql DROP TABLE IF EXISTS word_counts; CREATE TABLE word_counts (word STRING) USING csv OPTIONS("delimiter"=" ") LOCATION "/databricks-datasets/README.md"
Error:
UnityCatalogServiceException: [RequestId=e8a962d8-82f1-48d0-9cb8-9758daefb92d ErrorClass=INVALID_PARAMETER_VALUE] GenerateTemporaryPathCredential uri /dbfs/databricks-datasets/README.md is not a valid URI. Error message: INVALID_PARAMETER_VALUE: Missing cloud file system scheme.
Details:
I'm using the Databricks Community Edition, with an Apache Spark Cluster on AWS for compute. I can see that the file is an internal file and is available in the list using "%fs ls /databricks-datasets/"
Questions:
Can anyone point me in the right direction on how to resolve this? I'd like to make sure I can properly work with SQL and the internal files provided with the community edition in order to complete all my learning objectives.
#community edition