09-28-2024 01:31 PM
I am trying to read in a JSON file with this in SQL Editor & it fails with None.get
CREATE TEMPORARY VIEW multilineJson
USING json
OPTIONS (path="/Volumes/my_catalog/my_schema/jsondir/test.json", multiline=true);
None.get is all the error it has.The permissions are fine, I have granted all privileges to all users & I am able to: LIST '/Volumes/my_catalog/my_schema/jsondir/test.json' without errors. The Diagnose error co-pilot wants me to use s3 path...
Sample JSON used -
[
{
"author": "user1",
"efficacy": 0.0,
"rating": 5,
"custom_labels": [ "label1", "label2" ]
},
{
"author": "user2",
"efficacy": 0.9,
"rating": 4,
"custom_labels": [ "label3", "label4" ]
}
]
Thanks !
10-03-2024 12:20 AM
Hi @sathyafmt ,
Are you using Serverless SQL Warehouse?
It seems accessing Volumes in Serverless SQL Warehouse is not implemented yet.
When running the code in SQL Warehouse I am getting the same error as you:
When running in notebook, it works just fine
Could you check whether you can create a table and it reads from the directory?
CREATE TABLE my_json_table
USING json
OPTIONS (
path '/Volumes/my_catalog/my_schema/jsondir/test.json',
multiline 'true'
);
When running this using SQL Serverless Warehouse I got an error:
Then when trying to enable volumes I got an error:
To sum up I believe as of now in Serverless SQL Warehouse volumes are not supported.
09-29-2024 11:16 AM
Hi @sathyafmt ,
None.get is a scala erros that suggests that a value does not exist.
Maybe it is about the path?
I wonder whether you can run the below code to make sure that the file is read to dataframe:
df = spark.read.option("multiline", "true").json("/Volumes/my_catalog/my_schema/jsondir/test.json")
df.show()
If yes, try to create your view using python:
df.createOrReplaceTempView("my_view")
09-29-2024 11:52 PM - edited 09-30-2024 12:03 AM
Hi @sathyafmt ,
What runtime are you on? There could be some limitations applied to Scala depending on runtime you choose.
10-02-2024 08:50 AM
thanks @szymon_dybczak @filipniziol for your response.
I am just doing this in the SQL Editor (not in python notebook or scala). I don't know how to get the runtime version .. lmk how to get it & I'll add that info.
Thanks again!
10-03-2024 12:20 AM
Hi @sathyafmt ,
Are you using Serverless SQL Warehouse?
It seems accessing Volumes in Serverless SQL Warehouse is not implemented yet.
When running the code in SQL Warehouse I am getting the same error as you:
When running in notebook, it works just fine
Could you check whether you can create a table and it reads from the directory?
CREATE TABLE my_json_table
USING json
OPTIONS (
path '/Volumes/my_catalog/my_schema/jsondir/test.json',
multiline 'true'
);
When running this using SQL Serverless Warehouse I got an error:
Then when trying to enable volumes I got an error:
To sum up I believe as of now in Serverless SQL Warehouse volumes are not supported.
10-03-2024 10:52 AM - edited 10-03-2024 10:54 AM
@filipniziol - Yes, I was on Serverless SQL Warehouse. It works with "CERATE TABLE .. " thx! I am surprised that the warehouse type is impacting this feature.
But I got the SQL from databricks documentation -https://docs.databricks.com/en/query/formats/json.html#multi-line-mode & I was hoping it would work..
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.
Request a New Group