Team ,
I am trying understand how the parquet files and JSON under the delta log folder stores the data behind the scenes
Table Creation:
from delta.tables import *
DeltaTable.create(spark) \
.tableName("employee") \
.addColumn("id", "INT") \
.addColumn("name", "STRING") \
.addColumn("dept", "STRING")\
.addColumn("salary", "INT") \
.location("/FileStore/tables/delta/demo2") \
.execute()
Step 2: %sql
INSERT INTO employee values(100,"Ram","CSE",1000)
Step 3:
%sql
select * from delta.`/FileStore/tables/delta/demo2`
![Ramakrishnan83_0-1710772217666.png Ramakrishnan83_0-1710772217666.png](/t5/image/serverpage/image-id/6689i61FE9698C200B8E0/image-size/medium/is-moderation-mode/true?v=v2&px=400)
Note: I made 2 inserts , so 2 parquet files
Challenge:
I am trying to read the JSON, CRC and Parquet files to see the contents in it . But I am getting the errors
![Ramakrishnan83_1-1710772318911.png Ramakrishnan83_1-1710772318911.png](/t5/image/serverpage/image-id/6690i4327F64E535DAC8E/image-size/medium/is-moderation-mode/true?v=v2&px=400)
Output of this command give me the structure of a JSON , not the actual data stored
![Ramakrishnan83_2-1710772374126.png Ramakrishnan83_2-1710772374126.png](/t5/image/serverpage/image-id/6691iA34660D1DA586B0A/image-size/medium/is-moderation-mode/true?v=v2&px=400)
Parquet file reading throws this error .
Note: My cluster is running with DBR 12.2 LTS