cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
cancel
Showing results for 
Search instead for 
Did you mean: 

Unstructured Data - PDF and a semi-structured data

MattM
New Contributor III

I have a scenario where one source is unstructered pdf files and another source is semi-structered JSON files. I get files from these two sources on a daily basis into an ADLS storage. What is the best way to load this into a medallion structure by starting to load from RAW->Silver->Gold. I need to report on the conformed data based on both sources from Gold table. Thanks.

1 REPLY 1

Kaniz
Community Manager
Community Manager

Hi @Matt M​, Please import this notebook and read this excellent Medallion Architecture article. Let us know how it goes. Thanks.

Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.