cancel
Showing results for 
Search instead for 
Did you mean: 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results for 
Search instead for 
Did you mean: 

Can I Replicate Azure Document Intelligence's Custom Table Extraction in Databricks?

AlbertWang
Valued Contributor

I am using Azure Document Intelligence to get data from a table in a PDF file. The table's headers do not visually align with the values. Therefore, the standard and pre-built models cannot correctly read the data.

I have built a custom-trained Azure Document Intelligence model and can read the data perfectly. When I trained the model, I used the Azure Document Intelligence feature and first ran a layout scan of the PDF file. Then, I created a new table type field and manually labelled and aligned each value detected on the PDF to one cell in the table field. After adding 4 PDF files, I could train a reasonably good model.

I want to know whether I can do the same/similar thing on Databricks using only Databricks's features? Not using Azure Document Intelligence.

0 REPLIES 0

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now