cancel
Showing results for 
Search instead for 
Did you mean: 
Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
cancel
Showing results for 
Search instead for 
Did you mean: 

Unstructured Data (Images) training in Databricks

aswinkks
New Contributor III

I'm looking for a solution where

1. Need a delta table that saves the pointers(path) of the images in volume

2. Train a Pytorch, tensorflow or keras model with the data from delta lake.

I tried multiple methods :

1. Storing the data as vectors or binary instead of path - I don't need this as it's not efficient

2. Used deltatorch instead of Pandas as Pytorch doesn't support spark APIs - Seems to have some issues and Dont need pandas  it's not supported for Big Data

3. I have created a delta table with pointers(path) to the image, but the issue is that when converted to python objects, It's not able to read the file from "dbfs:"

Can anyone please tell if you have trained a model according to the requirement I have posted

 

1 ACCEPTED SOLUTION

Accepted Solutions

BigRoux
Databricks Employee
Databricks Employee
2 REPLIES 2

BigRoux
Databricks Employee
Databricks Employee

aswinkks
New Contributor III

Thanks @BigRoux , the solution helps. 

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now