Databricks Community

NathanLaw · 05-19-2022

We are converting a PySpark DataFrame to a TensorFlow dataset using Petastorm and have encountered a "data adapter" error. What do you recommend for diagnosing and fixing this error?

https://docs.microsoft.com/en-us/azure/databricks/applications/machine-learning/load-data/petastorm

https://docs.microsoft.com/en-us/azure/databricks/_static/notebooks/deep-learning/petastorm-spark-co...

Thanks for your help.

Anonymous · 06-06-2022

Hi @Nathan Law, following up: did you get a chance to check @Kaniz Fatma's previous comments?

NathanLaw · 06-06-2022

Hi,

From the Petastorm example:

# Make sure the number of partitions is at least the number of workers which is required for distributed training.

I am testing a recommendation not to use autoscaling. I'll report back with my findings.

Nathan
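The partition requirement quoted above can be checked before the DataFrame is handed to Petastorm. The snippet below is only a minimal sketch: `ensure_min_partitions` and `num_workers` are hypothetical names, not part of the Petastorm example, and it assumes a fixed (non-autoscaling) cluster so the worker count is known in advance. It relies only on the standard PySpark calls `df.rdd.getNumPartitions()` and `df.repartition(n)`.

```python
def ensure_min_partitions(df, num_workers):
    """Return df with at least num_workers partitions (hypothetical helper).

    df is expected to be a PySpark DataFrame; num_workers is the number of
    training workers, assumed fixed because autoscaling is turned off.
    """
    if num_workers < 1:
        raise ValueError("num_workers must be >= 1")
    current = df.rdd.getNumPartitions()
    if current >= num_workers:
        # Already enough partitions: leave the DataFrame untouched.
        return df
    # Too few partitions: repartition up to the worker count.
    return df.repartition(num_workers)


# Possible usage on a cluster (not executed here). This assumes the Petastorm
# cache directory conf is already set, as in the linked example:
#   from petastorm.spark import make_spark_converter
#   df = ensure_min_partitions(df, num_workers=4)
#   converter = make_spark_converter(df)
#   with converter.make_tf_dataset(batch_size=32) as dataset:
#       model.fit(dataset, steps_per_epoch=..., epochs=...)
```

Whether this resolves the "data adapter" error is not confirmed in this thread; it only rules out the partition count as a cause.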

Anonymous · 07-19-2022

Hey there @Nathan Law,

Hope all is well!

Just wanted to check in: were you able to resolve your issue? If so, would you be happy to share the solution or mark an answer as best? It would be really helpful for other members too.

We'd love to hear from you.

Cheers!