05-19-2022 10:52 AM
We are converting a PySpark DataFrame to TensorFlow using Petastorm and have encountered a “data adapter” error. What do you recommend for diagnosing and fixing this error?
https://docs.microsoft.com/en-us/azure/databricks/applications/machine-learning/load-data/petastorm
Thanks for your help.
06-06-2022 06:00 AM
Hi @Nathan Law, following up: did you get a chance to check @Kaniz Fatma's previous comments?
06-06-2022 06:30 AM
Hi,
From the Petastorm example:
# Make sure the number of partitions is at least the number of workers which is required for distributed training.
I am testing a recommendation not to use autoscaling. I'll report back with findings.
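For anyone following along, the partition requirement quoted from the Petastorm example can be checked with a small helper like the one below. This is only a sketch: `ensure_min_partitions` and `num_workers` are hypothetical names, not part of Petastorm or the linked example. It relies only on the standard PySpark calls `df.rdd.getNumPartitions()` and `df.repartition()`. On a fixed-size cluster the worker count is known up front, which is presumably part of the reasoning behind avoiding autoscaling.

```python
def ensure_min_partitions(df, num_workers):
    """Return a DataFrame with at least `num_workers` partitions.

    `df` is assumed to be a PySpark DataFrame; `num_workers` is the
    number of training workers (a hypothetical parameter you supply).
    """
    current = df.rdd.getNumPartitions()
    if current < num_workers:
        # Too few partitions for distributed training: redistribute the rows.
        return df.repartition(num_workers)
    # Already enough partitions: leave the DataFrame untouched.
    return df
```

You would call this on the DataFrame before handing it to the Petastorm converter, e.g. `df = ensure_min_partitions(df, num_workers=2)`.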
07-19-2022 08:05 AM
Hey there @Nathan Law
Hope all is well!
Just wanted to check in: were you able to resolve your issue? If so, would you be happy to share the solution or mark an answer as best? It would be really helpful for other members too.
We'd love to hear from you.
Cheers!
07-19-2022 08:24 AM
Making progress but still working through issues. I'll post findings when completed.
07-19-2022 08:35 AM
Hey @Nathan Law
Thank you so much for getting back to us. We will await your response.
We really appreciate your time.