3 weeks ago
Thanks for the Databricks community and maintaining such a valuable platform.
I would like to inquire if there is a planned timeline for upgrading the GraphFrame library. We’ve noticed that the latest release on GitHub is v0.9.3, while the Databricks Runtime for Machine Learning (17.3 LTS) is still using v0.8.4-db1-spark3.5.
We’re particularly interested in recent updates such as the early stopping feature in Pregel (PR #550):
Releases · graphframes/graphframes
Databricks Runtime 17.3 LTS for Machine Learning | Databricks on AWS
Thanks!
3 weeks ago
For PySpark.
3 weeks ago - last edited 3 weeks ago
I don't see any dates for it. But you can try this work around.
If you need access to the latest GraphFrames features
Manual Installation: You can manually install the GraphFrames v0.9.3 JAR in your cluster.
3 weeks ago
You can try to add to your cluster mvn dependency manually ... For example, for spark 3.5.x it will be like:
io.graphframes:graphframes-spark3_2.12:0.10.0
and add a PyPi dependency graphframes-py. Adding maven coordinates should download and install all the JVM dependencies.
But most probably it won't work on DBR ML runtimes because you will have in CP two differently named graphframes JARs, but with the same namespace and barely anyone will tell you how it will be resolved in runtime... I think the best way is just using generic runtime instead of DBR ML.
2 weeks ago
Greeting @toproximahk , thanks for the kind words and for the detailed pointers.
org.graphframes:graphframes_2.13:0.8.4-db1-spark3.5 on both CPU and GPU clusters, as listed in the Java/Scala libraries section of the 17.3 LTS ML release notes.0.8.4-db1-spark3.5 (17.0, 17.1, 17.2, 17.3), and there’s no change log entry pointing to a move to 0.9.x.Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!
Sign Up Now