08-23-2022 01:20 AM
How to Install Libraries on Databricks
You can install libraries in Databricks at the cluster level for libraries commonly used on a cluster, at the notebook-level using %pip, or using global init scripts when you have libraries that should be installed on all clusters.
The Databricks ML runtime comes with many commonly used ML libraries pre-installed including sklearn, Tensorflow, and XGBoost (AWS | Azure | GCP)
You can learn more at this page (AWS | Azure | GCP). Let us know in the comments if you have any questions on library dependency!
08-25-2022 01:39 AM
@Isaac Gritz Adding a few more docs links for > https://docs.databricks.com/libraries/index.html
08-25-2022 01:42 AM
Thanks Abishek!
08-26-2022 08:35 PM
Good post @Isaac Gritz and thanks @Abishek Subramanian for adding those links.
08-26-2022 08:56 PM
I saw couple of you posts and it seems that you are sharing the information from the public docs. instead of sharing the docs information you can post some errors and the troubleshooting and resolution steps. That will be much helpful. Thanks.
08-26-2022 09:03 PM
Hi team, I am a Databricks employee and I am posting common questions I have received from Databricks customers in the past and answers I would typically provide. I reached out to @Lindsay Olson to validate this approach. Happy to remove any posts or find a new forum/format if you would like.
09-25-2022 01:13 PM
It can be a risky to install libraries without any sort of oversite/security structure to ensure those libraries have no vulnerabilities. I think more caution needs to be added to the wording of these documents to express that. All of the libraries we use go through a vetting process before they can actually be installed.
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.
Request a New Group