08-23-2022 01:20 AM
How to Install Libraries on Databricks
You can install libraries in Databricks at the cluster level for libraries commonly used on a cluster, at the notebook level using %pip, or through global init scripts for libraries that should be installed on all clusters.
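For the cluster-level option, here is a rough sketch of what a scripted install request could look like. It only builds the JSON body that, as far as I know, the Databricks Libraries API install endpoint (`POST /api/2.0/libraries/install`) expects; it does not send anything. The function name, the cluster ID, and the package pin are all made up for illustration, so please check the payload shape against the docs for your cloud before relying on it.

```python
import json


def build_install_payload(cluster_id, pypi_packages):
    """Build a JSON body for a cluster-level PyPI library install request.

    Assumption: the body has a "cluster_id" string and a "libraries" list
    whose entries look like {"pypi": {"package": "<name>==<version>"}}.
    """
    return {
        "cluster_id": cluster_id,
        "libraries": [{"pypi": {"package": pkg}} for pkg in pypi_packages],
    }


# Hypothetical cluster ID and package pin, for illustration only.
payload = build_install_payload("1234-567890-abcde123", ["simplejson==3.19.2"])
print(json.dumps(payload, indent=2))
```

For a notebook-scoped install, the equivalent is a notebook cell containing `%pip install` followed by the package name.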
The Databricks ML runtime comes with many commonly used ML libraries pre-installed, including scikit-learn, TensorFlow, and XGBoost (AWS | Azure | GCP).
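Before installing anything, it can help to check what the runtime already ships with. A minimal sketch using only the Python standard library is below; the helper name is hypothetical, and the three distribution names are just the ones mentioned above. What it prints depends on the runtime you are attached to.

```python
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


# Report which of the libraries named above are already available.
for name in ["scikit-learn", "tensorflow", "xgboost"]:
    print(name, installed_version(name) or "not installed")
```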
You can learn more at this page (AWS | Azure | GCP). Let us know in the comments if you have any questions about library dependencies!
08-25-2022 01:39 AM
@Isaac Gritz Adding a few more docs links: https://docs.databricks.com/libraries/index.html
08-25-2022 01:42 AM
Thanks Abishek!
08-26-2022 08:35 PM
Good post @Isaac Gritz, and thanks @Abishek Subramanian for adding those links.
08-26-2022 08:56 PM
I saw a couple of your posts, and it seems that you are sharing information from the public docs. Instead of sharing the docs information, you could post some errors along with the troubleshooting and resolution steps. That would be much more helpful. Thanks.
08-26-2022 09:03 PM
Hi team, I am a Databricks employee and I am posting common questions I have received from Databricks customers in the past and answers I would typically provide. I reached out to @Lindsay Olson to validate this approach. Happy to remove any posts or find a new forum/format if you would like.
09-25-2022 01:13 PM
It can be risky to install libraries without any sort of oversight/security structure to ensure those libraries have no vulnerabilities. I think more caution needs to be added to the wording of these documents to express that. All of the libraries we use go through a vetting process before they can actually be installed.