cancel
Showing results for 
Search instead for 
Did you mean: 
Warehousing & Analytics
Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Community. Share insights, tips, and best practices for leveraging data for informed decision-making.
cancel
Showing results for 
Search instead for 
Did you mean: 

Unable to install kneed library in cluster with DBR version 5.5 LTS

User16753724663
Valued Contributor

I have an issue to install and use kneed python libary. 

https://pypi.org/project/kneed/

I can install it and check it from log.

[Install command]

%sh

pip install kneed

[log]

Installing collected packages: kneed

Successfully installed kneed-0.7.0

but when I call this library it shows error:

ModuleNotFoundError: No module named 'kneed'

I got this error after changing cluster to "5.5 LTS ML (includes Apache Spark 2.4.3, Scala 2.11)"

1 REPLY 1

User16753724663
Valued Contributor

The kneed library has a dependency and we need to install them as well in order to work:

numpy==1.18

scipy==1.1.0

scikit-learn==0.21.3

Once we install the above libraries using GUI, we can run the below command to check the installed library with the correct version:

%sh
/databricks/python3/bin/pip3 list

Since the DBR 5.5 has both python version ie python2 and python3, hence we need to use the above command.

Please use the specific versions only.

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group