06-01-2023 09:04 AM
I'm using Azure Databricks and I'd like to create a project virtual environment, persisted on a shared compute cluster. As the cluster is shared for many projects, it is necessary to have virtual environments if I want to execute code runs from within Databricks repos. This virtual environment should be easily created via a requirements.txt or conda.yaml file, and activated via the magic command %conda activate <env_name> (or a similar fashion) at the start of a notebook. I cannot find any documentation that lists down the steps, but it seems to be doable according to @Suteja Kanuri answer (option2) another question thread here.
Thank you.
06-05-2023 03:37 PM
Hi, You can refer to https://www.databricks.com/blog/2020/06/17/simplify-python-environment-management-on-databricks-runt.... Also, please look into the limitations.
Please tag @Debayan with your next response which will notify me, Thank you!
06-05-2023 07:47 PM
Hi @Debayan Mukherjee ,
Thank you for your response, I have read this article and here are my understanding, please feedback if they are correct:
Can you help with some additional questions:
Thank you for your kind assistance.
06-08-2023 12:19 AM
Hi,
Thanks.
06-13-2023 12:13 AM
Thank you, let me know when you have the information
06-12-2023 08:53 PM
Hi @Joshua L
We haven't heard from you since the last response from @Debayan Mukherjee , and I was checking back to see if her suggestions helped you.
Or else, If you have any solution, please share it with the community, as it can be helpful to others.
Also, Please don't forget to click on the "Select As Best" button whenever the information provided helps resolve your question.
06-13-2023 12:12 AM
Hi @Vidula Khanna
I'm still waiting for @Debayan Mukherjee responses to the other 2 questions as stated in his reply.
Thank you
06-14-2023 10:52 PM
Hi @Joshua L , Appreciate your patience on the same.
I have checked with the SME and got the confirmation that it’s not supported and in fact it has been deprecated. I don’t think we plan to support it anytime.
Also, we are not aware about any Azure DB tools as such on this, but this can be followed:
https://packaging.python.org/en/latest/guides/installing-using-pip-and-virtual-environments/
https://docs.python.org/3/tutorial/venv.html
Which is also the same as in installing packages into the system.
Here, without installing the package into the system , we dont think it is possible. Can you clarify if not system installed then what is the expectation? Even if it is a tool, it will install few packages in the system. Please let me know if I have misunderstood the requirement.
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.
Request a New Group