How to use the SparkNLP library and JohnSnowLabs Maven coordinates on a cluster that is not connected to the internet
02-19-2023 07:20 PM
Hi,
I am trying the SparkNLP library for the first time. The cluster I'm using is a corporate one and cannot be connected to the internet. I can only install packages that are provided to us or that come as a jar file.
I have three questions:
- Which jar files do I need to install the SparkNLP library for NLP work? I will need BERT transformers and encoders, as well as the other packages required for NER work with SparkNLP.
- How can I add the proper johnsnowlabs Maven coordinates and jar file to my cluster when it is not connected to the internet?
- How can I reference these installed libraries in a notebook running on that cluster?
Thanks!
1 REPLY 1

Anonymous
Not applicable
04-21-2023 10:22 PM
Hi @Samy Syed
Hope everything is going great.
Just checking in to see whether you were able to resolve your issue. If so, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please let us know so we can help you.
Cheers!

