<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How do I install a non python/R/maven/jar library into a cluster? in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18869#M1025</link>
    <description>&lt;P&gt;@&lt;A href="https://community.databricks.com/s/profile/0058Y00000AhBvtQAF" alt="https://community.databricks.com/s/profile/0058Y00000AhBvtQAF" target="_blank"&gt;Bradley&lt;/A&gt;&amp;nbsp;(Customer), wget will download the file on the driver, so then it will need to be moved to the filesystem. You can see the distributed file system with %sh ls /dbfs/&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 02 Jun 2022 23:46:57 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2022-06-02T23:46:57Z</dc:date>
    <item>
      <title>How do I install a non python/R/maven/jar library into a cluster?</title>
      <link>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18867#M1023</link>
      <description>&lt;P&gt;I'm trying to install a non standard package into the cluster using the init scripts. The package I'm trying to install needs to be downloaded using wget, and uncompressed using tar. Then added to the PATH, or at least I need to know where the downloaded files live.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Here are the contents of my init scripts:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;wget &lt;A href="https://github.com/COMBINE-lab/salmon/releases/download/v1.8.0/salmon-1.8.0_linux_x86_64.tar.gz" target="test_blank"&gt;https://github.com/COMBINE-lab/salmon/releases/download/v1.8.0/salmon-1.8.0_linux_x86_64.tar.gz&lt;/A&gt;
tar xzvf salmon-1.8.0_linux_x86_64.tar.gz
SALMON_PATH=$(readlink -f ./salmon-1.8.0_linux_x86_64/bin/)
export PATH="$SALMON_PATH:$PATH"&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;But this isn't working, at least in a notebook.&lt;/P&gt;</description>
      <pubDate>Thu, 02 Jun 2022 19:28:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18867#M1023</guid>
      <dc:creator>Bradley</dc:creator>
      <dc:date>2022-06-02T19:28:54Z</dc:date>
    </item>
    <item>
      <title>Re: How do I install a non python/R/maven/jar library into a cluster?</title>
      <link>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18868#M1024</link>
      <description>&lt;P&gt;@&lt;A href="https://community.databricks.com/s/profile/0058Y00000AhBvtQAF" alt="https://community.databricks.com/s/profile/0058Y00000AhBvtQAF" target="_blank"&gt;Bradley&lt;/A&gt;&amp;nbsp;(Customer), You can try to provide the directory option while extracting the tar file. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Or you can download the file to the dbfs location and then use the init script to copy it to the path where it should be in the cluster.&lt;/P&gt;</description>
      <pubDate>Thu, 02 Jun 2022 23:12:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18868#M1024</guid>
      <dc:creator>Prabakar</dc:creator>
      <dc:date>2022-06-02T23:12:10Z</dc:date>
    </item>
    <item>
      <title>Re: How do I install a non python/R/maven/jar library into a cluster?</title>
      <link>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18869#M1025</link>
      <description>&lt;P&gt;@&lt;A href="https://community.databricks.com/s/profile/0058Y00000AhBvtQAF" alt="https://community.databricks.com/s/profile/0058Y00000AhBvtQAF" target="_blank"&gt;Bradley&lt;/A&gt;&amp;nbsp;(Customer), wget will download the file on the driver, so then it will need to be moved to the filesystem. You can see the distributed file system with %sh ls /dbfs/&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 02 Jun 2022 23:46:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18869#M1025</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-06-02T23:46:57Z</dc:date>
    </item>
    <item>
      <title>Re: How do I install a non python/R/maven/jar library into a cluster?</title>
      <link>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18870#M1026</link>
      <description>&lt;P&gt;@Bradley Wright​&amp;nbsp; Could you please share the error message you are receiving when using the INIT script?&lt;/P&gt;</description>
      <pubDate>Mon, 06 Jun 2022 04:10:18 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18870#M1026</guid>
      <dc:creator>User16764241763</dc:creator>
      <dc:date>2022-06-06T04:10:18Z</dc:date>
    </item>
    <item>
      <title>Re: How do I install a non python/R/maven/jar library into a cluster?</title>
      <link>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18872#M1028</link>
      <description>&lt;P&gt;Thank you for the support. Yes, I was able to find a working solution.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;I placed the files into the distributed file system, dbfs. For others, this can be done manually using the databricks cli, or using the init scripts. In this case I found it easier to use the databricks cli.&lt;/LI&gt;&lt;LI&gt;I appended the path to the binary file to the OS environment path. I'm using python so for me it looks like this: &lt;/LI&gt;&lt;/OL&gt;&lt;PRE&gt;&lt;CODE&gt;os.environ['PATH'] += ':/dbfs/FileStore/salmon/bin/'&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;And that was it! I stored the file paths to the input data in a dataframe, then used spark to iterate across the rows of the dataframe calling a custom function that calls the binary file.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jun 2022 17:13:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18872#M1028</guid>
      <dc:creator>Bradley</dc:creator>
      <dc:date>2022-06-20T17:13:14Z</dc:date>
    </item>
    <item>
      <title>Re: How do I install a non python/R/maven/jar library into a cluster?</title>
      <link>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18874#M1030</link>
      <description>&lt;P&gt;I am happy that my suggestion added some pointers for you to resolve the issue.​&lt;/P&gt;</description>
      <pubDate>Sat, 16 Jul 2022 00:27:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/how-do-i-install-a-non-python-r-maven-jar-library-into-a-cluster/m-p/18874#M1030</guid>
      <dc:creator>Prabakar</dc:creator>
      <dc:date>2022-07-16T00:27:46Z</dc:date>
    </item>
  </channel>
</rss>

