<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic how to include a third-party Maven package in MLflow model serving job cluster in Azure Databricks in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-to-include-a-third-party-maven-package-in-mlflow-model/m-p/17060#M11132</link>
    <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;We try to use MLflow Model Serving, this service will enable realtime model serving behind a REST API interface; it will launch a single-node cluster that will host our model.&lt;/P&gt;
&lt;P&gt;The issue happens when the single-node cluster try to get the environment ready base on a conda.yaml file that created when log the model using MLflow. But it looks like I can only specify a pip install but not a Maven package.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;conda_env = _mlflow_conda_env(
&lt;P&gt;additional_conda_deps=None,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;additional_pip_deps=["cloudpickle=={}".format(cloudpickle.&lt;I&gt;version&lt;/I&gt;), "scikit-learn=={}".format(sklearn.&lt;I&gt;version&lt;/I&gt;),"pyspark==3.0.0".format(pyspark.&lt;I&gt;version&lt;/I&gt;))],&lt;P&gt;&lt;/P&gt;additional_conda_channels=None,&lt;P&gt;&lt;/P&gt;)
&lt;P&gt;how can I tell the cluster to install a maven jar file?&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 29 Jul 2021 21:11:45 GMT</pubDate>
    <dc:creator>Celia</dc:creator>
    <dc:date>2021-07-29T21:11:45Z</dc:date>
    <item>
      <title>how to include a third-party Maven package in MLflow model serving job cluster in Azure Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-include-a-third-party-maven-package-in-mlflow-model/m-p/17060#M11132</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;We try to use MLflow Model Serving, this service will enable realtime model serving behind a REST API interface; it will launch a single-node cluster that will host our model.&lt;/P&gt;
&lt;P&gt;The issue happens when the single-node cluster try to get the environment ready base on a conda.yaml file that created when log the model using MLflow. But it looks like I can only specify a pip install but not a Maven package.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;conda_env = _mlflow_conda_env(
&lt;P&gt;additional_conda_deps=None,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;additional_pip_deps=["cloudpickle=={}".format(cloudpickle.&lt;I&gt;version&lt;/I&gt;), "scikit-learn=={}".format(sklearn.&lt;I&gt;version&lt;/I&gt;),"pyspark==3.0.0".format(pyspark.&lt;I&gt;version&lt;/I&gt;))],&lt;P&gt;&lt;/P&gt;additional_conda_channels=None,&lt;P&gt;&lt;/P&gt;)
&lt;P&gt;how can I tell the cluster to install a maven jar file?&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 29 Jul 2021 21:11:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-include-a-third-party-maven-package-in-mlflow-model/m-p/17060#M11132</guid>
      <dc:creator>Celia</dc:creator>
      <dc:date>2021-07-29T21:11:45Z</dc:date>
    </item>
    <item>
      <title>Re: how to include a third-party Maven package in MLflow model serving job cluster in Azure Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-include-a-third-party-maven-package-in-mlflow-model/m-p/17061#M11133</link>
      <description>&lt;P&gt;I don't believe you can do that at the moment. Is it required for a Python model? only Python-based models can really be served this way at the moment&lt;/P&gt;</description>
      <pubDate>Wed, 01 Sep 2021 17:45:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-include-a-third-party-maven-package-in-mlflow-model/m-p/17061#M11133</guid>
      <dc:creator>sean_owen</dc:creator>
      <dc:date>2021-09-01T17:45:16Z</dc:date>
    </item>
    <item>
      <title>Re: how to include a third-party Maven package in MLflow model serving job cluster in Azure Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-include-a-third-party-maven-package-in-mlflow-model/m-p/17062#M11134</link>
      <description>&lt;P&gt;Unfortunately we came across this same issue. We were trying to use MLFlow Serve to produce an API that could take text input and pass it through some NLP. In this instance we had installed a maven package on the cluster, so the experiment would run fine in a notebook, but MLFlow would fail as it couldn't install the maven package. As an alternative, it would help to be able to modify the job cluster that is provisioned to add additional libraries/packages that are required, that we can not specify in the conda definition. &lt;/P&gt;</description>
      <pubDate>Tue, 14 Sep 2021 13:55:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-include-a-third-party-maven-package-in-mlflow-model/m-p/17062#M11134</guid>
      <dc:creator>BeardyMan</dc:creator>
      <dc:date>2021-09-14T13:55:21Z</dc:date>
    </item>
  </channel>
</rss>

