<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to input initial centroids to K-Means or GMM Clustering in SparkML ? in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/how-to-input-initial-centroids-to-k-means-or-gmm-clustering-in/m-p/16405#M865</link>
    <description>&lt;P&gt;Hi @Kaniz Fatma​&amp;nbsp;, thanks for your response. Since MLlib is currently in the maintenance mode we don't want to use it in production and hence want to use SparkML (data frame based APIs). Here is the problem. In SparkML libraries there is no mention of how to input user specified centroids for K-Means or GMM. That was my question. &lt;/P&gt;</description>
    <pubDate>Tue, 05 Jul 2022 13:39:56 GMT</pubDate>
    <dc:creator>HariK1</dc:creator>
    <dc:date>2022-07-05T13:39:56Z</dc:date>
    <item>
      <title>How to input initial centroids to K-Means or GMM Clustering in SparkML ?</title>
      <link>https://community.databricks.com/t5/machine-learning/how-to-input-initial-centroids-to-k-means-or-gmm-clustering-in/m-p/16403#M863</link>
      <description>&lt;P&gt;Hi, I want to use KMeans Model or Gaussian Mixture Model algorithm for clustering using the SparkML library, in which I want to specify the initial centroids. The option of giving initial centroids is there in the Spark MLlib (RDD based APIs) however not available in the Pyspark DataFrame based APIs (SparkML).  Since Spark MLlib is in the maintenance mode, I do not want to use it and instead use SaprkML library.  Is anyone knows a workaround? Thank You.&lt;/P&gt;</description>
      <pubDate>Tue, 28 Jun 2022 15:36:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/how-to-input-initial-centroids-to-k-means-or-gmm-clustering-in/m-p/16403#M863</guid>
      <dc:creator>HariK1</dc:creator>
      <dc:date>2022-06-28T15:36:43Z</dc:date>
    </item>
    <item>
      <title>Re: How to input initial centroids to K-Means or GMM Clustering in SparkML ?</title>
      <link>https://community.databricks.com/t5/machine-learning/how-to-input-initial-centroids-to-k-means-or-gmm-clustering-in/m-p/16405#M865</link>
      <description>&lt;P&gt;Hi @Kaniz Fatma​&amp;nbsp;, thanks for your response. Since MLlib is currently in the maintenance mode we don't want to use it in production and hence want to use SparkML (data frame based APIs). Here is the problem. In SparkML libraries there is no mention of how to input user specified centroids for K-Means or GMM. That was my question. &lt;/P&gt;</description>
      <pubDate>Tue, 05 Jul 2022 13:39:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/how-to-input-initial-centroids-to-k-means-or-gmm-clustering-in/m-p/16405#M865</guid>
      <dc:creator>HariK1</dc:creator>
      <dc:date>2022-07-05T13:39:56Z</dc:date>
    </item>
    <item>
      <title>Re: How to input initial centroids to K-Means or GMM Clustering in SparkML ?</title>
      <link>https://community.databricks.com/t5/machine-learning/how-to-input-initial-centroids-to-k-means-or-gmm-clustering-in/m-p/16406#M866</link>
      <description>&lt;P&gt;@Kaniz Fatma​&amp;nbsp;I still haven't got an answer to my question!!!&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2022 14:52:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/how-to-input-initial-centroids-to-k-means-or-gmm-clustering-in/m-p/16406#M866</guid>
      <dc:creator>HariK1</dc:creator>
      <dc:date>2022-08-04T14:52:28Z</dc:date>
    </item>
  </channel>
</rss>

