<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic In Spark MLlib, what is the difference between an estimator and a transformer? in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/in-spark-mllib-what-is-the-difference-between-an-estimator-and-a/m-p/23204#M1313</link>
    <description />
    <pubDate>Thu, 17 Jun 2021 15:05:21 GMT</pubDate>
    <dc:creator>User16826992666</dc:creator>
    <dc:date>2021-06-17T15:05:21Z</dc:date>
    <item>
      <title>In Spark MLlib, what is the difference between an estimator and a transformer?</title>
      <link>https://community.databricks.com/t5/machine-learning/in-spark-mllib-what-is-the-difference-between-an-estimator-and-a/m-p/23204#M1313</link>
      <description />
      <pubDate>Thu, 17 Jun 2021 15:05:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/in-spark-mllib-what-is-the-difference-between-an-estimator-and-a/m-p/23204#M1313</guid>
      <dc:creator>User16826992666</dc:creator>
      <dc:date>2021-06-17T15:05:21Z</dc:date>
    </item>
    <item>
      <title>Re: In Spark MLlib, what is the difference between an estimator and a transformer?</title>
      <link>https://community.databricks.com/t5/machine-learning/in-spark-mllib-what-is-the-difference-between-an-estimator-and-a/m-p/23205#M1314</link>
      <description>&lt;P&gt;These terms are borrowed from scikit-learn, and the idea is the same. A transformer is a pipeline component that transforms the data in some way, via transform(). An estimator is a component that has to be 'fit' on data before it knows how to transform: in Spark, calling fit() on an estimator returns a model, and that model is itself a transformer.&lt;/P&gt;&lt;P&gt;For example, a Tokenizer is just a transformer, because it does not need to see any data to know how to tokenize strings. A machine learning algorithm like LogisticRegression also ends up transforming data, by adding a prediction column. However, it has to be fit on data before it can do so, so it is an estimator: fitting it produces a LogisticRegressionModel, which is the transformer.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Jun 2021 18:21:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/in-spark-mllib-what-is-the-difference-between-an-estimator-and-a/m-p/23205#M1314</guid>
      <dc:creator>sean_owen</dc:creator>
      <dc:date>2021-06-17T18:21:49Z</dc:date>
    </item>
  </channel>
</rss>

