<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: What's the difference between SparkML and Spark MLlib? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/what-s-the-difference-between-sparkml-and-spark-mllib/m-p/23234#M15999</link>
    <description>&lt;P&gt;They're not really different. Before Spark had DataFrames, the older implementations of ML algorithms were built on the RDD API (the spark.mllib package). This is generally called "Spark MLlib". After DataFrames arrived, some newer implementations were added as wrappers on top of the old ones, extending the API to work with DataFrames (the spark.ml package). This is sometimes called "Spark ML". The two names are used more or less interchangeably. Both APIs are still there, and in some cases the 'old' Spark MLlib has implementations that aren't present in the 'newer' Spark ML APIs. So while you'll probably mostly use the DataFrame API versions, they're really one library, and either term is OK. I think "Spark ML" is preferred for the newer API, but you'll still see MLlib used a lot.&lt;/P&gt;</description>
    <pubDate>Thu, 17 Jun 2021 18:23:47 GMT</pubDate>
    <dc:creator>sean_owen</dc:creator>
    <dc:date>2021-06-17T18:23:47Z</dc:date>
    <item>
      <title>What's the difference between SparkML and Spark MLlib?</title>
      <link>https://community.databricks.com/t5/data-engineering/what-s-the-difference-between-sparkml-and-spark-mllib/m-p/23233#M15998</link>
      <description>&lt;P&gt;I have heard people talk about SparkML, but the documentation talks about MLlib. I don't understand the difference. Could anyone help me understand it?&lt;/P&gt;</description>
      <pubDate>Thu, 17 Jun 2021 15:02:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/what-s-the-difference-between-sparkml-and-spark-mllib/m-p/23233#M15998</guid>
      <dc:creator>User16826992666</dc:creator>
      <dc:date>2021-06-17T15:02:38Z</dc:date>
    </item>
    <item>
      <title>Re: What's the difference between SparkML and Spark MLlib?</title>
      <link>https://community.databricks.com/t5/data-engineering/what-s-the-difference-between-sparkml-and-spark-mllib/m-p/23234#M15999</link>
      <description>&lt;P&gt;They're not really different. Before Spark had DataFrames, the older implementations of ML algorithms were built on the RDD API (the spark.mllib package). This is generally called "Spark MLlib". After DataFrames arrived, some newer implementations were added as wrappers on top of the old ones, extending the API to work with DataFrames (the spark.ml package). This is sometimes called "Spark ML". The two names are used more or less interchangeably. Both APIs are still there, and in some cases the 'old' Spark MLlib has implementations that aren't present in the 'newer' Spark ML APIs. So while you'll probably mostly use the DataFrame API versions, they're really one library, and either term is OK. I think "Spark ML" is preferred for the newer API, but you'll still see MLlib used a lot.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Jun 2021 18:23:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/what-s-the-difference-between-sparkml-and-spark-mllib/m-p/23234#M15999</guid>
      <dc:creator>sean_owen</dc:creator>
      <dc:date>2021-06-17T18:23:47Z</dc:date>
    </item>
  </channel>
</rss>

