<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Is it ok to join on the decimal type fields? How does it affect the performance? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/is-it-ok-to-join-on-the-decimal-type-fields-how-does-it-affect/m-p/19703#M13245</link>
    <description />
    <pubDate>Wed, 30 Nov 2022 13:13:04 GMT</pubDate>
    <dc:creator>DK03</dc:creator>
    <dc:date>2022-11-30T13:13:04Z</dc:date>
    <item>
      <title>Is it ok to join on the decimal type fields? How does it affect the performance?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-ok-to-join-on-the-decimal-type-fields-how-does-it-affect/m-p/19703#M13245</link>
      <description />
      <pubDate>Wed, 30 Nov 2022 13:13:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-ok-to-join-on-the-decimal-type-fields-how-does-it-affect/m-p/19703#M13245</guid>
      <dc:creator>DK03</dc:creator>
      <dc:date>2022-11-30T13:13:04Z</dc:date>
    </item>
    <item>
      <title>Re: Is it ok to join on the decimal type fields? How does it affect the performance?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-ok-to-join-on-the-decimal-type-fields-how-does-it-affect/m-p/19704#M13246</link>
      <description>&lt;P&gt;Sure it is ok.&lt;/P&gt;&lt;P&gt;The performance of a join is mainly determined by the shuffle itself, potential data skew and the type of join (broadcasthasjoin, shufflehashjoin etc)&lt;/P&gt;</description>
      <pubDate>Wed, 30 Nov 2022 13:17:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-ok-to-join-on-the-decimal-type-fields-how-does-it-affect/m-p/19704#M13246</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-11-30T13:17:09Z</dc:date>
    </item>
    <item>
      <title>Re: Is it ok to join on the decimal type fields? How does it affect the performance?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-ok-to-join-on-the-decimal-type-fields-how-does-it-affect/m-p/19705#M13247</link>
      <description>&lt;P&gt;As @Werner Stinckens​&amp;nbsp;said, it would be ok. But generally decimal column joins are not recommended as other factors come into play like the precision, length etc..&lt;/P&gt;&lt;P&gt;.Also when you are joining in on decimal columns, be sure to check out the abs value of the difference between the two column values is nearly 0 basis on your requirement. e.g. 0.0000000001. This is because you don't want to mess up the join just because of a 1 in a billion difference or error which might be creeping in because of some transformations somewhere or data quality issues. &lt;/P&gt;&lt;P&gt;Do research well before doing joins on decimal columns.&lt;/P&gt;&lt;P&gt;Cheers. &lt;/P&gt;</description>
      <pubDate>Wed, 30 Nov 2022 18:02:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-ok-to-join-on-the-decimal-type-fields-how-does-it-affect/m-p/19705#M13247</guid>
      <dc:creator>UmaMahesh1</dc:creator>
      <dc:date>2022-11-30T18:02:46Z</dc:date>
    </item>
  </channel>
</rss>

