<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic While accessing the data on recipient side using delta_sharing.load_table_changes_as_spark(), it shows data of all versions. in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/while-accessing-the-data-on-recipient-side-using-delta-sharing/m-p/15197#M9554</link>
    <description>&lt;P&gt;When I tried to access specific version data and set the arguments value to the specific number, I get all version data.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data1 = delta_sharing.load_table_changes_as_spark(table_url, starting_version=1, ending_version=1)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data2 = delta_sharing.load_table_changes_as_spark(table_url, starting_version=2, ending_version=2)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Here data1 and data2 gives the same data. When I check the same version data using load_table_changes_as_pandas(), it gives specific version data.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data1 = delta_sharing.load_table_changes_as_pandas(table_url, starting_version=1, ending_version=1)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data2 = delta_sharing.load_table_changes_as_pandas(table_url, starting_version=2, ending_version=2)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In the pandas scenario, data1 is having version 1 data and data2 is having version 2 data. Both of these, data1 and data2, are having different data which was as expected.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;What we have to do to get the specific version data in spark dataframe using  load_table_changes_as_spark function?&lt;/P&gt;</description>
    <pubDate>Thu, 22 Dec 2022 05:04:43 GMT</pubDate>
    <dc:creator>Mahesh_789</dc:creator>
    <dc:date>2022-12-22T05:04:43Z</dc:date>
    <item>
      <title>While accessing the data on recipient side using delta_sharing.load_table_changes_as_spark(), it shows data of all versions.</title>
      <link>https://community.databricks.com/t5/data-engineering/while-accessing-the-data-on-recipient-side-using-delta-sharing/m-p/15197#M9554</link>
      <description>&lt;P&gt;When I tried to access specific version data and set the arguments value to the specific number, I get all version data.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data1 = delta_sharing.load_table_changes_as_spark(table_url, starting_version=1, ending_version=1)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data2 = delta_sharing.load_table_changes_as_spark(table_url, starting_version=2, ending_version=2)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Here data1 and data2 gives the same data. When I check the same version data using load_table_changes_as_pandas(), it gives specific version data.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data1 = delta_sharing.load_table_changes_as_pandas(table_url, starting_version=1, ending_version=1)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data2 = delta_sharing.load_table_changes_as_pandas(table_url, starting_version=2, ending_version=2)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In the pandas scenario, data1 is having version 1 data and data2 is having version 2 data. Both of these, data1 and data2, are having different data which was as expected.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;What we have to do to get the specific version data in spark dataframe using  load_table_changes_as_spark function?&lt;/P&gt;</description>
      <pubDate>Thu, 22 Dec 2022 05:04:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/while-accessing-the-data-on-recipient-side-using-delta-sharing/m-p/15197#M9554</guid>
      <dc:creator>Mahesh_789</dc:creator>
      <dc:date>2022-12-22T05:04:43Z</dc:date>
    </item>
  </channel>
</rss>

