<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Updating tables from SQL Server to Databricks in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/updating-tables-from-sql-server-to-databricks/m-p/38061#M26550</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I have SQL Server tables which are the primary location for all live transactions happen and currently I read them through pyspark as dataframes and overwrite them everyday to have the latest copy of them in Databricks. The problem is it takes long time to copy all tables from SQL Server to Databricks and the tables don't have a primary key like an id, some tables have timestamps. The question is how can I read only the changes and write them to my Databricks tables in an efficient way without the need to rewrite the whole table again and again?&lt;/P&gt;</description>
    <pubDate>Thu, 20 Jul 2023 22:24:22 GMT</pubDate>
    <dc:creator>YS1</dc:creator>
    <dc:date>2023-07-20T22:24:22Z</dc:date>
    <item>
      <title>Updating tables from SQL Server to Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/updating-tables-from-sql-server-to-databricks/m-p/38061#M26550</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I have SQL Server tables which are the primary location for all live transactions happen and currently I read them through pyspark as dataframes and overwrite them everyday to have the latest copy of them in Databricks. The problem is it takes long time to copy all tables from SQL Server to Databricks and the tables don't have a primary key like an id, some tables have timestamps. The question is how can I read only the changes and write them to my Databricks tables in an efficient way without the need to rewrite the whole table again and again?&lt;/P&gt;</description>
      <pubDate>Thu, 20 Jul 2023 22:24:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/updating-tables-from-sql-server-to-databricks/m-p/38061#M26550</guid>
      <dc:creator>YS1</dc:creator>
      <dc:date>2023-07-20T22:24:22Z</dc:date>
    </item>
    <item>
      <title>Re: Updating tables from SQL Server to Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/updating-tables-from-sql-server-to-databricks/m-p/38205#M26573</link>
      <description>&lt;P&gt;Do you have any audit column in the sql server table which can be used for incremental loading?&lt;/P&gt;&lt;P&gt;Is Sql server table full load each time?&lt;/P&gt;</description>
      <pubDate>Sat, 22 Jul 2023 23:46:02 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/updating-tables-from-sql-server-to-databricks/m-p/38205#M26573</guid>
      <dc:creator>Stephen678</dc:creator>
      <dc:date>2023-07-22T23:46:02Z</dc:date>
    </item>
    <item>
      <title>Re: Updating tables from SQL Server to Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/updating-tables-from-sql-server-to-databricks/m-p/38208#M26575</link>
      <description>&lt;P&gt;For the first question, some tables have pkid or timestamp columns, but some don't.&lt;/P&gt;&lt;P&gt;For the second question, some tables gets only appended to while others are truncated and created again (on SQL Server) but on Databricks currently we overwrite them everyday.&lt;/P&gt;</description>
      <pubDate>Sun, 23 Jul 2023 04:10:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/updating-tables-from-sql-server-to-databricks/m-p/38208#M26575</guid>
      <dc:creator>YS1</dc:creator>
      <dc:date>2023-07-23T04:10:49Z</dc:date>
    </item>
    <item>
      <title>Re: Updating tables from SQL Server to Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/updating-tables-from-sql-server-to-databricks/m-p/38213#M26578</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/83285"&gt;@YS1&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Hope you are well. Just wanted to see if you were able to find an answer to your question and would you like to mark an answer as best? It would be really helpful for the other members too.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Cheers!&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 23 Jul 2023 04:37:42 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/updating-tables-from-sql-server-to-databricks/m-p/38213#M26578</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-07-23T04:37:42Z</dc:date>
    </item>
  </channel>
</rss>

