<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to write Change Data from Delta Lake to aws dynamodb in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28266#M20089</link>
    <description>&lt;P&gt;@Werner Stinckens​&amp;nbsp;Thanks for the reply&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have gone through the CDF docs. But was wondering if there is some connector available to write data from Delta Table directly to dynamodb.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Eg: Read data from Delta table with change data mode on and write to dynamodb directly.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Or are you suggesting to get the Change Data and then write to DynamoDB&lt;/P&gt;</description>
    <pubDate>Mon, 14 Feb 2022 16:06:47 GMT</pubDate>
    <dc:creator>hari</dc:creator>
    <dc:date>2022-02-14T16:06:47Z</dc:date>
    <item>
      <title>How to write Change Data from Delta Lake to aws dynamodb</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28260#M20083</link>
      <description>&lt;P&gt;Is there some direct way to write data from DeltaLake to AWS DynamoDB.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If there is none, Is there any way to do the same.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Feb 2022 12:47:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28260#M20083</guid>
      <dc:creator>hari</dc:creator>
      <dc:date>2022-02-14T12:47:14Z</dc:date>
    </item>
    <item>
      <title>Re: How to write Change Data from Delta Lake to aws dynamodb</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28261#M20084</link>
      <description>&lt;P&gt;This is something that might be achievable via delta live tables and delta sharing in future.&lt;/P&gt;&lt;P&gt;But for now, what you can do is track changes in delta tables when you do an upsert. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;To know more you'll have to describe how your delta tables are getting updated? What's the source, frequency of data operations in delta lake?&lt;/P&gt;</description>
      <pubDate>Mon, 14 Feb 2022 13:15:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28261#M20084</guid>
      <dc:creator>AmanSehgal</dc:creator>
      <dc:date>2022-02-14T13:15:27Z</dc:date>
    </item>
    <item>
      <title>Re: How to write Change Data from Delta Lake to aws dynamodb</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28262#M20085</link>
      <description>&lt;P&gt;The data will be updated as batches nightly&lt;/P&gt;</description>
      <pubDate>Mon, 14 Feb 2022 14:12:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28262#M20085</guid>
      <dc:creator>hari</dc:creator>
      <dc:date>2022-02-14T14:12:43Z</dc:date>
    </item>
    <item>
      <title>Re: How to write Change Data from Delta Lake to aws dynamodb</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28263#M20086</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;P&gt;A way&amp;nbsp;to track INSERT/UPDATE/DELETE would be to split out your upsert process as a 3 part process. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Step1: Include an update timestamp field in your target table.&lt;/P&gt;&lt;P&gt;Step2: In your incoming refresh dataset, add a update timestamp column and set it to current timestamp.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;UPDATE count&lt;/B&gt;&lt;/P&gt;&lt;P&gt;Step3: Run upsert job with just whenMatchedUpdate clause. Count rows that have current timestamp for update.&lt;/P&gt;&lt;P&gt;Also, get an extract of the rows with current timestamp - these are the rows that you'll UPDATE in DyanmoDB&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;INSERT count&lt;/B&gt;&lt;/P&gt;&lt;P&gt;Step5: Get the row count of table - RC1.&lt;/P&gt;&lt;P&gt;Step6: Run upsert job with just whenNotMatchedInsert clause. Get row count of table after insert - RC2&lt;/P&gt;&lt;P&gt;INSERT count = RC2-RC1&lt;/P&gt;&lt;P&gt;Get an extract of the rows with current timestamp and perform a left-anti join with UPDATE df - these are the rows that you'll INSERT in DyanmoDB&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;DELETE count&lt;/B&gt;&lt;/P&gt;&lt;P&gt;Step7: Get the row count of table - RC1.&lt;/P&gt;&lt;P&gt;Step8: Run upsert job with just whenNotMatchedDelete clause. Get row count of table after insert - RC2&lt;/P&gt;&lt;P&gt;DELETE count = RC1-RC2&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;From your source data, there would be a way to identify the records that are to be deleted. Grab those indices/primary keys to delete records.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Feb 2022 14:43:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28263#M20086</guid>
      <dc:creator>AmanSehgal</dc:creator>
      <dc:date>2022-02-14T14:43:08Z</dc:date>
    </item>
    <item>
      <title>Re: How to write Change Data from Delta Lake to aws dynamodb</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28264#M20087</link>
      <description>&lt;P&gt;there is change data feed on delta tables:&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.databricks.com/delta/delta-change-data-feed.html" alt="https://docs.databricks.com/delta/delta-change-data-feed.html" target="_blank"&gt;https://docs.databricks.com/delta/delta-change-data-feed.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 14 Feb 2022 15:27:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28264#M20087</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-02-14T15:27:54Z</dc:date>
    </item>
    <item>
      <title>Re: How to write Change Data from Delta Lake to aws dynamodb</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28265#M20088</link>
      <description>&lt;P&gt;@Aman Sehgal​&amp;nbsp; Thanks for the reply&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Are you suggesting collecting the update/delete set and then manually writing to dynamodb?&lt;/P&gt;</description>
      <pubDate>Mon, 14 Feb 2022 16:04:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28265#M20088</guid>
      <dc:creator>hari</dc:creator>
      <dc:date>2022-02-14T16:04:56Z</dc:date>
    </item>
    <item>
      <title>Re: How to write Change Data from Delta Lake to aws dynamodb</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28266#M20089</link>
      <description>&lt;P&gt;@Werner Stinckens​&amp;nbsp;Thanks for the reply&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have gone through the CDF docs. But was wondering if there is some connector available to write data from Delta Table directly to dynamodb.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Eg: Read data from Delta table with change data mode on and write to dynamodb directly.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Or are you suggesting to get the Change Data and then write to DynamoDB&lt;/P&gt;</description>
      <pubDate>Mon, 14 Feb 2022 16:06:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28266#M20089</guid>
      <dc:creator>hari</dc:creator>
      <dc:date>2022-02-14T16:06:47Z</dc:date>
    </item>
    <item>
      <title>Re: How to write Change Data from Delta Lake to aws dynamodb</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28267#M20090</link>
      <description>&lt;P&gt;I suggest fetching the change data and write that to dynamodb indeed.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Feb 2022 16:17:31 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28267#M20090</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-02-14T16:17:31Z</dc:date>
    </item>
    <item>
      <title>Re: How to write Change Data from Delta Lake to aws dynamodb</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28268#M20091</link>
      <description>&lt;P&gt;Hi @Harikrishnan P H​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Did @Werner Stinckens​&amp;nbsp;reply help you to resolved your issue? if yes, please mark it as best. if not, please let us know.&lt;/P&gt;</description>
      <pubDate>Tue, 15 Mar 2022 00:55:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-write-change-data-from-delta-lake-to-aws-dynamodb/m-p/28268#M20091</guid>
      <dc:creator>jose_gonzalez</dc:creator>
      <dc:date>2022-03-15T00:55:55Z</dc:date>
    </item>
  </channel>
</rss>

