<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Seeking Practical Example for Structured Streaming with Delta Tables in Medallion Architecture in Administration &amp; Architecture</title>
    <link>https://community.databricks.com/t5/administration-architecture/seeking-practical-example-for-structured-streaming-with-delta/m-p/100848#M2445</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/110502"&gt;@szymon_dybczak&lt;/a&gt;&amp;nbsp;, Thank you very much. Your reply provided me with an excellent reference solution. I had been struggling with structured streaming, and your help was incredibly valuable and insightful.&lt;/P&gt;</description>
    <pubDate>Wed, 04 Dec 2024 06:33:00 GMT</pubDate>
    <dc:creator>JissMathew</dc:creator>
    <dc:date>2024-12-04T06:33:00Z</dc:date>
    <item>
      <title>Seeking Practical Example for Structured Streaming with Delta Tables in Medallion Architecture</title>
      <link>https://community.databricks.com/t5/administration-architecture/seeking-practical-example-for-structured-streaming-with-delta/m-p/100400#M2423</link>
      <description>&lt;P&gt;Hi everyone,&lt;/P&gt;&lt;P&gt;I’m working on implementing &lt;STRONG&gt;Structured Streaming&lt;/STRONG&gt; in Databricks to implement &lt;STRONG&gt;Change Data Capture (CDC)&lt;/STRONG&gt; as part of a &lt;STRONG&gt;Medallion Architecture&lt;/STRONG&gt; (Bronze, Silver, and Gold layers). While Microsoft’s documentation provides a theoretical approach, I’m looking for &lt;STRONG&gt;hands-on examples or code snippets&lt;/STRONG&gt; that you’ve successfully used in a real-world project.&lt;/P&gt;&lt;P&gt;Specifically, I’d like to understand:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;How to ingest data into a Delta table (Bronze layer) using Auto Loader or another streaming method.&lt;/LI&gt;&lt;LI&gt;How to process this data incrementally to capture changes (CDC) and propagate them to the Silver and Gold layers.&lt;/LI&gt;&lt;LI&gt;Any recommendations for configurations or optimizations to manage schema evolution and large datasets effectively.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;If anyone has experience with this and can share &lt;STRONG&gt;practical examples&lt;/STRONG&gt; or insights beyond the documentation, it would be greatly appreciated!&lt;/P&gt;&lt;P&gt;Thank you in advance!&lt;/P&gt;</description>
      <pubDate>Fri, 29 Nov 2024 08:33:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/seeking-practical-example-for-structured-streaming-with-delta/m-p/100400#M2423</guid>
      <dc:creator>JissMathew</dc:creator>
      <dc:date>2024-11-29T08:33:36Z</dc:date>
    </item>
    <item>
      <title>Re: Seeking Practical Example for Structured Streaming with Delta Tables in Medallion Architecture</title>
      <link>https://community.databricks.com/t5/administration-architecture/seeking-practical-example-for-structured-streaming-with-delta/m-p/100427#M2424</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/132169"&gt;@JissMathew&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Do you have access to Databricks Academy? I believe their data engineering track has plenty of example notebooks.&lt;BR /&gt;Or you can try dbdemos. For example, here is a demo notebook for Auto Loader:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://notebooks.databricks.com/demos/auto-loader/index.html" target="_blank" rel="noopener"&gt;Databricks Autoloader (cloudfile)&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;If you'd like to test it on your Databricks instance, just run the following:&lt;/P&gt;&lt;PRE&gt;%pip install dbdemos&lt;/PRE&gt;&lt;PRE&gt;import dbdemos&lt;BR /&gt;dbdemos.install('auto-loader')&lt;/PRE&gt;&lt;P&gt;For a CDC pipeline, you can use the following:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://www.databricks.com/resources/demos/tutorials/data-engineering/cdc-pipeline-with-delta?itm_data=demo_center" target="_blank" rel="noopener"&gt;CDC Pipeline With Delta | Databricks&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 29 Nov 2024 12:31:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/seeking-practical-example-for-structured-streaming-with-delta/m-p/100427#M2424</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2024-11-29T12:31:43Z</dc:date>
    </item>
    <item>
      <title>Re: Seeking Practical Example for Structured Streaming with Delta Tables in Medallion Architecture</title>
      <link>https://community.databricks.com/t5/administration-architecture/seeking-practical-example-for-structured-streaming-with-delta/m-p/100848#M2445</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/110502"&gt;@szymon_dybczak&lt;/a&gt;&amp;nbsp;, Thank you very much. Your reply provided me with an excellent reference solution. I had been struggling with structured streaming, and your help was incredibly valuable and insightful.&lt;/P&gt;</description>
      <pubDate>Wed, 04 Dec 2024 06:33:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/seeking-practical-example-for-structured-streaming-with-delta/m-p/100848#M2445</guid>
      <dc:creator>JissMathew</dc:creator>
      <dc:date>2024-12-04T06:33:00Z</dc:date>
    </item>
  </channel>
</rss>

