<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to build architecture for Batch as well Stream Data Pipeline in Databricks in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-to-build-architecture-for-batch-as-well-stream-data-pipeline/m-p/122604#M46824</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I am planning to Create a Data Lake house using Azure and Databricks.&lt;/P&gt;&lt;P&gt;Earlier i planned to do with Azure, but use cases looks complex.&lt;/P&gt;&lt;P&gt;Can someone please help me with suggestions.&lt;/P&gt;&lt;P&gt;Source System : SAP, SALESFORCE, SAP CAR, Adobe Clickstream.&lt;/P&gt;&lt;P&gt;Consumer : Salesforce, Spryker, Mad Mobile [API led Integration]&lt;/P&gt;&lt;P&gt;How to Handle Analytical Data&lt;/P&gt;&lt;P&gt;How to Handle Transactional data&lt;/P&gt;&lt;P&gt;Error Handling and connectivity&lt;/P&gt;&lt;P&gt;Real time data consume by spryker every 15 sec.&lt;/P&gt;&lt;P&gt;Thanks a lot for suggestion&lt;/P&gt;</description>
    <pubDate>Tue, 24 Jun 2025 04:39:01 GMT</pubDate>
    <dc:creator>Pratikmsbsvm</dc:creator>
    <dc:date>2025-06-24T04:39:01Z</dc:date>
    <item>
      <title>How to build architecture for Batch as well Stream Data Pipeline in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-build-architecture-for-batch-as-well-stream-data-pipeline/m-p/122604#M46824</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I am planning to Create a Data Lake house using Azure and Databricks.&lt;/P&gt;&lt;P&gt;Earlier i planned to do with Azure, but use cases looks complex.&lt;/P&gt;&lt;P&gt;Can someone please help me with suggestions.&lt;/P&gt;&lt;P&gt;Source System : SAP, SALESFORCE, SAP CAR, Adobe Clickstream.&lt;/P&gt;&lt;P&gt;Consumer : Salesforce, Spryker, Mad Mobile [API led Integration]&lt;/P&gt;&lt;P&gt;How to Handle Analytical Data&lt;/P&gt;&lt;P&gt;How to Handle Transactional data&lt;/P&gt;&lt;P&gt;Error Handling and connectivity&lt;/P&gt;&lt;P&gt;Real time data consume by spryker every 15 sec.&lt;/P&gt;&lt;P&gt;Thanks a lot for suggestion&lt;/P&gt;</description>
      <pubDate>Tue, 24 Jun 2025 04:39:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-build-architecture-for-batch-as-well-stream-data-pipeline/m-p/122604#M46824</guid>
      <dc:creator>Pratikmsbsvm</dc:creator>
      <dc:date>2025-06-24T04:39:01Z</dc:date>
    </item>
    <item>
      <title>Re: How to build architecture for Batch as well Stream Data Pipeline in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-build-architecture-for-batch-as-well-stream-data-pipeline/m-p/122803#M46874</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/143693"&gt;@Pratikmsbsvm&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;The appropriate approach would be:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Data Ingestion:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Ingest data from SAP, SAP CAR, and Salesforce using Azure Data Factory or third-party connectors. For near real-time updates, enable CDC-based ingestion.&lt;BR /&gt;&lt;BR /&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Data Lakehouse Storage:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Store all raw data in Azure Data Lake Storage (ADLS) as Delta Lake tables to ensure ACID transactions and reliable data handling.&lt;BR /&gt;&lt;BR /&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Analytical Data Handling:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Use Databricks SQL to power BI dashboards, reports, and analytical workloads on top of your gold layer.&lt;BR /&gt;&lt;BR /&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Data Processing:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Organize data using the Medallion architecture:&lt;UL&gt;&lt;LI&gt;Bronze - Raw ingested data&lt;/LI&gt;&lt;LI&gt;Silver - Cleaned and conformed data&lt;/LI&gt;&lt;LI&gt;Gold - Aggregated, business-ready data for reporting and consumption&lt;BR /&gt;&lt;BR /&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Real-Time Delivery:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;For Spryker’s 15-second real-time requirement, use Databricks Structured Streaming with Azure Event Hubs or Kafka.&lt;/LI&gt;&lt;LI&gt;Serve data to consumers like Salesforce, Spryker, and Mad Mobile via APIs or by sharing gold tables through REST endpoints or direct access.&lt;BR /&gt;&lt;BR /&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Error Handling &amp;amp; Monitoring:&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;Monitor pipelines using Azure Monitor and Databricks system tables to catch failures or delays early.&lt;/LI&gt;&lt;LI&gt;Set up alerts and logging to track job health and ensure data quality across the pipeline.&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Wed, 25 Jun 2025 12:11:44 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-build-architecture-for-batch-as-well-stream-data-pipeline/m-p/122803#M46874</guid>
      <dc:creator>SP_6721</dc:creator>
      <dc:date>2025-06-25T12:11:44Z</dc:date>
    </item>
    <item>
      <title>Re: How to build architecture for Batch as well Stream Data Pipeline in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-build-architecture-for-batch-as-well-stream-data-pipeline/m-p/122841#M46883</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/156441"&gt;@SP_6721&lt;/a&gt;&amp;nbsp;: Thanks a lot. but how to handle Transactional data. do I need to add Azure SQL ? Please suggest.&lt;/P&gt;</description>
      <pubDate>Wed, 25 Jun 2025 14:44:58 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-build-architecture-for-batch-as-well-stream-data-pipeline/m-p/122841#M46883</guid>
      <dc:creator>Pratikmsbsvm</dc:creator>
      <dc:date>2025-06-25T14:44:58Z</dc:date>
    </item>
  </channel>
</rss>

