<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Databricks Connecting to ADLS Gen2 vs Azure SQL in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/databricks-connecting-to-adls-gen2-vs-azure-sql/m-p/122304#M46736</link>
    <description>&lt;P class=""&gt;&lt;SPAN&gt;Use&amp;nbsp;&lt;/SPAN&gt;Azure SQL Spark Connector. This method allows Databricks to read from and write to Azure SQL Database efficiently, supporting both bulk operations and secure authentication.&lt;/P&gt;&lt;P&gt;Azure sql :&amp;nbsp;&lt;SPAN&gt;Install connector, configure JDBC, use Key Vault, set permissions&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Adls Gen2 : Register service principal, assign roles, use Key Vault, configure Spark, secure networking&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;For new deployments, Databricks recommends using&amp;nbsp;&lt;STRONG&gt;Unity Catalog&lt;/STRONG&gt;&amp;nbsp;to manage and secure access to cloud storage, including ADLS Gen2, as it simplifies governance and access control&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 20 Jun 2025 03:43:57 GMT</pubDate>
    <dc:creator>kavithai</dc:creator>
    <dc:date>2025-06-20T03:43:57Z</dc:date>
    <item>
      <title>Databricks Connecting to ADLS Gen2 vs Azure SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-connecting-to-adls-gen2-vs-azure-sql/m-p/122200#M46695</link>
      <description>&lt;P&gt;What is the best approach to connect Databricks with Azure SQL or connect Databricks with ADLS Gen2.&lt;/P&gt;&lt;P&gt;I am designing the system where I am planning to Integrate Databricks to Azure.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;May someone share experience Pros and cons of approach and best practices for&amp;nbsp;connecting Databricks with Azure SQL or connect Databricks with ADLS Gen2.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Thanks a lot&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 19 Jun 2025 04:12:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-connecting-to-adls-gen2-vs-azure-sql/m-p/122200#M46695</guid>
      <dc:creator>Datamate</dc:creator>
      <dc:date>2025-06-19T04:12:12Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Connecting to ADLS Gen2 vs Azure SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-connecting-to-adls-gen2-vs-azure-sql/m-p/122279#M46729</link>
      <description>&lt;P&gt;&lt;BR /&gt;Both Azure SQL and ADLS Gen2 can be integrated with Databricks, but the use case and best practices differ.&lt;/P&gt;&lt;P&gt;### &lt;span class="lia-unicode-emoji" title=":link:"&gt;🔗&lt;/span&gt; **Connecting Databricks to ADLS Gen2**&lt;/P&gt;&lt;P&gt;**Best for:** Storing large volumes of raw/semi-structured/structured data (data lake use cases).&lt;/P&gt;&lt;P&gt;**Approach:**&lt;/P&gt;&lt;P&gt;* Use **Azure Data Lake Storage Gen2** with **ABFS (Azure Blob File System)** or **OAuth (Service Principal or Managed Identity)** for secure access.&lt;BR /&gt;* Mount the ADLS Gen2 path using `dbutils.fs.mount()` (if needed) or access it directly with ABFS path.&lt;/P&gt;&lt;P&gt;**Pros:**&lt;/P&gt;&lt;P&gt;* Scalable and cost-effective for large data volumes.&lt;BR /&gt;* Native support in Databricks for Delta Lake (ACID).&lt;BR /&gt;* Supports Medallion Architecture (Bronze, Silver, Gold).&lt;BR /&gt;* Ideal for big data and ML pipelines.&lt;/P&gt;&lt;P&gt;**Cons:**&lt;/P&gt;&lt;P&gt;* Not ideal for transactional updates (OLTP-style).&lt;BR /&gt;* Slightly more setup required for secure integration (Key Vault, Service Principal, etc.).&lt;/P&gt;&lt;P&gt;---&lt;/P&gt;&lt;P&gt;### &lt;span class="lia-unicode-emoji" title=":link:"&gt;🔗&lt;/span&gt; **Connecting Databricks to Azure SQL Database**&lt;/P&gt;&lt;P&gt;**Best for:** Reading/writing smaller volumes of structured data or integrating with existing RDBMS systems.&lt;/P&gt;&lt;P&gt;**Approach:**&lt;/P&gt;&lt;P&gt;* Use the **JDBC driver** to read/write from Azure SQL.&lt;BR /&gt;* Use **Azure Key Vault** to securely store credentials.&lt;BR /&gt;* Consider **incremental loads** with change tracking or timestamps.&lt;/P&gt;&lt;P&gt;**Pros:**&lt;/P&gt;&lt;P&gt;* Good for point lookups, dimension tables, and operational reporting.&lt;BR /&gt;* Easy to query structured, relational data.&lt;BR /&gt;* Suitable for lightweight transactional writes.&lt;/P&gt;&lt;P&gt;**Cons:**&lt;/P&gt;&lt;P&gt;* Not optimized for large batch reads/writes.&lt;BR /&gt;* Limited parallelism compared to data lake storage.&lt;BR /&gt;* Higher cost for data movement at scale.&lt;/P&gt;&lt;P&gt;---&lt;/P&gt;&lt;P&gt;### &lt;span class="lia-unicode-emoji" title=":white_heavy_check_mark:"&gt;✅&lt;/span&gt; **Best Practice Suggestion**&lt;/P&gt;&lt;P&gt;* **Use ADLS Gen2** as your **primary data lake** for raw and processed data.&lt;BR /&gt;* **Connect to Azure SQL** for:&lt;/P&gt;&lt;P&gt;* pulling reference data (dimensions, lookup tables),&lt;BR /&gt;* writing back small outputs (like reporting metrics or curated outputs).&lt;/P&gt;&lt;P&gt;Combining both is often the most robust architecture.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 19 Jun 2025 17:12:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-connecting-to-adls-gen2-vs-azure-sql/m-p/122279#M46729</guid>
      <dc:creator>Yogesh_Verma_</dc:creator>
      <dc:date>2025-06-19T17:12:29Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Connecting to ADLS Gen2 vs Azure SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-connecting-to-adls-gen2-vs-azure-sql/m-p/122304#M46736</link>
      <description>&lt;P class=""&gt;&lt;SPAN&gt;Use&amp;nbsp;&lt;/SPAN&gt;Azure SQL Spark Connector. This method allows Databricks to read from and write to Azure SQL Database efficiently, supporting both bulk operations and secure authentication.&lt;/P&gt;&lt;P&gt;Azure sql :&amp;nbsp;&lt;SPAN&gt;Install connector, configure JDBC, use Key Vault, set permissions&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Adls Gen2 : Register service principal, assign roles, use Key Vault, configure Spark, secure networking&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;For new deployments, Databricks recommends using&amp;nbsp;&lt;STRONG&gt;Unity Catalog&lt;/STRONG&gt;&amp;nbsp;to manage and secure access to cloud storage, including ADLS Gen2, as it simplifies governance and access control&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 20 Jun 2025 03:43:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-connecting-to-adls-gen2-vs-azure-sql/m-p/122304#M46736</guid>
      <dc:creator>kavithai</dc:creator>
      <dc:date>2025-06-20T03:43:57Z</dc:date>
    </item>
  </channel>
</rss>

