<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Set up connection to on prem sql server in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/set-up-connection-to-on-prem-sql-server/m-p/69166#M33832</link>
    <description>&lt;P&gt;Two other things we would generally recommend are 1.) Direct Connect and 2.) Using AWS DMS, or any other CDC tool set.&amp;nbsp; There are lots.&amp;nbsp; We have Query Federation now in the Unity Catalog, so you could set up the connection that way, but JDBC/ODBC connections are prone to errors / dropped connections, so for production data engineering tasks its better to load into S3 using a CDC tool first and use Databricks to load the data after.&amp;nbsp; If that's too heavy of a lift, then its pretty easy to use pyodbc and the sql server driver on a cluster once you setup the network access.&lt;/P&gt;</description>
    <pubDate>Thu, 16 May 2024 13:44:25 GMT</pubDate>
    <dc:creator>Cary</dc:creator>
    <dc:date>2024-05-16T13:44:25Z</dc:date>
    <item>
      <title>Set up connection to on prem sql server</title>
      <link>https://community.databricks.com/t5/data-engineering/set-up-connection-to-on-prem-sql-server/m-p/69159#M33827</link>
      <description>&lt;P&gt;Ive just set up our databricks environment. Hosted in AWS.&amp;nbsp;&lt;/P&gt;&lt;P&gt;We have an on prem SQL server and would like to connect .&amp;nbsp;&lt;/P&gt;&lt;P&gt;How can i do that?&lt;/P&gt;</description>
      <pubDate>Thu, 16 May 2024 12:32:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/set-up-connection-to-on-prem-sql-server/m-p/69159#M33827</guid>
      <dc:creator>Silabs</dc:creator>
      <dc:date>2024-05-16T12:32:13Z</dc:date>
    </item>
    <item>
      <title>Re: Set up connection to on prem sql server</title>
      <link>https://community.databricks.com/t5/data-engineering/set-up-connection-to-on-prem-sql-server/m-p/69163#M33830</link>
      <description>&lt;P&gt;The connection itself can be done using ODBC/JDBC, but that is not the issue.&lt;BR /&gt;What you need to achieve is allowing AWS to connect to the on-prem server.&lt;BR /&gt;On AWS you can do that using PrivateLink and a VPC Endpoint, or use a VPN.&lt;BR /&gt;There might be more though (haven't worked in AWS for a while).&lt;/P&gt;</description>
      <pubDate>Thu, 16 May 2024 13:11:20 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/set-up-connection-to-on-prem-sql-server/m-p/69163#M33830</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2024-05-16T13:11:20Z</dc:date>
    </item>
    <item>
      <title>Re: Set up connection to on prem sql server</title>
      <link>https://community.databricks.com/t5/data-engineering/set-up-connection-to-on-prem-sql-server/m-p/69166#M33832</link>
      <description>&lt;P&gt;Two other things we would generally recommend are 1.) Direct Connect and 2.) Using AWS DMS, or any other CDC tool set.&amp;nbsp; There are lots.&amp;nbsp; We have Query Federation now in the Unity Catalog, so you could set up the connection that way, but JDBC/ODBC connections are prone to errors / dropped connections, so for production data engineering tasks its better to load into S3 using a CDC tool first and use Databricks to load the data after.&amp;nbsp; If that's too heavy of a lift, then its pretty easy to use pyodbc and the sql server driver on a cluster once you setup the network access.&lt;/P&gt;</description>
      <pubDate>Thu, 16 May 2024 13:44:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/set-up-connection-to-on-prem-sql-server/m-p/69166#M33832</guid>
      <dc:creator>Cary</dc:creator>
      <dc:date>2024-05-16T13:44:25Z</dc:date>
    </item>
    <item>
      <title>Re: Set up connection to on prem sql server</title>
      <link>https://community.databricks.com/t5/data-engineering/set-up-connection-to-on-prem-sql-server/m-p/69191#M33848</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/105316"&gt;@Silabs&lt;/a&gt;&amp;nbsp;good day!&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;To connect your Databricks environment (hosted on AWS) to your on-premise SQL server, follow these steps:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;1. Network Setup: Establish a connection between your SQL server and the Databricks virtual private cloud (VPC) using VPN or AWS Direct Connect. Test the network connectivity by running the command inside a Databricks notebook:&amp;nbsp;&lt;/SPAN&gt;&lt;CODE class="c-mrkdwn__code" data-stringify-type="code"&gt;%sh nc -vz &amp;lt;SQL Server IP or hostname&amp;gt; &amp;lt;port&amp;gt;&lt;/CODE&gt;&lt;SPAN&gt;. Replace&amp;nbsp;&lt;/SPAN&gt;&lt;CODE class="c-mrkdwn__code" data-stringify-type="code"&gt;&amp;lt;SQL Server IP or hostname&amp;gt;&lt;/CODE&gt;&lt;SPAN&gt;&amp;nbsp;and&amp;nbsp;&lt;/SPAN&gt;&lt;CODE class="c-mrkdwn__code" data-stringify-type="code"&gt;&amp;lt;port&amp;gt;&lt;/CODE&gt;&lt;SPAN&gt;&amp;nbsp;with your SQL Server's details.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;2. Install Databricks SQL Connector: On your development machine, install the Databricks SQL Connector for Python library using either&amp;nbsp;&lt;/SPAN&gt;&lt;CODE class="c-mrkdwn__code" data-stringify-type="code"&gt;pip install databricks-sql-connector&lt;/CODE&gt;&lt;SPAN&gt;&amp;nbsp;or&amp;nbsp;&lt;/SPAN&gt;&lt;CODE class="c-mrkdwn__code" data-stringify-type="code"&gt;python -m pip install databricks-sql-connector&lt;/CODE&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;3. Gather Connection Information: Collect the following information for the Databricks cluster or SQL warehouse: the server hostname of the cluster (available in the Advanced Options &amp;gt; JDBC/ODBC tab for your cluster) and the HTTP path of the cluster (also available in the Advanced Options &amp;gt; JDBC/ODBC tab).&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;4. Configure a Connection to SQL Server: Use the Databricks SQL Connector and the previously gathered connection information to configure a connection to your SQL Server. Ensure the user or service principal connecting to the Databricks cluster or SQL warehouse has the necessary permissions (CAN ATTACH TO, CAN RESTART, and CAN USE).&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;I hope this helps. Please let us know if you have any questions or concerns.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Kind regards,&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Yesh&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 16 May 2024 16:32:32 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/set-up-connection-to-on-prem-sql-server/m-p/69191#M33848</guid>
      <dc:creator>Yeshwanth</dc:creator>
      <dc:date>2024-05-16T16:32:32Z</dc:date>
    </item>
  </channel>
</rss>

