
Connect Databricks hosted on Azure, with RDS on AWS.

danial
New Contributor II

We have Databricks set up and running on Azure. Now we want to connect it to RDS (AWS) so that we can transfer data from RDS to Azure Data Lake using Databricks.

I could find documentation on how to do this within a single cloud (either AWS or Azure), but not across clouds.

So I was wondering what the best approach would be. Also, the RDS instance has an additional layer of basic HTTP authentication for extra security.

Highly appreciate the help.

2 REPLIES

Anonymous
Not applicable

@Danial Malik:

To connect Databricks running on Azure to RDS running on AWS, you can use JDBC or ODBC connectivity. Spark's built-in DataFrame reader works over JDBC, so that is usually the simpler choice.

Here are the general steps:

  1. Set up a site-to-site Virtual Private Network (VPN) or another private connection between your Azure and AWS networks so that traffic between them is secured, and make sure the RDS security group allows inbound traffic from the Databricks cluster's address range.
  2. Make sure the JDBC or ODBC driver is available on the Databricks cluster. RDS hosts a standard database engine (for example MySQL or PostgreSQL), so the driver you need is the one for that engine. You can find driver download and installation instructions in the AWS RDS documentation for your engine.
  3. Configure the JDBC or ODBC connection with the appropriate driver, database URL (the RDS endpoint, port, and database name), username, and password. You may also need to set additional parameters, such as SSL settings, depending on your RDS configuration.
  4. Write Spark code in Databricks to read data from RDS over that connection, and then write the data to Azure Data Lake.
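
Steps 3 and 4 can be sketched roughly as follows. This is a minimal illustration, not a tested setup for your environment: it assumes the RDS engine is PostgreSQL, that SSL is requested through the `sslmode` URL parameter, and that the cluster is already authorized to write to the storage account. The helper names (`build_jdbc_url`, `build_reader_options`, `copy_table_to_adls`) and all hosts, tables, and paths are hypothetical.

```python
from urllib.parse import urlencode


def build_jdbc_url(host, port, database, engine="postgresql", params=None):
    """Build a JDBC URL for the database engine running on RDS.

    `engine` is an assumption (PostgreSQL); use the prefix of your engine.
    """
    url = f"jdbc:{engine}://{host}:{port}/{database}"
    if params:
        # Extra driver parameters (e.g. SSL settings) go in the query string.
        url += "?" + urlencode(params)
    return url


def build_reader_options(url, table, user, password, fetch_size=10000):
    """Collect the options passed to Spark's JDBC data source."""
    return {
        "url": url,
        "dbtable": table,
        "user": user,
        "password": password,
        # Rows fetched per round trip; helps on a high-latency cross-cloud link.
        "fetchsize": str(fetch_size),
    }


def copy_table_to_adls(spark, options, target_path):
    """Read one table over JDBC and write it to Azure Data Lake as Parquet.

    `spark` is the SparkSession that Databricks notebooks provide.
    """
    df = spark.read.format("jdbc").options(**options).load()
    df.write.mode("overwrite").parquet(target_path)
    return df


# Hypothetical values for illustration only.
url = build_jdbc_url(
    "mydb.example.rds.amazonaws.com", 5432, "sales", params={"sslmode": "require"}
)
options = build_reader_options(url, "public.orders", "reader", "<password>")
print(url)
```

In a notebook you would then call something like `copy_table_to_adls(spark, options, "abfss://<container>@<account>.dfs.core.windows.net/raw/orders")`, and read the password from a secret scope with `dbutils.secrets.get(scope=..., key=...)` rather than hard-coding it.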

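Regarding the additional layer of basic HTTP authentication on RDS, you will need to provide those credentials as well when you configure the connection. How they are supplied depends on which component enforces the HTTP layer, because the JDBC or ODBC connection itself is configured with the database username and password.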

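Keep in mind that cross-cloud data transfer has performance and security implications: the data crosses the network boundary between AWS and Azure, which adds latency and usually egress charges. Encrypt the connection, transfer only the rows and columns you need, and consider reading in parallel so that the transfer stays efficient and secure.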
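
One common way to speed up a large transfer is to let Spark read the table in parallel over several JDBC connections. The sketch below only builds the option dictionary; `partitionColumn`, `lowerBound`, `upperBound`, and `numPartitions` are options of Spark's JDBC data source, while the helper name `with_partitioning` and the example values are hypothetical. It assumes the table has a numeric, date, or timestamp column that is reasonably evenly distributed.

```python
def with_partitioning(base_options, column, lower, upper, num_partitions):
    """Return a copy of the JDBC options with parallel-read settings added.

    The bounds only decide how the column range is split into partitions;
    they do not filter rows, so values outside the range are still read.
    """
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")
    if lower >= upper:
        raise ValueError("lower must be smaller than upper")
    options = dict(base_options)  # leave the caller's dictionary untouched
    options.update(
        {
            "partitionColumn": column,
            "lowerBound": str(lower),
            "upperBound": str(upper),
            # Also caps the number of concurrent connections to the database.
            "numPartitions": str(num_partitions),
        }
    )
    return options


# Hypothetical base options; in practice these carry the real URL and credentials.
base = {"url": "jdbc:postgresql://<host>:5432/<db>", "dbtable": "public.orders"}
parallel = with_partitioning(base, "order_id", 1, 1_000_000, 8)
print(parallel["numPartitions"])
```

The resulting dictionary would be passed to `spark.read.format("jdbc").options(**parallel).load()`. Keep `numPartitions` modest, since each partition opens its own connection to the RDS instance across the VPN.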

Hope this helps!

Anonymous
Not applicable

Hi @Danial Malik

Hope everything is going great.

Just wanted to check in on whether you were able to resolve your issue. If so, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please let us know so we can help you.

Cheers!
