Data Engineering

Forum Posts

by gillzer84 • New Contributor

04-27-2022 8:12:51 AM

5886 Views
4 replies
5 kudos

An example how to connect to SQL Server data using windows authentication

We use SQL Server to store data. I would like to connect to SQL Server to pull, manipulate, and sometimes push data back. I've seen some examples online of connecting, but I cannot successfully re-create them.

Latest Reply

Junee
New Contributor III

06-29-2022 5:50:14 AM

5 kudos

You can use the jTDS library from Maven; add it to your cluster. Once installed, you can write the code below to connect to your database. Code in Scala will be: import java.util.Properties val driverClass = "net.sourceforge.jtds.jdbc.Driver" val serve...

3 More Replies

by Tico23 • Contributor

02-28-2023 12:07:22 PM

18031 Views
12 replies
10 kudos

Connecting SQL Server (on-premise) to Databricks via jdbc:sqlserver

Is it possible to connect to SQL Server on-premise (not Azure) from Databricks? I tried to ping my VirtualBox VM (with Windows Server 2022) from within Databricks and the request timed out. %sh ping 122.138.0.14 This is what my connection might look l...

Latest Reply

BharathKumarS
New Contributor II

09-08-2024 9:04:24 PM

10 kudos

I tried to connect to a localhost SQL Server through Databricks Community Edition, but it failed. I created an IP rule on port 1433 that allows inbound connections from all public networks, but it still didn't connect. I tried locally using Python its work...

11 More Replies

by Hardy • New Contributor III

06-19-2023 4:17:21 AM

10172 Views
5 replies
6 kudos

The driver could not establish a secure connection to SQL Server by using Secure Sockets Layer (SSL) encryption

I am trying to connect to SQL Server through JDBC from a Databricks notebook. (Below is my notebook command.) val df = spark.read.jdbc(jdbcUrl, "[MyTableName]", connectionProperties) println(df.schema) When I execute this command with DBR 10.4 LTS, it works fin...

Latest Reply

DBXC
Contributor

10-13-2023 7:28:22 AM

6 kudos

Try adding the following parameters to your SQL connection string. It fixed my problem on DBR 13.X and 12.X:
;trustServerCertificate=true;hostNameInCertificate=*.database.windows.net;
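The suggestion in this reply amounts to appending two properties to the JDBC URL. A minimal Python sketch of that string handling follows; the helper name `with_ssl_params` and the server and database names are hypothetical, not taken from the thread. Note that `trustServerCertificate=true` makes the driver skip validation of the server certificate, so it is probably better treated as a diagnostic step than as a permanent setting.

```python
def with_ssl_params(jdbc_url: str, host_in_cert: str = "*.database.windows.net") -> str:
    """Append the two SSL-related properties from the reply to a SQL Server JDBC URL."""
    # SQL Server JDBC URLs separate properties with ';', so normalise the
    # base URL to end with exactly one ';' before appending.
    base = jdbc_url.rstrip(";") + ";"
    return f"{base}trustServerCertificate=true;hostNameInCertificate={host_in_cert};"


# Hypothetical server and database names, for illustration only.
url = with_ssl_params(
    "jdbc:sqlserver://myserver.database.windows.net:1433;databaseName=mydb"
)
print(url)
```

The resulting string can then be passed wherever the notebook currently passes its JDBC URL.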

4 More Replies

by Kazer • New Contributor III

01-30-2023 1:47:33 AM

9247 Views
2 replies
1 kudos

Resolved! com.microsoft.sqlserver.jdbc.SQLServerException: The driver could not establish a secure connection to SQL Server by using Secure Sockets Layer (SSL) encryption.

Hi. I am trying to read from our Microsoft SQL Server from Azure Databricks via spark.read.jdbc() as described here: Query databases using JDBC - Azure Databricks | Microsoft Learn. The SQL Server is on an Azure VM in a virtual network peered with th...

Latest Reply

databricks26
New Contributor II

01-02-2024 8:53:22 AM

1 kudos

Hi @Kazer, even if I use a new table name, I get the same error. Do you have any suggestions? Thanks,

1 More Replies

by Chris_Shehu • Valued Contributor III

03-21-2022 9:59:31 PM

29766 Views
5 replies
5 kudos

Resolved! What is the best way to handle big data sets?

I'm trying to find the best strategy for handling big data sets. In this case I have a data set of 450 million records. I'm pulling the data from SQL Server very quickly, but when I try to push the data to the Delta table or an Azure container the...

Latest Reply

Wilynan
New Contributor II

08-11-2023 6:41:05 AM

5 kudos

I think you should consult experts in Big Data for advice on this issue

4 More Replies

by umair_hanif • New Contributor II

06-22-2023 5:12:45 AM

3150 Views
2 replies
1 kudos

Ingesting more than 7 million rows into a SQL Server Table

Hi All, I hope you're super well. I need your recommendations and a solution for my problem. I am using a Databricks instance DS12_v2, which has 28 GB RAM and 4 cores. I am ingesting 7.2 million rows into a SQL Server table and it is taking 57 min - 1 hou...

Latest Reply

Anonymous
Not applicable

06-27-2023 8:25:45 AM

1 kudos

You can try to use BULK INSERT: https://learn.microsoft.com/en-us/sql/t-sql/statements/bulk-insert-transact-sql?view=sql-server-ver16
Also, using Data Factory instead of Databricks for the copy can be helpful.

1 More Replies

by Data_Analytics_ • New Contributor II

10-14-2021 7:52:33 AM

11987 Views
3 replies
3 kudos

Resolved! Connect SQL server using windows authentication

How do I connect to an on-premises SQL Server using Windows authentication from a Databricks notebook?

Latest Reply

User16829050420
Databricks Employee

10-14-2021 8:01:12 AM

3 kudos

You need network connectivity set up from the Databricks VNet to the on-prem SQL Server. You can then connect from the Databricks notebook over JDBC using a Windows-authenticated username/password - https://docs.microsoft.com/en-us/azure/databricks/data/data-sourc...

2 More Replies

by Istuti • Contributor

01-13-2023 8:02:55 PM

3762 Views
1 reply
2 kudos

Please guide on the algorithm for masking of column in databricks which is compatible (can be unmasked) with sqlserver.

Latest Reply

Anonymous
Not applicable

04-10-2023 8:02:26 AM

2 kudos

@Istuti Gupta: There are several algorithms you can use to mask a column in Databricks in a way that is compatible with SQL Server. One commonly used algorithm is called pseudonymization or tokenization. Here's an example of how you can implement pse...

by andrew0117 • Contributor

03-21-2023 4:35:19 PM

3049 Views
3 replies
2 kudos

Resolved! Will a table backed by a SQL server database table automatically get updated if the base table in SQL server database is updated?

If I create a table using the code below:

CREATE TABLE IF NOT EXISTS jdbcTable
using org.apache.spark.sql.jdbc
options (
  url "sql_server_url",
  dbtable "sqlserverTable",
  user "username",
  password "password"
)

will jdbcTable always be automatically sync...

Latest Reply

pvignesh92
Honored Contributor

03-22-2023 1:30:19 AM

2 kudos

Hi @andrew li, there is a feature introduced in DBR 11 where you can directly ingest data into a table from a selected list of sources. As you are creating a table, I believe this command will create a managed table by loading the data from the...

2 More Replies

by Skesaram • New Contributor II

02-07-2023 6:44:54 AM

1909 Views
1 reply
0 kudos

Need help to connect to local DB from Data bricks

jdbcHostname="478"
jdbcPort=1433
jdbcDatabase="Onprem_AzureDB"
jdbcUsername="upendra"
jdbcPassword="upendrakumar"
jdbcDriver="com.microsoft.sqlserver.jdbc.SQLServerDriver"
jdbcUrl=f"jdbc:sqlserver://{jdbcHostname}:{jdbcPort};databaseName={jdbcDatabase};use...

Latest Reply

Debayan
Databricks Employee

02-07-2023 11:05:15 PM

0 kudos

Hi, Could you please verify the network connectivity from Databricks to the SQL server? Please make sure SQL:port is allowed in your firewall rules or security groups.
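One way to carry out the connectivity check suggested here is to attempt a plain TCP connection to the SQL Server port from a notebook cell, which exercises the same host and port that JDBC would use. The sketch below is only an illustration: the function name `can_reach` and the host name in the usage comment are hypothetical, 1433 is assumed as the port because it is the one used elsewhere on this page, and the check reflects only the machine where the code runs.

```python
import socket


def can_reach(host: str, port: int = 1433, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port opens within the timeout."""
    try:
        # create_connection resolves the host name and attempts a TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


# Example call (hypothetical host name):
# can_reach("sqlserver.example.internal", 1433)
```

A `False` result points to a firewall, routing, or DNS problem rather than to the JDBC URL or credentials.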

by Spauk • New Contributor II

01-03-2023 5:38:28 AM

25600 Views
5 replies
7 kudos

Resolved! Best Practices for naming Tables and Databases in Databricks

We moved to Databricks a few months ago; before that we were on SQL Server. So all our tables and databases follow the "camel case" rule. Apparently, in Databricks the rule is "lower case with underscore". Where can we find an official doc...

Latest Reply

LandanG
Databricks Employee

01-03-2023 7:09:24 AM

7 kudos

Hi @Salah KHALFALLAH, looking at the documentation, it appears that Databricks' preferred naming convention is lowercase with underscores, as you mentioned. The reason for this is most likely that Databricks uses Hive Metastore, which is case insens...

4 More Replies

by pavanb • New Contributor II

04-05-2022 4:50:37 AM

12654 Views
3 replies
3 kudos

Resolved! memory issues - databricks

Hi All, All of a sudden in our Databricks dev environment we are getting memory-related exceptions such as out of memory, result too large, etc. Also, the error message does not help identify the issue. Can someone please guide on what would be...

Latest Reply

pavanb
New Contributor II

04-06-2022 2:43:20 AM

3 kudos

Thanks for the response @Hubert Dudek. If I run the same code in the test environment, it completes successfully, but in dev it gives an out-of-memory error. Also, the configuration of the test and dev environments is exactly the same.

2 More Replies

by Michael_Galli • Contributor III

04-05-2022 7:55:48 AM

13125 Views
6 replies
3 kudos

Resolved! com.microsoft.sqlserver.jdbc.SQLServerException:The driver could not establish a secure connection to SQL Server by using SSL encr. Error: "Unexpected rethrowing"

Hi all, there is a random error when pushing data from Databricks to an Azure SQL Database. Has anyone else had this problem? Any ideas are appreciated. See stacktrace attached.
Target: Azure SQL Database, Standard S6: 400 DTUs
Databricks Cluster config: "...

Latest Reply

Michael_Galli
Contributor III

04-19-2022 11:07:57 PM

3 kudos

@Pearl Ubaru TLS 1.1 is already deprecated. Are there any concerns from your side about setting TLS 1.2 in the connection string?

5 More Replies

by lizou • Contributor III

04-16-2022 10:27:26 PM

4848 Views
2 replies
2 kudos

Resolved! How to find the identity column seed value?

How to find the identity column seed value? A seed value is required when we specifically need to start generating new values from a given number (most likely we need to keep the original key values when data is reloaded from another source, and any new ...

Latest Reply

lizou
Contributor III

04-17-2022 2:51:10 PM

2 kudos

Found it, thanks! Of course, it would be nice to have a SQL function available to query the value. Example:
\"delta.identity.start\":984888,\"delta.identity.highWaterMark\":1004409,\"comment\":\"identity\",\"delta.identity.step\":1}
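The reply does not say where these values were found. The keys look like per-column metadata, so the sketch below assumes that metadata is already available as a JSON string and simply extracts the identity settings from it. The function name `identity_settings` is hypothetical, and the sample string is rebuilt from the fragment in the reply with the missing opening brace added.

```python
import json

# Column metadata rebuilt from the fragment quoted in the reply; the opening
# brace and the way the string was obtained are assumptions for illustration.
column_metadata_json = (
    '{"delta.identity.start":984888,'
    '"delta.identity.highWaterMark":1004409,'
    '"comment":"identity",'
    '"delta.identity.step":1}'
)


def identity_settings(metadata_json: str) -> dict:
    """Return the delta.identity.* entries of a column's metadata, without the prefix."""
    metadata = json.loads(metadata_json)
    prefix = "delta.identity."
    return {
        key[len(prefix):]: value
        for key, value in metadata.items()
        if key.startswith(prefix)
    }


settings = identity_settings(column_metadata_json)
print(settings)
```

Here `start` corresponds to the seed value asked about in the question, and `step` to the increment.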

1 More Replies

by Sudeshna • New Contributor III

03-15-2022 11:49:37 AM

15026 Views
6 replies
7 kudos

Resolved! I am new to Databricks SQL and want to create a variable which can hold calculations either from static values or from select queries similar to SQL Server. Is there a way to do so?

I was trying to create a variable and I got the following error.
Command: SET a = 5;
Error: Error running query. Configuration a is not available.

Latest Reply

BilalAslamDbrx
Databricks Employee

03-20-2022 1:35:27 AM

7 kudos

@Sudeshna Bhakat what @Joseph Kambourakis described works on clusters but is restricted on Databricks SQL endpoints, i.e. only a limited number of SET commands are allowed. I suggest you explore the curly-braces syntax (e.g. {{ my_variable }}) in Databrick...

5 More Replies

Databricks Community
