Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

connect databricks to teradata

Rishabh-Pandey
Esteemed Contributor

Hey, I want to know: can we connect Databricks to a Teradata database, and if so, what is the procedure? Any help would be appreciated.

Rishabh Pandey
1 ACCEPTED SOLUTION

User16255483290
Contributor

8 REPLIES

thanks

Rishabh Pandey

Harshjot
Contributor III

Hi @Rishabh Pandey, just add the Teradata JDBC jar to your Databricks cluster:

https://spark.apache.org/docs/latest/sql-data-sources-jdbc.html


jose_gonzalez
Databricks Employee


Would this connection be encrypted end to end?

BroData
New Contributor II

There are two main ways to connect to Teradata from Databricks using Python.

Way 1: Using Python Libraries (e.g., sqlalchemy, pyjdbc, pyodbc, jaydebeapi, and so on)

Pros: Provides a comprehensive solution: you can query data, trigger stored procedures, and perform other advanced database operations.

Cons: Uses only the driver node of the Databricks cluster, so it does not leverage the cluster's distributed compute, which can limit performance on large datasets.
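For Way 1, a minimal sketch using teradatasql (Teradata's Python driver; an assumption here, since any of the libraries listed above would work, and the host/user/password/SQL values are placeholders, not values from this thread) might look like this. Note it runs on the driver node only:

```python
def fetch_rows(host, user, password, sql):
    """Way-1 sketch: query Teradata directly from the Databricks driver node.

    Assumes the `teradatasql` package is installed (pip install teradatasql).
    The import is deferred so the function can be defined even where the
    driver is not installed.
    """
    import teradatasql

    with teradatasql.connect(host=host, user=user, password=password) as con:
        cur = con.cursor()
        cur.execute(sql)
        return cur.fetchall()

# Example call (placeholder values):
# rows = fetch_rows("td.example.com", "<user>", "<password>", "SELECT 1")
```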

Way 2: Using PySpark and Spark JDBC API

Step 1: Install the Maven library terajdbc on the Databricks cluster.

Step 2: Read & Write

# Read from Teradata via a pushdown query
df = spark.read.format('jdbc') \
    .option('driver', 'com.teradata.jdbc.TeraDriver') \
    .option('url', 'jdbc:teradata://<host_name>/DBS_PORT=<port_number>,TMODE=ANSI,logmech=ldap') \
    .option('user', '<user>') \
    .option('password', '<password>') \
    .option('query', '<query>') \
    .load()

df.display()

# Write the DataFrame back to a Teradata table
df.write.format('jdbc') \
    .option('driver', 'com.teradata.jdbc.TeraDriver') \
    .option('url', 'jdbc:teradata://<host_name>/DBS_PORT=<port_number>,TMODE=ANSI,logmech=ldap') \
    .option('user', '<user>') \
    .option('password', '<password>') \
    .option('dbtable', '<target_db>.<target_table>') \
    .mode('<write_mode>') \
    .save()

Note:
a. Ensure the URL is correctly configured.
b. Provide valid user credentials with appropriate access.
c. Ensure <query> or <target_db>.<target_table> is accessible by <user>.
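To illustrate note (a), here is a hypothetical helper (not from this thread) that assembles the JDBC URL in the exact format used in the snippets above; the host is a placeholder, and the defaults assume Teradata's common port 1025 with the TMODE/logmech settings shown earlier:

```python
def teradata_jdbc_url(host, port=1025, tmode="ANSI", logmech="ldap"):
    """Build a Teradata JDBC URL matching the format in the snippets above.

    1025 is Teradata's commonly used default DBS port; adjust tmode and
    logmech for your site's configuration.
    """
    return f"jdbc:teradata://{host}/DBS_PORT={port},TMODE={tmode},logmech={logmech}"

url = teradata_jdbc_url("td.example.com")
# url == "jdbc:teradata://td.example.com/DBS_PORT=1025,TMODE=ANSI,logmech=ldap"
```

The returned string can be passed straight to .option('url', ...) in the read and write calls above.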

Pros: Fully utilizes the distributed computing power of the Databricks cluster, offering excellent performance for reading and writing large datasets.

Cons: The Spark JDBC API is primarily for DataFrame-based data I/O, not procedural or transactional logic, so it supports a limited set of operations (for example, you cannot execute stored procedures or other advanced database operations).

Thanks & Regards,

BroData
