05-04-2023 04:00 AM
What are the steps needed to connect to a DB2-AS400 source to pull data to lake using Databricks? I believe it requires establishing a jdbc connection, but I couldnot find much details online
05-10-2023 07:59 AM
@Ajay Menon - This could be a two step process. If you are on Azure, we can retrieve the data from DB2 into a ADLS location by writing a function app using ADF. Use Autoloader to ingest the files from ADLS with CDC to a bronze delta table and follow the medallion architecture.
05-05-2023 06:44 AM
I`d start by loking what options you have for a DB2 library (probs JDBC) for python. Start here https://pypi.org/project/ibm-db/
05-05-2023 06:48 AM
Just thinking about it, your DB2 is probably on prem. In which case you`d have to have network connectivity directly between databricks and your on-premise servers, which probably doesn`t exist. In which case, you`ll probably have to start looking at a different way of data ingress (i.e. a cloud data orchestator with an on-prem connector like AF). Without knowing what your architecture\tech stack looks like it`s difficult to say.
05-10-2023 07:59 AM
@Ajay Menon - This could be a two step process. If you are on Azure, we can retrieve the data from DB2 into a ADLS location by writing a function app using ADF. Use Autoloader to ingest the files from ADLS with CDC to a bronze delta table and follow the medallion architecture.
05-19-2023 01:08 AM
Hi @Ajay Menon
Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help.
We'd love to hear from you.
Thanks!
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.
Request a New Group