cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Pulling the data from OGG(oracle golden gate ) to unity catalog?

Soumenkumar
New Contributor
 
1 REPLY 1

K_Anudeep
New Contributor III

Hello @Soumenkumar ,

I believe OGG has a Databricks target that stages to cloud storage (ADLS on S3) and then runs MERGE into Delta tables. This is designed for UC and documented in the official Oracle Docs.

  • In Unity Catalog, create a storage credential and an external location to the ADLS or S3  which OGG will use as its staging area. 

  • Then, using OGG, configure the Databricks target with your staging path and table mapping; OGG will stage to ADLS/S3 and MERGE into UC tables.

If the above approach is slow(high latency) or doesnt work well, then

  • You can use the Kafka Handler in OGG to publish CDC (JSON/Avro) to Kafka.
  • In Databricks, read those topics with Structured Streaming and upsert into UC Delta tables (via foreachBatch + MERGE) or use DLT with APPLY CHANGES.

To my knowledge, using either of the above two approaches should be good. Please let me know if you have any further questions.

Thanks.

 

 

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local communityโ€”sign up today to get started!

Sign Up Now