I have data in PostgreSQL and Iโm using Lakeflow Connect via UI to ingest it into Databricks streaming tables.
Currently, each Lakeflow Connect pipeline only allows connecting one PostgreSQL table. I have around 300 tables, and creating pipelines manually for each table is time-consuming.
Iโm looking for a way to automate this process, where I can provide a PostgreSQL connection and table names (or a list/schema), and automatically generate and deploy the required Lakeflow Connect pipelines.
I explored Asset Bundles and YAML-based definitions, but it seems Lakeflow Connect resources are not fully supported there yet.
What would be a scalable or recommended approach to design this setup in Databricks?