I thought I might follow up this after getting it all working with the help of my local Databricks office. AS the CDC has been crated it scans metadata for the server that you connect to. This may get altered in a future release, I have no idea as to the benefits of either. It does it once at initial start of the cdc_gateway and it may do it periodically at some later time. It appears relatively benign to both the server and Databricks. The permissions that are required for CDC and it will fail if you don't have them right means it can't be limited to only looking at the database you have you connection for. The product seems good in public preview. This behaviour is a bit unnerving for initial deployment, but appears to cause no issues.