Using Databricks to transform cloudera lakehouse on-prem without bringing the data to cloud

Kurgod
New Contributor II

I am looking for a solution to connect databricks to cloudera lakehouse hosted on-prem and transform the data using databricks without bringing the data to databricks delta tables or cloud storage. once the transformation is done the data need to be written back to cloudera lakehouse itself

BR_DatabricksAI
Databricks Partner

Hello, 

What is your data volume? You can connect using  jdbc/odbc but this process will be slower if the data volume is too high.

Another way of connecting is if your cloudera storage is in HDFS then you can also connect through HDFS API as well. 

 

BR

Are there specific use case examples for these two methods? Could you please provide some guidance? We are also looking for this solution.