โ05-13-2022 04:52 AM
I wish to do some analysis on tables that are stored in dataverse in databricks. I know that PowerBi uses its Dataverse connector to fetch the data using a Dataverse's TDS endpoint. The tables that we import in PowerBi using this connector is nearly perfect with all the relationships accounted for.
I am trying to establish a similar connection in databricks using connectors or through azure data lake/data warehouse services. Connecting directly to the web app is not an option because it fetches the tables without proper lookup values and relationships.
Is there some suggestions or docs that I could follow?
โ10-07-2022 06:16 AM
I finally followed the following method for my requirement:
1) Connected the data verse tables from PowerApps to synapse workspace using the synapse link
2) Used the standard template provided in it to convert the multiple .csv files to parquet format
3) Translated the missing columns in the converted parquet file using the metadata information as given here in Access Dataverse choices
It is still difficult to translate the metadata within databricks since there could be multi-level relationships. But replicating the PowerBI dataverse connector seems to be currently not possible. So I am able to temporarily resolve this issue.
โ05-16-2022 03:47 AM
Hi @Arjun Bala Krishnanโ , If you're using standard dataflows that store the data in Dataverse, you can still connect to Dataverse from many Azure services. The following image shows that in Azure Data Factory, the output of dataflow from Dataverse can be used as a source.
Please go through this doc.
โ06-01-2022 07:12 AM
Hi @Kaniz Fatmaโ , could you share how to connect to Dataverse from Databricks specifically? This doesn't really answer the question, the doc doesn't help as well unfortunately.
โ06-02-2022 12:33 AM
Hi @Tomasz Starakiewiczโ, Databricks is a cloud-based service that provides data processing capabilities through Apache Sparkโข. When paired with the CData JDBC Driver, customers can use Databricks to perform data engineering and data science on live Microsoft Dataverse data. This article walks through hosting the CData JDBC Driver in Azure and connecting to and processing lives of Microsoft Dataverse data in Databricks.
Please let me know if this helps you.
โ07-15-2022 01:25 AM
CData drivers are not free. To connect data from one microsoft service to another must be without the help of outside (paid) solutions. This was also the first google suggestion.
โ05-16-2022 06:09 AM
Thank you @Kaniz Fatmaโ for the reply. Is it possible that PowerBi is using an internal azure datalake when importing tables using its TDS (SQL) endpoint connection? It would be helpful to know since I am struggling to reproduce the exact table that Bi imports on databricks (View table data in Power BI Desktop). I could then go ahead and try a connection using an external data lake storage.
โ06-02-2022 12:51 AM
Hi @Arjun Bala Krishnanโ, Power BI desktop already supports the Azure data lake Gen2 connector; you can use this to get data from your data sources.
โ07-15-2022 12:59 AM
I am not looking to connect data lake to power BI. The objective is to connect dataverse tables to databricks including the relationships.
โ07-11-2022 11:01 AM
Hey @Arjun Bala Krishnanโ and @Tomasz Starakiewiczโ
Hope everything is going great!
Does @Kaniz Fatmaโ's response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?
We'd love to hear from you.
Thanks!
โ07-15-2022 01:01 AM
Unfortunately the topic is not resolved so far. I have been in contact with MS Support ever since. Both mail and live support has not got us to the expected target. I will propose the best solution that I know of in this thread.
โ10-07-2022 06:16 AM
I finally followed the following method for my requirement:
1) Connected the data verse tables from PowerApps to synapse workspace using the synapse link
2) Used the standard template provided in it to convert the multiple .csv files to parquet format
3) Translated the missing columns in the converted parquet file using the metadata information as given here in Access Dataverse choices
It is still difficult to translate the metadata within databricks since there could be multi-level relationships. But replicating the PowerBI dataverse connector seems to be currently not possible. So I am able to temporarily resolve this issue.
โ10-07-2022 11:24 AM
Thank you for your sharing the solution with our community @Arjun Bala Krishnanโ !
Would you mind selecting your answer as the best? Thanks.
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโt want to miss the chance to attend and share knowledge.
If there isnโt a group near you, start one and help create a community that brings people together.
Request a New Group