Data Lineage in Unity Catalog not Populating
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
11-10-2023 12:17 PM
I have been trying to get the data lineage to populate with the simplest of queries on a unity enabled catalog with a unity enabled cluster. I am essentially running the example provided with more data to see how it works with various aggregates downstream.
CREATE OR REPLACE TABLE
unity_metastore.lineage_test.table1_standard (
_id INT,
food STRING,
score DECIMAL,
food_type STRING
);
INSERT INTO unity_metastore.lineage_test.table1_standard
(_id, food, score, food_type)
VALUES
(1, 'pizza', 9.5, 'italian'),
(2, 'tacos', 8.7, 'mexican'),
(3, 'pasta', 7.5, 'italian'),
(4, 'burrito', 9.0, 'mexican'),
(5, 'burger', 4.5, 'american'),
(6, 'sourkraut', 2.4, 'polish'),
(7, 'fried rice', 6.0, 'chinese'),
(8, 'hot pot', 9.0, 'chinese'),
(9, 'mole', 8.7, 'mexican'),
(10, 'dim sum', 7.0, 'chinese')
;
CREATE OR REPLACE TABLE
unity_metastore.lineage_test.table2_standard
AS SELECT
_id,
food,
score,
food_type
FROM unity_metastore.lineage_test.table1_standard
WHERE score > 5
;
However when I go to the lineage tab on the tables (which I have all privileges for the catalog and every schema/table underneath it) they are blank. There is history being collected and data is showing up in them.
I am running a cluster with 13.3 LTS and the following runtime settings enabled
spark.databricks.dataLineage.enabled true
spark.databricks.delta.dataLineage.enabled true
spark.databricks.delta.columnMapping.mode true
- Labels:
-
Delta Lake
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
11-19-2023 08:16 PM
Hi,
A few thoughts:
- Can you confirm that the cluster is running in either Access Mode "Assigned" or "Shared"
- You can confirm that UC is setup correctly using this guide: https://docs.databricks.com/data-governance/unity-catalog/index.html
- Consider doing a full refresh on the Catalog explorer page to help eliminate any strange browser caching behaviors
And if you are looking to enable column mapping mode, it can be added as a `TBLPROPERTIES` to a table using:
'delta.columnMapping.mode' = 'name'
...or at a session, cluster level with: `spark.databricks.delta.properties.defaults.columnMapping.mode`.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-01-2024 04:30 AM
I have exactly the same issue, create the sample tables , everything seems fine but I can not see any data in lineage tab page in databricks .
can anybody say some other thing to help me, because that answer doesn't work for me
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-16-2024 06:55 AM
You might need to update your outbound firewall rules to allow for connectivity to the Amazon Kinesis / Event Hubs endpoint.
https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/data-lineage