11-10-2023 12:17 PM
I have been trying to get the data lineage to populate with the simplest of queries on a unity enabled catalog with a unity enabled cluster. I am essentially running the example provided with more data to see how it works with various aggregates downstream.
CREATE OR REPLACE TABLE
unity_metastore.lineage_test.table1_standard (
_id INT,
food STRING,
score DECIMAL,
food_type STRING
);
INSERT INTO unity_metastore.lineage_test.table1_standard
(_id, food, score, food_type)
VALUES
(1, 'pizza', 9.5, 'italian'),
(2, 'tacos', 8.7, 'mexican'),
(3, 'pasta', 7.5, 'italian'),
(4, 'burrito', 9.0, 'mexican'),
(5, 'burger', 4.5, 'american'),
(6, 'sourkraut', 2.4, 'polish'),
(7, 'fried rice', 6.0, 'chinese'),
(8, 'hot pot', 9.0, 'chinese'),
(9, 'mole', 8.7, 'mexican'),
(10, 'dim sum', 7.0, 'chinese')
;
CREATE OR REPLACE TABLE
unity_metastore.lineage_test.table2_standard
AS SELECT
_id,
food,
score,
food_type
FROM unity_metastore.lineage_test.table1_standard
WHERE score > 5
;
However when I go to the lineage tab on the tables (which I have all privileges for the catalog and every schema/table underneath it) they are blank. There is history being collected and data is showing up in them.
I am running a cluster with 13.3 LTS and the following runtime settings enabled
spark.databricks.dataLineage.enabled true
spark.databricks.delta.dataLineage.enabled true
spark.databricks.delta.columnMapping.mode true
11-19-2023 08:16 PM
Hi,
A few thoughts:
And if you are looking to enable column mapping mode, it can be added as a `TBLPROPERTIES` to a table using:
'delta.columnMapping.mode' = 'name'
...or at a session, cluster level with: `spark.databricks.delta.properties.defaults.columnMapping.mode`.
03-01-2024 04:30 AM
I have exactly the same issue, create the sample tables , everything seems fine but I can not see any data in lineage tab page in databricks .
can anybody say some other thing to help me, because that answer doesn't work for me
08-16-2024 06:55 AM
You might need to update your outbound firewall rules to allow for connectivity to the Amazon Kinesis / Event Hubs endpoint.
https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/data-lineage
03-04-2025 04:01 AM
Cluster running in shared mode with Databricks runtime 15.4 LTS
UC setup as per Databricks guide. I can see system tables everything but data is not populating in the table_lineage table. EventHub Firewall 9093 port is also open.
Enabled runtime settings as well
spark.databricks.dataLineage.enabled true
spark.databricks.delta.dataLineage.enabled true
spark.databricks.delta.columnMapping.mode true
still no luck. Any other thing that can be done or checked. Quickhelp is appreciated
Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!
Sign Up Now