I have been trying to get the data lineage to populate with the simplest of queries on a unity enabled catalog with a unity enabled cluster. I am essentially running the example provided with more data to see how it works with various aggregates downstream.
CREATE OR REPLACE TABLE
unity_metastore.lineage_test.table1_standard (
_id INT,
food STRING,
score DECIMAL,
food_type STRING
);
INSERT INTO unity_metastore.lineage_test.table1_standard
(_id, food, score, food_type)
VALUES
(1, 'pizza', 9.5, 'italian'),
(2, 'tacos', 8.7, 'mexican'),
(3, 'pasta', 7.5, 'italian'),
(4, 'burrito', 9.0, 'mexican'),
(5, 'burger', 4.5, 'american'),
(6, 'sourkraut', 2.4, 'polish'),
(7, 'fried rice', 6.0, 'chinese'),
(8, 'hot pot', 9.0, 'chinese'),
(9, 'mole', 8.7, 'mexican'),
(10, 'dim sum', 7.0, 'chinese')
;
CREATE OR REPLACE TABLE
unity_metastore.lineage_test.table2_standard
AS SELECT
_id,
food,
score,
food_type
FROM unity_metastore.lineage_test.table1_standard
WHERE score > 5
;
However when I go to the lineage tab on the tables (which I have all privileges for the catalog and every schema/table underneath it) they are blank. There is history being collected and data is showing up in them.
I am running a cluster with 13.3 LTS and the following runtime settings enabled
spark.databricks.dataLineage.enabled true
spark.databricks.delta.dataLineage.enabled true
spark.databricks.delta.columnMapping.mode true