
Data Lineage in Unity Catalog not Populating

Ian_Neft
New Contributor

I have been trying to get data lineage to populate with the simplest of queries on a Unity Catalog-enabled catalog, using a Unity Catalog-enabled cluster. I am essentially running the example provided, with more data, to see how it works with various aggregates downstream.

CREATE OR REPLACE TABLE unity_metastore.lineage_test.table1_standard (
  _id INT,
  food STRING,
  -- DECIMAL alone defaults to DECIMAL(10,0), which would round the scores to whole numbers
  score DECIMAL(3,1),
  food_type STRING
);

INSERT INTO unity_metastore.lineage_test.table1_standard 
  (_id, food, score, food_type)
VALUES
  (1, 'pizza', 9.5, 'italian'),
  (2, 'tacos', 8.7, 'mexican'),
  (3, 'pasta', 7.5, 'italian'),
  (4, 'burrito', 9.0, 'mexican'),
  (5, 'burger', 4.5, 'american'),
  (6, 'sauerkraut', 2.4, 'polish'),
  (7, 'fried rice', 6.0, 'chinese'),
  (8, 'hot pot', 9.0, 'chinese'),
  (9, 'mole', 8.7, 'mexican'),
  (10, 'dim sum', 7.0, 'chinese')
;

CREATE OR REPLACE TABLE 
unity_metastore.lineage_test.table2_standard
AS SELECT
  _id,
  food,
  score,
  food_type
FROM unity_metastore.lineage_test.table1_standard
WHERE score > 5
;

However, when I open the Lineage tab on these tables (I have all privileges on the catalog and on every schema and table under it), it is blank. Table history is being recorded and the data is showing up in the tables.

I am running a cluster on Databricks Runtime 13.3 LTS with the following Spark configuration settings enabled:

spark.databricks.dataLineage.enabled true

spark.databricks.delta.dataLineage.enabled true

spark.databricks.delta.columnMapping.mode true
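
For anyone reproducing this, the two observations in the post (history is being recorded, data is present) can be checked from a notebook or the SQL editor. A minimal sketch, using the table names from the post and assuming the CTAS above has already run:

```sql
-- Confirm that writes to the downstream table were recorded in the Delta history
DESCRIBE HISTORY unity_metastore.lineage_test.table2_standard;

-- Confirm that the CTAS actually produced rows
SELECT food_type, COUNT(*) AS n_rows
FROM unity_metastore.lineage_test.table2_standard
GROUP BY food_type;
```

These checks only show that the writes succeeded. They do not say anything about whether lineage events were captured.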

4 REPLIES

brockb
Databricks Employee

Hi, 

A few thoughts:

And if you are looking to enable column mapping mode, it can be set on a table through `TBLPROPERTIES` using:

'delta.columnMapping.mode' = 'name'

...or at the session or cluster level with: `spark.databricks.delta.properties.defaults.columnMapping.mode`.
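
As a sketch of both options, using a table name from the original post: on runtimes that do not upgrade the Delta protocol automatically, the reader and writer versions have to be raised in the same statement that enables column mapping. The protocol upgrade is not easily reversed, so this is best tried on a test table. It illustrates the column mapping setting only and is not offered as a fix for the empty Lineage tab.

```sql
-- Table level: enable column mapping by name on an existing Delta table
ALTER TABLE unity_metastore.lineage_test.table1_standard SET TBLPROPERTIES (
  'delta.minReaderVersion' = '2',
  'delta.minWriterVersion' = '5',
  'delta.columnMapping.mode' = 'name'
);

-- Session level: default for Delta tables created after this point
SET spark.databricks.delta.properties.defaults.columnMapping.mode = name;
```

Note that the value is a mode such as `name`, not `true`.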

davood_NL
New Contributor II

I have exactly the same issue: I created the sample tables and everything seems fine, but I cannot see any data on the Lineage tab in Databricks.

Can anybody suggest something else? That answer doesn't work for me.

AlexYu
New Contributor III

pmahawar
New Contributor II

The cluster is running in shared access mode on Databricks Runtime 15.4 LTS.

UC is set up as per the Databricks guide. I can see the system tables, but no data is populating in the table_lineage table. Port 9093 for Event Hubs is also open in the firewall.

I enabled the runtime settings as well:

spark.databricks.dataLineage.enabled true

spark.databricks.delta.dataLineage.enabled true

spark.databricks.delta.columnMapping.mode true

Still no luck. Is there anything else that can be done or checked? Quick help is appreciated.
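
One way to check whether any lineage events were recorded for the tables in this thread is to query the lineage system table directly. A minimal sketch, assuming the `system.access` schema is enabled and readable in the workspace, that the column names match the documented `table_lineage` schema, and reusing the table names from the original post:

```sql
-- Most recent lineage events that touch the sample tables
SELECT
  event_time,
  entity_type,
  source_table_full_name,
  target_table_full_name,
  created_by
FROM system.access.table_lineage
WHERE target_table_full_name = 'unity_metastore.lineage_test.table2_standard'
   OR source_table_full_name = 'unity_metastore.lineage_test.table1_standard'
ORDER BY event_time DESC
LIMIT 20;
```

An empty result would suggest that events are not reaching the lineage service at all, rather than a display problem on the Lineage tab.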
