<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Any Advice on Dynamic Masking while maintaining performance? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/any-advice-on-dynamic-masking-while-maintaining-performance/m-p/132890#M49667</link>
    <description>&lt;P&gt;I plan to mask columns with a specific tag like "sensitive", "PII" which represents that the column values are ought to be seen by privileged user groups because they contain credentials or personal identity data.&lt;BR /&gt;&lt;BR /&gt;To implement that i plan to create a function and apply to a catalog by policy.&lt;BR /&gt;&lt;BR /&gt;However, I am worried on performance. did anyone try this and experience performance issue?&lt;BR /&gt;or is there anyone has better approach to perform the task?&lt;BR /&gt;&lt;BR /&gt;Note that i have a requirement not to encrpt data.&lt;BR /&gt;&lt;BR /&gt;Thank you in advance.&lt;/P&gt;</description>
    <pubDate>Wed, 24 Sep 2025 00:33:36 GMT</pubDate>
    <dc:creator>tana_sakakimiya</dc:creator>
    <dc:date>2025-09-24T00:33:36Z</dc:date>
    <item>
      <title>Any Advice on Dynamic Masking while maintaining performance?</title>
      <link>https://community.databricks.com/t5/data-engineering/any-advice-on-dynamic-masking-while-maintaining-performance/m-p/132890#M49667</link>
      <description>&lt;P&gt;I plan to mask columns with a specific tag like "sensitive", "PII" which represents that the column values are ought to be seen by privileged user groups because they contain credentials or personal identity data.&lt;BR /&gt;&lt;BR /&gt;To implement that i plan to create a function and apply to a catalog by policy.&lt;BR /&gt;&lt;BR /&gt;However, I am worried on performance. did anyone try this and experience performance issue?&lt;BR /&gt;or is there anyone has better approach to perform the task?&lt;BR /&gt;&lt;BR /&gt;Note that i have a requirement not to encrpt data.&lt;BR /&gt;&lt;BR /&gt;Thank you in advance.&lt;/P&gt;</description>
      <pubDate>Wed, 24 Sep 2025 00:33:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/any-advice-on-dynamic-masking-while-maintaining-performance/m-p/132890#M49667</guid>
      <dc:creator>tana_sakakimiya</dc:creator>
      <dc:date>2025-09-24T00:33:36Z</dc:date>
    </item>
    <item>
      <title>Re: Any Advice on Dynamic Masking while maintaining performance?</title>
      <link>https://community.databricks.com/t5/data-engineering/any-advice-on-dynamic-masking-while-maintaining-performance/m-p/132908#M49672</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/181698"&gt;@tana_sakakimiya&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Your approach—using Unity Catalog column tags (like "sensitive" or "PII") and applying masking policies based on those tags—is a&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;recommended and scalable way&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;to manage data access in Databricks, especially for compliance and privacy. Masking policies are evaluated at query time, performance impact is minimal if logic is simple. only complex masking expressions involving udf's or regex may slow it down.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I would suggest to use ABAC&amp;nbsp;&lt;/SPAN&gt;(attribute based access control) which is coming soon already in private preview.&amp;nbsp;&lt;SPAN&gt;ABAC allows you to control access to data based on attributes (tags, labels, or properties) of users, groups, or data objects, rather than just roles (RBAC). Dynamically evaluated and highly flexible. This approach avoid maintaining lot of roles with organisation changes.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;An simple example for column masking rule under ABAC:&lt;/P&gt;&lt;P&gt;SET RULE analyst_sales_mask&lt;BR /&gt;ON CATALOG business_unit&lt;BR /&gt;COLUMN MASK mask_pii&lt;BR /&gt;TO `privileged_employees`&lt;BR /&gt;FOR TABLES&lt;BR /&gt;WHEN has_tag(‘txn’)&lt;BR /&gt;WHEN COLUMNS col_has_tag(‘pii’)&lt;/P&gt;&lt;P&gt;Br&lt;/P&gt;&lt;P&gt;Saurabh&lt;/P&gt;</description>
      <pubDate>Wed, 24 Sep 2025 08:48:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/any-advice-on-dynamic-masking-while-maintaining-performance/m-p/132908#M49672</guid>
      <dc:creator>saurabh18cs</dc:creator>
      <dc:date>2025-09-24T08:48:04Z</dc:date>
    </item>
  </channel>
</rss>

