<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Databricks Upload local files (Create/Modify table) in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/databricks-upload-local-files-create-modify-table/m-p/105185#M42033</link>
    <description>&lt;P&gt;For sharing a CSV file containing PII data with another user who should not have access to PII data elements:&lt;/P&gt;
&lt;UL class="_1t7bu9h7 _1t7bu9h2"&gt;
&lt;LI&gt;You can use Databricks' Unity Catalog to manage and govern access to data. Unity Catalog allows you to define fine-grained access controls at the column level, ensuring that users without the necessary permissions cannot access PII data.&lt;/LI&gt;
&lt;LI&gt;You can create views that mask or exclude PII data for users who should not have access to it. This can be done using dynamic view functions, which return either encrypted or masked data based on the user's access level.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P class="_1t7bu9h1 paragraph"&gt;&lt;STRONG&gt;Modifying a table not owned by the user:&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL class="_1t7bu9h7 _1t7bu9h2"&gt;
&lt;LI&gt;&lt;SPAN&gt;Users cannot modify tables they do not own unless they have been explicitly granted the necessary permissions. Unity Catalog provides a unified permission model to manage access policies consistently across data and AI assets.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;You can enforce access controls and permissions to prevent users from uploading files to the DBFS root. This can be managed through the Databricks workspace settings and Unity Catalog, ensuring that only authorized users have the necessary permissions to upload files to specific locations.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://www.databricks.com/blog/2020/11/20/enforcing-column-level-encryption-and-avoiding-data-duplication-with-pii.html" target="_blank"&gt;https://www.databricks.com/blog/2020/11/20/enforcing-column-level-encryption-and-avoiding-data-duplication-with-pii.html&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 10 Jan 2025 14:12:45 GMT</pubDate>
    <dc:creator>NandiniN</dc:creator>
    <dc:date>2025-01-10T14:12:45Z</dc:date>
    <item>
      <title>Databricks Upload local files (Create/Modify table)</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-upload-local-files-create-modify-table/m-p/71282#M35025</link>
      <description>&lt;P&gt;Hello Team,&lt;/P&gt;&lt;P&gt;I believe Databricks recently released a feature to create or modify a table using file upload on Self-Serve workspaces. It accepts files smaller than 2 GB (CSV, TSV, JSON, Avro, Parquet, or text files) and creates or overwrites a managed Delta Lake table. (&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/ingestion/add-data/upload-data" target="_blank"&gt;https://learn.microsoft.com/en-us/azure/databricks/ingestion/add-data/upload-data&lt;/A&gt;)&lt;/P&gt;&lt;P&gt;I am looking for your guidance on the following:&lt;/P&gt;&lt;P&gt;- How do we ensure that a file uploaded by one user cannot be shared with another user?&lt;/P&gt;&lt;P&gt;- Do we know whether Databricks local file upload complies with the Bell–LaPadula model? Here is the information on the Bell–LaPadula model: &lt;A href="https://en.wikipedia.org/wiki/Bell%E2%80%93LaPadula_model" target="_blank"&gt;https://en.wikipedia.org/wiki/Bell%E2%80%93LaPadula_model&lt;/A&gt;&lt;/P&gt;&lt;P&gt;- What are the best practices for complying with least privilege, need to know, and segregation of duties for file upload on a Databricks Self-Serve workspace?&lt;/P&gt;&lt;P&gt;- Can a user overwrite the data (table) uploaded by another user?&lt;/P&gt;&lt;P&gt;- Can we use file upload on a non-secure cluster?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you&lt;/P&gt;</description>
      <pubDate>Fri, 31 May 2024 20:25:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-upload-local-files-create-modify-table/m-p/71282#M35025</guid>
      <dc:creator>VJ3</dc:creator>
      <dc:date>2024-05-31T20:25:56Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Upload local files (Create/Modify table)</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-upload-local-files-create-modify-table/m-p/71294#M35026</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/95628"&gt;@VJ3&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The "&lt;SPAN&gt;Imported files are uploaded to a secure internal location within your account which is garbage collected daily.&lt;/SPAN&gt;"&lt;/P&gt;
&lt;P&gt;I created a new table and tried to check the path from the details but was not able to access the underlying file.&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Unity Catalog should help you with table permissions if you do not want other users to overwrite your tables.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;For access control, we follow the documentation below; it does not explicitly mention the Bell–LaPadula model:&amp;nbsp;&lt;A href="https://docs.databricks.com/en/data-governance/table-acls/table-acl.html#enable-table-access-control-for-a-cluster" target="_blank"&gt;https://docs.databricks.com/en/data-governance/table-acls/table-acl.html#enable-table-access-control-for-a-cluster&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Regarding "Can we use file upload on a non-secure cluster?": are you facing any issue? Note the following:&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;SPAN&gt;You can upload data to the staging area without connecting to compute resources, but you must select an active compute resource to preview and configure your table.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;You must have access to a running compute resource and permissions to create tables in a target schema.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Sat, 01 Jun 2024 04:30:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-upload-local-files-create-modify-table/m-p/71294#M35026</guid>
      <dc:creator>NandiniN</dc:creator>
      <dc:date>2024-06-01T04:30:19Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Upload local files (Create/Modify table)</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-upload-local-files-create-modify-table/m-p/71806#M35027</link>
      <description>&lt;P&gt;Hello Nandini,&lt;/P&gt;&lt;P&gt;Thank you for the reply, and apologies for the delay. Let's say I uploaded a CSV file containing PII data using the upload feature available in the Databricks UI. Will I be able to share that file with another user who should not have access to PII data elements? Can a user modify a table they do not own? What is required to mask PII data before sharing the CSV file with another user? How do we ensure that a user cannot upload the file to the DBFS root, which is accessible to all users?&lt;/P&gt;&lt;P&gt;Thank you&lt;/P&gt;&lt;P&gt;Vijay&lt;/P&gt;</description>
      <pubDate>Wed, 05 Jun 2024 21:04:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-upload-local-files-create-modify-table/m-p/71806#M35027</guid>
      <dc:creator>VJ3</dc:creator>
      <dc:date>2024-06-05T21:04:28Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Upload local files (Create/Modify table)</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-upload-local-files-create-modify-table/m-p/105185#M42033</link>
      <description>&lt;P&gt;For sharing a CSV file containing PII data with another user who should not have access to PII data elements:&lt;/P&gt;
&lt;UL class="_1t7bu9h7 _1t7bu9h2"&gt;
&lt;LI&gt;You can use Databricks' Unity Catalog to manage and govern access to data. Unity Catalog allows you to define fine-grained access controls at the column level, ensuring that users without the necessary permissions cannot access PII data.&lt;/LI&gt;
&lt;LI&gt;You can create views that mask or exclude PII data for users who should not have access to it. This can be done using dynamic view functions, which return either encrypted or masked data based on the user's access level.&lt;/LI&gt;
&lt;/UL&gt;
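&lt;P&gt;As a rough illustration of the dynamic view approach, the sketch below builds a CREATE VIEW statement that shows PII columns only to members of a privileged group and returns a fixed placeholder to everyone else. It relies on the Databricks SQL function is_account_group_member; the table, view, column, and group names are hypothetical, and it assumes the PII columns are strings. Treat it as a starting point and not as the only way to mask data.&lt;/P&gt;
&lt;PRE&gt;
```python
def masked_view_sql(view_name, source_table, columns, pii_columns, privileged_group):
    """Build a CREATE VIEW statement that redacts PII columns for non-members.

    All names passed in are illustrative; PII columns are assumed to be strings
    so that the 'REDACTED' placeholder has a compatible type.
    """
    select_items = []
    for col in columns:
        if col in pii_columns:
            # Only members of the privileged group see the real value.
            select_items.append(
                f"CASE WHEN is_account_group_member('{privileged_group}') "
                f"THEN {col} ELSE 'REDACTED' END AS {col}"
            )
        else:
            select_items.append(col)
    select_list = ",\n  ".join(select_items)
    return (
        f"CREATE OR REPLACE VIEW {view_name} AS\n"
        f"SELECT\n  {select_list}\nFROM {source_table}"
    )


# Hypothetical names, for illustration only.
statement = masked_view_sql(
    view_name="main.hr.employees_masked",
    source_table="main.hr.employees",
    columns=["id", "email"],
    pii_columns=["email"],
    privileged_group="pii_readers",
)
print(statement)
```
&lt;/PRE&gt;
&lt;P&gt;The generated statement could be run from a notebook or the SQL editor. Users without PII access would then be granted SELECT on the view only, not on the underlying table.&lt;/P&gt;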
&lt;P class="_1t7bu9h1 paragraph"&gt;&lt;STRONG&gt;Modifying a table not owned by the user:&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL class="_1t7bu9h7 _1t7bu9h2"&gt;
&lt;LI&gt;&lt;SPAN&gt;Users cannot modify tables they do not own unless they have been explicitly granted the necessary permissions. Unity Catalog provides a unified permission model to manage access policies consistently across data and AI assets.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
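&lt;P&gt;To make the explicit grant concrete, here is a minimal sketch that builds Unity Catalog GRANT statements for a table. SELECT and MODIFY are Unity Catalog table privileges; the table and group names are hypothetical. The principal would typically also need USE CATALOG and USE SCHEMA on the parent catalog and schema, which this sketch does not cover.&lt;/P&gt;
&lt;PRE&gt;
```python
def grant_sql(privilege, table_name, principal):
    """Build a GRANT statement for a table privilege.

    Only SELECT (read) and MODIFY (write) are handled in this sketch.
    """
    allowed = {"SELECT", "MODIFY"}
    if privilege not in allowed:
        raise ValueError(f"unsupported privilege: {privilege}")
    # Backticks quote the principal, which may contain special characters.
    return f"GRANT {privilege} ON TABLE {table_name} TO `{principal}`"


# Hypothetical table and group names, for illustration only.
for privilege in ("SELECT", "MODIFY"):
    print(grant_sql(privilege, "main.hr.employees", "data_editors"))
```
&lt;/PRE&gt;
&lt;P&gt;Without a MODIFY grant, or ownership of the table, a user should not be able to overwrite it.&lt;/P&gt;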
&lt;P&gt;You can enforce access controls and permissions to prevent users from uploading files to the DBFS root. This can be managed through the Databricks workspace settings and Unity Catalog, ensuring that only authorized users have the necessary permissions to upload files to specific locations.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://www.databricks.com/blog/2020/11/20/enforcing-column-level-encryption-and-avoiding-data-duplication-with-pii.html" target="_blank"&gt;https://www.databricks.com/blog/2020/11/20/enforcing-column-level-encryption-and-avoiding-data-duplication-with-pii.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 10 Jan 2025 14:12:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-upload-local-files-create-modify-table/m-p/105185#M42033</guid>
      <dc:creator>NandiniN</dc:creator>
      <dc:date>2025-01-10T14:12:45Z</dc:date>
    </item>
  </channel>
</rss>

