<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Sync prod WS DBs to dev WS DBs in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/sync-prod-ws-dbs-to-dev-ws-dbs/m-p/33212#M24275</link>
    <description>&lt;P&gt;We have a couple of sources that we already set up to stream to prod using a third-party system. Is there a way to sync this data directly to our dev workspace so we can build pipelines there? E.g., connecting directly to a cluster in prod and pulling with a job cluster, dumping to S3 and using Auto Loader, or maybe creating a shared DBFS and sharing the data through that?&lt;/P&gt;&lt;P&gt;We initially created the dev / prod workspaces using the automagical workspace creation tool, so I'm unfamiliar with how setting up a shared DBFS would work.&lt;/P&gt;</description>
    <pubDate>Mon, 29 Aug 2022 21:15:45 GMT</pubDate>
    <dc:creator>Mr__E</dc:creator>
    <dc:date>2022-08-29T21:15:45Z</dc:date>
    <item>
      <title>Sync prod WS DBs to dev WS DBs</title>
      <link>https://community.databricks.com/t5/data-engineering/sync-prod-ws-dbs-to-dev-ws-dbs/m-p/33212#M24275</link>
      <description>&lt;P&gt;We have a couple of sources that we already set up to stream to prod using a third-party system. Is there a way to sync this data directly to our dev workspace so we can build pipelines there? E.g., connecting directly to a cluster in prod and pulling with a job cluster, dumping to S3 and using Auto Loader, or maybe creating a shared DBFS and sharing the data through that?&lt;/P&gt;&lt;P&gt;We initially created the dev / prod workspaces using the automagical workspace creation tool, so I'm unfamiliar with how setting up a shared DBFS would work.&lt;/P&gt;</description>
      <pubDate>Mon, 29 Aug 2022 21:15:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sync-prod-ws-dbs-to-dev-ws-dbs/m-p/33212#M24275</guid>
      <dc:creator>Mr__E</dc:creator>
      <dc:date>2022-08-29T21:15:45Z</dc:date>
    </item>
    <item>
      <title>Re: Sync prod WS DBs to dev WS DBs</title>
      <link>https://community.databricks.com/t5/data-engineering/sync-prod-ws-dbs-to-dev-ws-dbs/m-p/33213#M24276</link>
      <description>&lt;P&gt;DBFS can be used in several ways:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Allows you to&amp;nbsp;&lt;A href="https://docs.databricks.com/dbfs/index.html#interact-files" alt="https://docs.databricks.com/dbfs/index.html#interact-files" target="_blank"&gt;interact with object storage&lt;/A&gt;&amp;nbsp;using directory and file semantics instead of cloud-specific API commands.&lt;/LI&gt;&lt;LI&gt;Allows you to&amp;nbsp;&lt;A href="https://docs.databricks.com/dbfs/index.html#mount-storage" alt="https://docs.databricks.com/dbfs/index.html#mount-storage" target="_blank"&gt;mount&lt;/A&gt;&amp;nbsp;cloud object storage locations so that you can map storage credentials to paths in the Databricks workspace.&lt;/LI&gt;&lt;LI&gt;Simplifies the process of persisting files to object storage, allowing virtual machines and attached volume storage to be safely deleted on cluster termination.&lt;/LI&gt;&lt;LI&gt;Provides a convenient location for storing init scripts, JARs, libraries, and configurations for cluster initialization.&lt;/LI&gt;&lt;LI&gt;Provides a convenient location for checkpoint files created during model training with OSS deep learning libraries.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;A href="https://docs.databricks.com/dbfs/index.html#what-can-you-do-with-dbfs" target="test_blank"&gt;https://docs.databricks.com/dbfs/index.html#what-can-you-do-with-dbfs&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Please let us know if this helps or if you need further clarification.&lt;/P&gt;</description>
      <pubDate>Tue, 30 Aug 2022 20:22:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sync-prod-ws-dbs-to-dev-ws-dbs/m-p/33213#M24276</guid>
      <dc:creator>Debayan</dc:creator>
      <dc:date>2022-08-30T20:22:38Z</dc:date>
    </item>
  </channel>
</rss>

