<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Is it possible to use other databases within Delta Live Tables (DLT)? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20822#M14101</link>
    <description>&lt;P&gt;No, but it could be the issue if it were enabled.&lt;/P&gt;&lt;P&gt;What happens if you read the Parquet (or Delta Lake, CSV, etc.) files directly, without going through the hive_metastore?&lt;/P&gt;</description>
    <pubDate>Thu, 24 Nov 2022 14:21:52 GMT</pubDate>
    <dc:creator>-werners-</dc:creator>
    <dc:date>2022-11-24T14:21:52Z</dc:date>
    <item>
      <title>Is it possible to use other databases within Delta Live Tables (DLT)?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20817#M14096</link>
      <description>&lt;P&gt;I have set up a DLT pipeline with "testing" set as the target database. I need to join data that exists in a "keys" table in my "beta" database, but I get an AccessDeniedException, despite having full access to both databases via a normal notebook.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;A snippet demonstrating the issue is below (note that I don't appear to have the code snippet button available in Chrome):&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The following would be data from the keys table:&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;key_df = spark.createDataFrame([[0, "State0"]], ['key', 'name'])&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Then the DLT pipeline:&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;import dlt&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;@dlt.table()&lt;/P&gt;&lt;P&gt;def dlt_test_data():&lt;/P&gt;&lt;P&gt;    df = spark.createDataFrame([["40873", 0]], ['version', 'key'])&lt;/P&gt;&lt;P&gt;    return df&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;@dlt.table()&lt;/P&gt;&lt;P&gt;def dlt_test_silver():&lt;/P&gt;&lt;P&gt;    data_df = dlt.read("dlt_test_data")&lt;/P&gt;&lt;P&gt;    key_df = spark.table("beta.keys")&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;    joined_df = data_df.join(key_df, "key")&lt;/P&gt;&lt;P&gt;    return joined_df&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I also tried importing the data in a view and reading from that in dlt_test_silver, but this also failed.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Is it possible to use other databases within a DLT pipeline? If so, how?&lt;/P&gt;</description>
      <pubDate>Thu, 24 Nov 2022 08:49:26 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20817#M14096</guid>
      <dc:creator>477061</dc:creator>
      <dc:date>2022-11-24T08:49:26Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to use other databases within Delta Live Tables (DLT)?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20818#M14097</link>
      <description>&lt;P&gt;What exactly do you mean by 'database'? Actual RDBMS systems, or databases in the Hive metastore (or Unity Catalog)?&lt;/P&gt;</description>
      <pubDate>Thu, 24 Nov 2022 09:49:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20818#M14097</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-11-24T09:49:05Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to use other databases within Delta Live Tables (DLT)?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20819#M14098</link>
      <description>&lt;P&gt;Apologies, I'm new to Databricks. I mean databases in the Hive metastore.&lt;/P&gt;</description>
      <pubDate>Thu, 24 Nov 2022 09:55:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20819#M14098</guid>
      <dc:creator>477061</dc:creator>
      <dc:date>2022-11-24T09:55:28Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to use other databases within Delta Live Tables (DLT)?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20820#M14099</link>
      <description>&lt;P&gt;No problem.&lt;/P&gt;&lt;P&gt;It should work: Hive databases are only a semantic view on top of the actual files residing in your data lake.&lt;/P&gt;&lt;P&gt;Have you enabled table access control?&lt;/P&gt;</description>
      <pubDate>Thu, 24 Nov 2022 10:01:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20820#M14099</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-11-24T10:01:17Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to use other databases within Delta Live Tables (DLT)?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20821#M14100</link>
      <description>&lt;P&gt;Table access control is currently disabled, and admin is refusing to enable it. Does this need to be enabled for the DLT pipeline to use other hive metastore databases?&lt;/P&gt;</description>
      <pubDate>Thu, 24 Nov 2022 14:05:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20821#M14100</guid>
      <dc:creator>477061</dc:creator>
      <dc:date>2022-11-24T14:05:49Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to use other databases within Delta Live Tables (DLT)?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20822#M14101</link>
      <description>&lt;P&gt;No, but it could be the issue if it were enabled.&lt;/P&gt;&lt;P&gt;What happens if you read the Parquet (or Delta Lake, CSV, etc.) files directly, without going through the hive_metastore?&lt;/P&gt;</description>
      <pubDate>Thu, 24 Nov 2022 14:21:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20822#M14101</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-11-24T14:21:52Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to use other databases within Delta Live Tables (DLT)?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20823#M14102</link>
      <description>&lt;P&gt;Right now, yes: DLT supports only one target database. If you need to push into different databases, you may need two DLT pipelines.&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.databricks.com/workflows/delta-live-tables/delta-live-tables-quickstart.html#requirements" target="test_blank"&gt;https://docs.databricks.com/workflows/delta-live-tables/delta-live-tables-quickstart.html#requirements&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Hope this helps. Cheers.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 26 Nov 2022 04:27:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20823#M14102</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-11-26T04:27:19Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to use other databases within Delta Live Tables (DLT)?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20824#M14103</link>
      <description>&lt;P&gt;Thanks, yes, I knew about the limitation on the target database. There does not appear to be any documentation on a limitation on source databases (for clarity, I want to pull from 2 source databases, but push to 1 target database). But I assume it comes to the same thing.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Nov 2022 08:39:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20824#M14103</guid>
      <dc:creator>477061</dc:creator>
      <dc:date>2022-11-28T08:39:21Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to use other databases within Delta Live Tables (DLT)?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20825#M14104</link>
      <description>&lt;P&gt;I have just tried to read from the parquet file directly and it fails with the same error.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Nov 2022 08:40:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20825#M14104</guid>
      <dc:creator>477061</dc:creator>
      <dc:date>2022-11-28T08:40:45Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to use other databases within Delta Live Tables (DLT)?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20826#M14105</link>
      <description>&lt;P&gt;I guess it's one of those things that still have to be sorted out on DLT.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Nov 2022 08:42:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20826#M14105</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-11-28T08:42:43Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to use other databases within Delta Live Tables (DLT)?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20828#M14107</link>
      <description>&lt;P&gt;Thanks, but no, I have not yet received an answer. Other customers have tried to help, but there is still no resolution. &lt;/P&gt;</description>
      <pubDate>Wed, 30 Nov 2022 12:56:02 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20828#M14107</guid>
      <dc:creator>477061</dc:creator>
      <dc:date>2022-11-30T12:56:02Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to use other databases within Delta Live Tables (DLT)?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20829#M14108</link>
      <description>&lt;P&gt;As an update to this issue: I was running the DLT pipeline on a personal cluster that had an instance profile defined (as per Databricks best practices). As a result, the pipeline did not have permission to access other S3 resources (e.g. other databases). The solution was to explicitly add the AWS permissions to the pipeline. To do this you need to edit the JSON settings of the pipeline, and add the "&lt;B&gt;aws_attributes&lt;/B&gt;" key to the "&lt;B&gt;clusters&lt;/B&gt;" definition (discussed &lt;A href="https://docs.databricks.com/workflows/delta-live-tables/delta-live-tables-configuration.html#cluster-configuration" alt="https://docs.databricks.com/workflows/delta-live-tables/delta-live-tables-configuration.html#cluster-configuration" target="_blank"&gt;here&lt;/A&gt;). The value for the "aws_attributes" key can be found by going to the cluster definition menu, clicking edit, then viewing the JSON of the cluster definition.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Apparently this issue would not have occurred had I been using a job cluster.&lt;/P&gt;</description>
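The settings change described in this post can be sketched roughly as follows. This is only an illustration of the shape of the edit, not the poster's actual configuration: the helper name add_aws_attributes, the trimmed settings document, and the instance profile ARN are all hypothetical placeholders, and the real "aws_attributes" value should be copied from the JSON of your own cluster definition as described above. The sketch works on a plain Python dict standing in for the pipeline's JSON settings and does not call any Databricks API.

```python
import copy
import json

# Hypothetical placeholder ARN; copy the real value from your cluster's JSON definition.
EXAMPLE_INSTANCE_PROFILE_ARN = (
    "arn:aws:iam::123456789012:instance-profile/example-profile"
)


def add_aws_attributes(pipeline_settings, aws_attributes):
    """Return a copy of the settings with aws_attributes set on every cluster entry."""
    updated = copy.deepcopy(pipeline_settings)
    for cluster in updated.setdefault("clusters", []):
        # Merge, so attributes already present are kept unless overridden.
        merged = dict(cluster.get("aws_attributes", {}))
        merged.update(aws_attributes)
        cluster["aws_attributes"] = merged
    return updated


# A trimmed, illustrative pipeline settings document (not a complete one).
settings = {
    "name": "example_pipeline",
    "target": "testing",
    "clusters": [{"label": "default"}],
}

patched = add_aws_attributes(
    settings, {"instance_profile_arn": EXAMPLE_INSTANCE_PROFILE_ARN}
)

# Print the result in the same JSON form the pipeline settings editor shows.
print(json.dumps(patched, indent=2))
```

The printed JSON shows where the "aws_attributes" key sits inside each entry of "clusters". The helper returns a modified copy rather than editing the input, so the original settings can still be compared against the patched version before anything is pasted into the pipeline.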
      <pubDate>Wed, 18 Jan 2023 15:54:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-it-possible-to-use-other-databases-within-delta-live-tables/m-p/20829#M14108</guid>
      <dc:creator>477061</dc:creator>
      <dc:date>2023-01-18T15:54:07Z</dc:date>
    </item>
  </channel>
</rss>

