<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Issues to load from ADLS in DLT in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/issues-to-load-from-adls-in-dlt/m-p/4981#M1526</link>
    <description/>
    <pubDate>Sat, 13 May 2023 15:47:05 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2023-05-13T15:47:05Z</dc:date>
    <item>
      <title>Issues to load from ADLS in DLT</title>
      <link>https://community.databricks.com/t5/data-engineering/issues-to-load-from-adls-in-dlt/m-p/4980#M1525</link>
      <description>&lt;P&gt;I am using DLT to load CSV files from ADLS. Below is the SQL query in my notebook:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;CREATE OR REFRESH STREAMING LIVE TABLE test_account_raw
AS SELECT * FROM cloud_files(
  "abfss://my_container@my_storageaccount.dfs.core.windows.net/test_csv/", 
  "csv", 
  map("header", "true"));&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;below is my configuration in Delta live table pipeline in order to access ADLS:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;    "configuration": {
        "fs.azure.account.auth.type.my_storageaccount.dfs.core.windows.net": "OAuth",
        "fs.azure.account.oauth.provider.type.my_storageaccount.dfs.core.windows.net": "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider",
        "fs.azure.account.oauth2.client.id.my_storageaccount.dfs.core.windows.net": "my_client_id",
        "fs.azure.account.oauth2.client.secret.my_storageaccount.dfs.core.windows.net": "my_secret",
        "fs.azure.account.oauth2.client.endpoint.my_storageaccount.dfs.core.windows.net": "https://login.microsoftonline.com/my_tenant_id/oauth2/token"
    }&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;The pipeline fails with the following error:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;org.apache.spark.sql.streaming.StreamingQueryException: [STREAM_FAILED] Query [id = 818323fc-80d5-4833-9f46-7d1afc9c5bf7, runId = 722e9aac-0fdd-4206-9d49-683bb151f0bf] terminated with exception: The container in the file event `{"backfill":{"bucket":"root@dbstoragelhdp7mflfxe2y","key":"5810201264315799/Data/Temp/xxxx.csv","size":1801,"eventTime":1682522202000,"newerThan$default$2":false}}` is different from expected by the source: `my_container@my_storageaccount`.
at org.apache.spark.sql.execution.streaming.StreamExecution.org$apache$spark$sql$execution$streaming$StreamExecution$$runStream(StreamExecution.scala:395)
at org.apache.spark.sql.execution.streaming.StreamExecution$$anon$1.$anonfun$run$2(StreamExecution.scala:257)
....&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;How can I fix this issue?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;</description>
      <pubDate>Mon, 01 May 2023 15:34:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issues-to-load-from-adls-in-dlt/m-p/4980#M1525</guid>
      <dc:creator>guostong</dc:creator>
      <dc:date>2023-05-01T15:34:43Z</dc:date>
    </item>
    <item>
      <title>Re: Issues to load from ADLS in DLT</title>
      <link>https://community.databricks.com/t5/data-engineering/issues-to-load-from-adls-in-dlt/m-p/4981#M1526</link>
      <description>&lt;P&gt;@Richard Guo:&lt;/P&gt;&lt;P&gt;The error message suggests that the container specified in the cloud_files function and the container specified in the fs.azure configuration settings are different. In the cloud_files function, you are using my_container while in the configuration settings you are using my_container@my_storageaccount.dfs.core.windows.net.&lt;/P&gt;&lt;P&gt;To fix the issue, you need to ensure that the container name used in both places matches exactly. You can try modifying the cloud_files function to use the full container path as follows:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;CREATE OR REFRESH STREAMING LIVE TABLE test_account_raw
AS SELECT * FROM cloud_files(
  "abfss://my_storageaccount.dfs.core.windows.net/my_container/test_csv/", 
  "csv", 
  map("header", "true"));&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;Then, make sure that the fs.azure configuration settings use the same container path:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;"configuration": {
        "fs.azure.account.auth.type.my_storageaccount.dfs.core.windows.net": "OAuth",
        "fs.azure.account.oauth.provider.type.my_storageaccount.dfs.core.windows.net": "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider",
        "fs.azure.account.oauth2.client.id.my_storageaccount.dfs.core.windows.net": "my_client_id",
        "fs.azure.account.oauth2.client.secret.my_storageaccount.dfs.core.windows.net": "my_secret",
        "fs.azure.account.oauth2.client.endpoint.my_storageaccount.dfs.core.windows.net": "https://login.microsoftonline.com/my_tenant_id/oauth2/token",
        "fs.azure.createRemoteFileSystemDuringInitialization": "true",
        "fs.abfss.my_container@my_storageaccount.dfs.core.windows.net.tokenProviderType": "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider",
        "fs.abfss.my_container@my_storageaccount.dfs.core.windows.net.oauth.provider.type": "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider",
        "fs.abfss.my_container@my_storageaccount.dfs.core.windows.net.oauth2.client.id": "my_client_id",
        "fs.abfss.my_container@my_storageaccount.dfs.core.windows.net.oauth2.client.secret": "my_secret",
        "fs.abfss.my_container@my_storageaccount.dfs.core.windows.net.oauth2.client.endpoint": "https://login.microsoftonline.com/my_tenant_id/oauth2/token"
    }&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;Note that in the fs.azure configuration settings, the fs.abfss prefix is used instead of fs.azure.account. This is because we are using the ABFS driver to access ADLS.&lt;/P&gt;</description>
      <pubDate>Sat, 13 May 2023 15:47:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issues-to-load-from-adls-in-dlt/m-p/4981#M1526</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-05-13T15:47:05Z</dc:date>
    </item>
    <item>
      <title>Re: Issues to load from ADLS in DLT</title>
      <link>https://community.databricks.com/t5/data-engineering/issues-to-load-from-adls-in-dlt/m-p/4982#M1527</link>
      <description>&lt;P&gt;Hi @Richard Guo&lt;/P&gt;&lt;P&gt;Thank you for posting your question in our community! We are happy to assist you.&lt;/P&gt;&lt;P&gt;To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers your question?&lt;/P&gt;&lt;P&gt;This will also help other community members who may have similar questions in the future. Thank you for your participation and let us know if you need any further assistance!&lt;/P&gt;</description>
      <pubDate>Thu, 18 May 2023 09:33:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issues-to-load-from-adls-in-dlt/m-p/4982#M1527</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-05-18T09:33:16Z</dc:date>
    </item>
    <item>
      <title>Re: Issues to load from ADLS in DLT</title>
      <link>https://community.databricks.com/t5/data-engineering/issues-to-load-from-adls-in-dlt/m-p/4983#M1528</link>
      <description>&lt;P&gt;Thank you, everyone. The problem is resolved: it went away once I had workspace admin access.&lt;/P&gt;</description>
      <pubDate>Thu, 18 May 2023 13:57:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issues-to-load-from-adls-in-dlt/m-p/4983#M1528</guid>
      <dc:creator>guostong</dc:creator>
      <dc:date>2023-05-18T13:57:24Z</dc:date>
    </item>
  </channel>
</rss>

