<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file. in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28991#M20748</link>
    <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Taras's answer is correct. Because spark-avro is based on the RDD APIs, the properties must be set in the hadoopConfiguration options.&lt;/P&gt;
&lt;P&gt;Please note these docs for configuration using the RDD API: &lt;A href="https://docs.azuredatabricks.net/spark/latest/data-sources/azure/azure-datalake.html#access-azure-data-lake-store-using-the-rdd-api" target="test_blank"&gt;https://docs.azuredatabricks.net/spark/latest/data-sources/azure/azure-datalake.html#access-azure-data-lake-store-using-the-rdd-api&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;In Python, you can use &lt;/P&gt;
&lt;PRE&gt;&lt;CODE&gt;sc._jsc.hadoopConfiguration().set()&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 11 Jun 2018 22:46:47 GMT</pubDate>
    <dc:creator>User16301467523</dc:creator>
    <dc:date>2018-06-11T22:46:47Z</dc:date>
    <item>
      <title>Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28982#M20739</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I have files hosted on an Azure Data Lake Store which I can connect from Azure Databricks configured as per instructions here.&lt;/P&gt;&lt;P&gt;I can read JSON files fine, however, I'm getting the following error when I try to read an Avro file.&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;spark.read.format("com.databricks.spark.avro").load("adl://blah.azuredatalakestore.net/blah/blah.avro")&lt;/CODE&gt;&lt;/PRE&gt;&lt;PRE&gt;&lt;CODE&gt;IllegalArgumentException: u'No value for dfs.adls.oauth2.access.token.provider found in conf file.'&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;I made sure that the file existed by running&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;dbutils.fs.ls("adl://blah.azuredatalakestore.net/blah/blah.avro")&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;Please note that the error refers to&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;dfs.adls.oauth2.access.token.provider&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;not &lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;dfs.adls.oauth2.access.token.provider.type&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;mentioned in the documentation above. Even after I set it to something, it would still throw the same error.&lt;/P&gt;&lt;P&gt;Has anyone experienced this issue before? Please let me know what else I should try to further troubleshoot. Thanks.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 26 Jan 2018 10:52:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28982#M20739</guid>
      <dc:creator>microamp</dc:creator>
      <dc:date>2018-01-26T10:52:59Z</dc:date>
    </item>
    <item>
      <title>Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28983#M20740</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Just found a workaround for the issue with avro file read operation as it seems proper configuration for dfs.adls.oauth2.access.token.provider is not setup inside. If the ADL folder is mounted on databrick notebook , then it is working . Please try following steps&lt;/P&gt;
&lt;P&gt;1. Mount adl folder&lt;/P&gt;
&lt;PRE&gt;&lt;CODE&gt;val configs = Map(
  "dfs.adls.oauth2.access.token.provider.type" -&amp;gt; "ClientCredential",
  "dfs.adls.oauth2.client.id" -&amp;gt; "XXX",
  "dfs.adls.oauth2.credential" -&amp;gt; "YYY",
  "dfs.adls.oauth2.refresh.url" -&amp;gt; "https://login.microsoftonline.com/ZZZ/oauth2/token",
  "dfs.adls.oauth2.access.token.provider"-&amp;gt;"org.apache.hadoop.fs.adls.oauth2.ConfCredentialBasedAccessTokenProvider") &lt;/CODE&gt;&lt;/PRE&gt;
&lt;PRE&gt;&lt;CODE&gt;dbutils.fs.mount(
  source = "adl://XYZ.azuredatalakestore.net/myfolder/demo/",
  mountPoint = "/mnt/mymount",
  extraConfigs = configs)&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;2.Verify your file is visible on mount&lt;/P&gt;
&lt;PRE&gt;&lt;CODE&gt;dbutils.fs.ls("dbfs:/mnt/ashitabh3")&lt;/CODE&gt;&lt;/PRE&gt;import com.databricks.spark.avro._ 
&lt;P&gt;&lt;/P&gt; 
&lt;P&gt;spark.read.avro("dbfs:/mnt/mymount/mydata.avro").show&lt;/P&gt;
&lt;P&gt;I can see the records now&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 05 Feb 2018 09:22:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28983#M20740</guid>
      <dc:creator>AshitabhKumar</dc:creator>
      <dc:date>2018-02-05T09:22:12Z</dc:date>
    </item>
    <item>
      <title>Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28984#M20741</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Any chance you found a solution for this by now?&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 06 Feb 2018 23:10:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28984#M20741</guid>
      <dc:creator>adina</dc:creator>
      <dc:date>2018-02-06T23:10:56Z</dc:date>
    </item>
    <item>
      <title>Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28985#M20742</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;I have not unfortunately. I can load the Avro file as JSON although I would get corrupted data as expected, but at least that proves that the file is accessible. I don't know what's causing the above error.&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 06 Feb 2018 23:34:32 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28985#M20742</guid>
      <dc:creator>microamp</dc:creator>
      <dc:date>2018-02-06T23:34:32Z</dc:date>
    </item>
    <item>
      <title>Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28986#M20743</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;You may want to try mounting your Data Lake Store to DBFS and access your files through the mounted path.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://docs.databricks.com/spark/latest/data-sources/azure/azure-datalake.html#mounting-azure-data-lake-stores-with-dbfs" target="test_blank"&gt;https://docs.databricks.com/spark/latest/data-sources/azure/azure-datalake.html#mounting-azure-data-lake-stores-with-dbfs&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;I have not tried it yet. You might find the following thread helpful however. &lt;/P&gt;
&lt;P&gt;&lt;A href="https://forums.databricks.com/questions/13266/azure-db-mount-on-python-unexpected-keyword-argume.html" target="test_blank"&gt;https://forums.databricks.com/questions/13266/azure-db-mount-on-python-unexpected-keyword-argume.html&lt;/A&gt;&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 07 Feb 2018 00:49:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28986#M20743</guid>
      <dc:creator>microamp</dc:creator>
      <dc:date>2018-02-07T00:49:10Z</dc:date>
    </item>
    <item>
      <title>Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28987#M20744</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Any solutions for this? I can read CSV files but not geojson files because I am getting this exception.&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 12 Feb 2018 23:18:02 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28987#M20744</guid>
      <dc:creator>PirrALuis_Simoe</dc:creator>
      <dc:date>2018-02-12T23:18:02Z</dc:date>
    </item>
    <item>
      <title>Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28988#M20745</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;I had the same issue with using dynamic partitioning in ADLS using Databricks Spark Sql. &lt;/P&gt;
&lt;P&gt;You need to pass ADLS configs as Spark configs during cluster creation:&lt;/P&gt;dfs.adls.oauth2.client.id ***
&lt;P&gt;&lt;/P&gt; 
&lt;P&gt;dfs.adls.oauth2.refresh.url &lt;A href="https://login.microsoftonline.com/**/oauth2/token" target="test_blank"&gt;https://login.microsoftonline.com/**/oauth2/token&lt;/A&gt; &lt;/P&gt; 
&lt;P&gt;dfs.adls.oauth2.credential **&lt;/P&gt; 
&lt;P&gt;dfs.adls.oauth2.access.token.provider.type ClientCredential &lt;/P&gt; 
&lt;P&gt;dfs.adls.oauth2.access.token.provider org.apache.hadoop.fs.adls.oauth2.ConfCredentialBasedAccessTokenProvider&lt;/P&gt;
&lt;P&gt;You also need to set hadoopConfiguration for RDD related functionality:&lt;/P&gt;spark.sparkContext.hadoopConfiguration.set("dfs.adls.oauth2.access.token.provider.type", spark.conf.get("dfs.adls.oauth2.access.token.provider.type"))
&lt;P&gt;&lt;/P&gt; 
&lt;P&gt;spark.sparkContext.hadoopConfiguration.set("dfs.adls.oauth2.client.id", spark.conf.get("dfs.adls.oauth2.client.id"))&lt;/P&gt; 
&lt;P&gt;spark.sparkContext.hadoopConfiguration.set("dfs.adls.oauth2.credential", spark.conf.get("dfs.adls.oauth2.credential")) &lt;/P&gt; 
&lt;P&gt;spark.sparkContext.hadoopConfiguration.set("dfs.adls.oauth2.refresh.url", spark.conf.get("dfs.adls.oauth2.refresh.url"))&lt;/P&gt;
&lt;P&gt;Those two measures fixed the issue for me.&lt;/P&gt;
&lt;P&gt;/Taras&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 05 Apr 2018 08:29:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28988#M20745</guid>
      <dc:creator>TarasChaikovsky</dc:creator>
      <dc:date>2018-04-05T08:29:13Z</dc:date>
    </item>
    <item>
      <title>Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28989#M20746</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Thanks for the workaround.&lt;/P&gt;
&lt;P&gt;I had a similar issue unrelated to Avro, but in saving a Spark ML model to ADLS. Even setting the property manually:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE&gt;dfs.adls.oauth2.access.token.provider org.apache.hadoop.fs.adls.oauth2.ConfCredentialBasedAccessTokenProvider&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;when setting up the spark cluster would result in error message when trying to save to adl directly: &lt;/P&gt;
&lt;PRE&gt;&lt;CODE&gt;IllegalArgumentException: u'No value for dfs.adls.oauth2.access.token.provider found in conf file.'&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;After mounting adl folder, saving works properly.&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 16 Apr 2018 01:59:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28989#M20746</guid>
      <dc:creator>MichaelParque</dc:creator>
      <dc:date>2018-04-16T01:59:16Z</dc:date>
    </item>
    <item>
      <title>Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28990#M20747</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;I can also confirm your workaround is working. But, it takes a long time to mount it. The main question is why this workaround is needed in the first place. Hopefully some official response from Databricks will be provided.&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 09 May 2018 20:34:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28990#M20747</guid>
      <dc:creator>_not_provid1608</dc:creator>
      <dc:date>2018-05-09T20:34:05Z</dc:date>
    </item>
    <item>
      <title>Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28991#M20748</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Taras's answer is correct. Because spark-avro is based on the RDD APIs, the properties must be set in the hadoopConfiguration options.&lt;/P&gt;
&lt;P&gt;Please note these docs for configuration using the RDD API: &lt;A href="https://docs.azuredatabricks.net/spark/latest/data-sources/azure/azure-datalake.html#access-azure-data-lake-store-using-the-rdd-api" target="test_blank"&gt;https://docs.azuredatabricks.net/spark/latest/data-sources/azure/azure-datalake.html#access-azure-data-lake-store-using-the-rdd-api&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;In Python, you can use &lt;/P&gt;
&lt;PRE&gt;&lt;CODE&gt;sc._jsc.hadoopConfiguration().set()&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 11 Jun 2018 22:46:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28991#M20748</guid>
      <dc:creator>User16301467523</dc:creator>
      <dc:date>2018-06-11T22:46:47Z</dc:date>
    </item>
    <item>
      <title>Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28992#M20749</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Like Taras said, after adding spark.sparkContext.hadoopConfiguration.set no need to mount adl folder&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 18 Oct 2018 13:32:51 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28992#M20749</guid>
      <dc:creator>DonatienTessier</dc:creator>
      <dc:date>2018-10-18T13:32:51Z</dc:date>
    </item>
    <item>
      <title>Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28993#M20750</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;I am getting same error for csv did you solved ??&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 28 Oct 2018 06:29:39 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28993#M20750</guid>
      <dc:creator>ChandanIsrani</dc:creator>
      <dc:date>2018-10-28T06:29:39Z</dc:date>
    </item>
    <item>
      <title>Re: Azure Data Lake Config Issue: No value for dfs.adls.oauth2.access.token.provider found in conf file.</title>
      <link>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28994#M20751</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Hi Michael,&lt;/P&gt;
&lt;P&gt;Did you find any other way? I am trying to write TF Records into ADLS and getting the same error even after setting this config.&lt;/P&gt;
&lt;P&gt;traindf.repartition(32).write.format('tfrecords').mode('overwrite').option('recordType', 'Example').save("ADLS_URL/my/path")&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 12 Mar 2019 22:28:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/azure-data-lake-config-issue-no-value-for-dfs-adls-oauth2-access/m-p/28994#M20751</guid>
      <dc:creator>SmitShah</dc:creator>
      <dc:date>2019-03-12T22:28:07Z</dc:date>
    </item>
  </channel>
</rss>

