<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Databricks notebook failed with "Caused by: java.io.FileNotFoundException: Operation failed: "The specified path does not exist.", 404, HEAD, https://adls.dfs.core.windows.net/raw/file.csv?upn=false&amp;action=getStatus&amp;timeout=90". in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17291#M11319</link>
    <description>&lt;P&gt;I haven't found a good solution yet. I have seen this issue many times now, and it goes away after a rerun. I don't feel that re-running is the best solution.&lt;/P&gt;</description>
    <pubDate>Tue, 30 Aug 2022 07:17:22 GMT</pubDate>
    <dc:creator>rpshgupta</dc:creator>
    <dc:date>2022-08-30T07:17:22Z</dc:date>
    <item>
      <title>Databricks notebook failed with "Caused by: java.io.FileNotFoundException: Operation failed: "The specified path does not exist.", 404, HEAD, https://adls.dfs.core.windows.net/raw/file.csv?upn=false&amp;action=getStatus&amp;timeout=90".</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17284#M11312</link>
      <description>&lt;P&gt;org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 458.0 failed 4 times, most recent failure: Lost task 0.3 in stage 458.0 (TID 2247) (172.18.102.75 executor 1): com.databricks.sql.io.FileReadException: Error while reading file abfss:REDACTED_LOCAL_PART@adls.dfs.core.windows.net/file.csv. It is possible the underlying files have been updated. You can explicitly invalidate the cache in Spark by running 'REFRESH TABLE tableName' command in SQL or by recreating the Dataset/DataFrame involved. If Delta cache is stale or the underlying files have been removed, you can invalidate Delta cache manually by restarting the cluster.&lt;/P&gt;&lt;P&gt;	at org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1$$anon$2.logFileNameAndThrow(FileScanRDD.scala:417)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1$$anon$2.getNext(FileScanRDD.scala:369)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.util.NextIterator.hasNext(NextIterator.scala:73)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1.nextIterator(FileScanRDD.scala:509)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1.$anonfun$hasNext$1(FileScanRDD.scala:322)&lt;/P&gt;&lt;P&gt;	at scala.runtime.java8.JFunction0$mcZ$sp.apply(JFunction0$mcZ$sp.java:23)&lt;/P&gt;&lt;P&gt;	at com.databricks.spark.util.ExecutorFrameProfiler$.record(ExecutorFrameProfiler.scala:110)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1.hasNext(FileScanRDD.scala:317)&lt;/P&gt;&lt;P&gt;	at scala.collection.Iterator$$anon$10.hasNext(Iterator.scala:460)&lt;/P&gt;&lt;P&gt;	at scala.collection.Iterator$$anon$12.hasNext(Iterator.scala:513)&lt;/P&gt;&lt;P&gt;	at scala.collection.Iterator$$anon$11.hasNext(Iterator.scala:491)&lt;/P&gt;&lt;P&gt;	at scala.collection.Iterator$$anon$10.hasNext(Iterator.scala:460)&lt;/P&gt;&lt;P&gt;	at 
scala.collection.Iterator$$anon$10.hasNext(Iterator.scala:460)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.sql.execution.collect.UnsafeRowBatchUtils$.encodeUnsafeRows(UnsafeRowBatchUtils.scala:80)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.sql.execution.collect.Collector.$anonfun$processFunc$1(Collector.scala:155)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.scheduler.ResultTask.$anonfun$runTask$3(ResultTask.scala:75)&lt;/P&gt;&lt;P&gt;	at com.databricks.spark.util.ExecutorFrameProfiler$.record(ExecutorFrameProfiler.scala:110)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.scheduler.ResultTask.$anonfun$runTask$1(ResultTask.scala:75)&lt;/P&gt;&lt;P&gt;	at com.databricks.spark.util.ExecutorFrameProfiler$.record(ExecutorFrameProfiler.scala:110)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:55)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.scheduler.Task.doRunTask(Task.scala:156)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.scheduler.Task.$anonfun$run$1(Task.scala:125)&lt;/P&gt;&lt;P&gt;	at com.databricks.spark.util.ExecutorFrameProfiler$.record(ExecutorFrameProfiler.scala:110)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.scheduler.Task.run(Task.scala:95)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$13(Executor.scala:825)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.util.Utils$.tryWithSafeFinally(Utils.scala:1658)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$4(Executor.scala:828)&lt;/P&gt;&lt;P&gt;	at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:23)&lt;/P&gt;&lt;P&gt;	at com.databricks.spark.util.ExecutorFrameProfiler$.record(ExecutorFrameProfiler.scala:110)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:683)&lt;/P&gt;&lt;P&gt;	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)&lt;/P&gt;&lt;P&gt;	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)&lt;/P&gt;&lt;P&gt;	at 
java.lang.Thread.run(Thread.java:748)&lt;/P&gt;&lt;P&gt;Caused by: java.io.FileNotFoundException: Operation failed: "The specified path does not exist.", 404, HEAD, &lt;A href="https://adls.dfs.core.windows.net/raw/file.csv?upn=false&amp;amp;action=getStatus&amp;amp;timeout=90" target="test_blank"&gt;https://adls.dfs.core.windows.net/raw/file.csv?upn=false&amp;amp;action=getStatus&amp;amp;timeout=90&lt;/A&gt;&lt;/P&gt;&lt;P&gt;	at shaded.databricks.azurebfs.org.apache.hadoop.fs.azurebfs.AzureBlobFileSystem.checkException(AzureBlobFileSystem.java:1344)&lt;/P&gt;&lt;P&gt;	at shaded.databricks.azurebfs.org.apache.hadoop.fs.azurebfs.AzureBlobFileSystem.open(AzureBlobFileSystem.java:266)&lt;/P&gt;&lt;P&gt;	at com.databricks.spark.metrics.FileSystemWithMetrics.open(FileSystemWithMetrics.scala:336)&lt;/P&gt;&lt;P&gt;	at org.apache.hadoop.fs.FileSystem.lambda$openFileWithOptions$0(FileSystem.java:4633)&lt;/P&gt;&lt;P&gt;	at org.apache.hadoop.util.LambdaUtils.eval(LambdaUtils.java:52)&lt;/P&gt;&lt;P&gt;	at org.apache.hadoop.fs.FileSystem.openFileWithOptions(FileSystem.java:4631)&lt;/P&gt;&lt;P&gt;	at org.apache.hadoop.fs.FileSystem$FSDataInputStreamBuilder.build(FileSystem.java:4768)&lt;/P&gt;&lt;P&gt;	at org.apache.hadoop.mapreduce.lib.input.LineRecordReader.initialize(LineRecordReader.java:92)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.sql.execution.datasources.HadoopFileLinesReader.&amp;lt;init&amp;gt;(HadoopFileLinesReader.scala:65)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.sql.execution.datasources.csv.TextInputCSVDataSource.readFile(CSVDataSource.scala:108)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.sql.execution.datasources.csv.CSVFileFormat.$anonfun$buildReader$2(CSVFileFormat.scala:169)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.sql.execution.datasources.FileFormat$$anon$1.apply(FileFormat.scala:156)&lt;/P&gt;&lt;P&gt;	at org.apache.spark.sql.execution.datasources.FileFormat$$anon$1.apply(FileFormat.scala:143)&lt;/P&gt;&lt;P&gt;	at 
org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1$$anon$2.getNext(FileScanRDD.scala:353)&lt;/P&gt;&lt;P&gt;	... 31 more&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jun 2022 06:56:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17284#M11312</guid>
      <dc:creator>rpshgupta</dc:creator>
      <dc:date>2022-06-20T06:56:23Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks notebook failed with "Caused by: java.io.FileNotFoundException: Operation failed: "The specified path does not exist.", 404, HEAD, https://adls.dfs.core.windows.net/raw/file.csv?upn=false&amp;action=getStatus&amp;timeout=90".</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17285#M11313</link>
      <description>&lt;P&gt;The job appears to reference a file that no longer exists. As the error message suggests, please try running 'REFRESH TABLE tableName' so that Spark invalidates its cached metadata and file listing for the table. If that doesn't help, please share your code.&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jun 2022 17:43:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17285#M11313</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2022-06-20T17:43:30Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks notebook failed with "Caused by: java.io.FileNotFoundException: Operation failed: "The specified path does not exist.", 404, HEAD, https://adls.dfs.core.windows.net/raw/file.csv?upn=false&amp;action=getStatus&amp;timeout=90".</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17286#M11314</link>
      <description>&lt;P&gt;@Hubert Dudek There is no table at all. I am just writing and reading Parquet files.&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jun 2022 19:12:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17286#M11314</guid>
      <dc:creator>rpshgupta</dc:creator>
      <dc:date>2022-06-20T19:12:04Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks notebook failed with "Caused by: java.io.FileNotFoundException: Operation failed: "The specified path does not exist.", 404, HEAD, https://adls.dfs.core.windows.net/raw/file.csv?upn=false&amp;action=getStatus&amp;timeout=90".</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17288#M11316</link>
      <description>&lt;P&gt;Please share your code. Then we will be able to help.&lt;/P&gt;</description>
      <pubDate>Tue, 28 Jun 2022 13:49:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17288#M11316</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2022-06-28T13:49:23Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks notebook failed with "Caused by: java.io.FileNotFoundException: Operation failed: "The specified path does not exist.", 404, HEAD, https://adls.dfs.core.windows.net/raw/file.csv?upn=false&amp;action=getStatus&amp;timeout=90".</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17289#M11317</link>
      <description>&lt;P&gt;Try converting your Parquet table to a Delta table; this should resolve the error.&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jul 2022 20:22:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17289#M11317</guid>
      <dc:creator>jose_gonzalez</dc:creator>
      <dc:date>2022-07-29T20:22:46Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks notebook failed with "Caused by: java.io.FileNotFoundException: Operation failed: "The specified path does not exist.", 404, HEAD, https://adls.dfs.core.windows.net/raw/file.csv?upn=false&amp;action=getStatus&amp;timeout=90".</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17290#M11318</link>
      <description>&lt;P&gt;Hi @Rupesh gupta,&lt;/P&gt;&lt;P&gt;Hope you are well. Just wanted to check whether you were able to find an answer to your question and, if so, whether you would like to mark an answer as best. It would be really helpful for other members too.&lt;/P&gt;&lt;P&gt;Cheers!&lt;/P&gt;</description>
      <pubDate>Thu, 25 Aug 2022 08:51:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17290#M11318</guid>
      <dc:creator>Vidula</dc:creator>
      <dc:date>2022-08-25T08:51:25Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks notebook failed with "Caused by: java.io.FileNotFoundException: Operation failed: "The specified path does not exist.", 404, HEAD, https://adls.dfs.core.windows.net/raw/file.csv?upn=false&amp;action=getStatus&amp;timeout=90".</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17291#M11319</link>
      <description>&lt;P&gt;I haven't found a good solution yet. I have seen this issue many times now, and it goes away after a rerun. I don't feel that re-running is the best solution.&lt;/P&gt;</description>
      <pubDate>Tue, 30 Aug 2022 07:17:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17291#M11319</guid>
      <dc:creator>rpshgupta</dc:creator>
      <dc:date>2022-08-30T07:17:22Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks notebook failed with "Caused by: java.io.FileNotFoundException: Operation failed: "The specified path does not exist.", 404, HEAD, https://adls.dfs.core.windows.net/raw/file.csv?upn=false&amp;action=getStatus&amp;timeout=90".</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17292#M11320</link>
      <description>&lt;P&gt;I am facing the same issue. I am reading from a view built on a join of four tables stored in Parquet format. When I pull data from the view in my streaming job, the job fails.&lt;/P&gt;&lt;P&gt;The base table is appended to incrementally every day. Do the part file names change each day with the Parquet file format?&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/1776iB52EF6179CED0360/image-size/large?v=v2&amp;amp;px=999" role="button" title="image" alt="image" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/1775i9FADAF124F3BE65F/image-size/large?v=v2&amp;amp;px=999" role="button" title="image" alt="image" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 30 Nov 2022 21:12:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-notebook-failed-with-quot-caused-by-java-io/m-p/17292#M11320</guid>
      <dc:creator>nagini_sitarama</dc:creator>
      <dc:date>2022-11-30T21:12:15Z</dc:date>
    </item>
  </channel>
</rss>

