<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: java.lang.IllegalArgumentException: java.net.URISyntaxException in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34924#M25617</link>
    <description>&lt;P&gt;Previously we have run AutoLoader many times on very similar folder names without a failure. Now we get: &lt;/P&gt;&lt;P&gt;&lt;I&gt;StreamingQueryException: Job aborted due to stage failure: Task 1 in stage 1657.0 failed 4 times, most recent failure: Lost task 1.3 in stage 1657.0 (TID 5451) (10.38.20.138 executor 17): java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative path in absolute URI: 2022-03-07T20:47:0&lt;/I&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;B&gt;"2022-03-07T20.47.04.000Z" = fails now in November&lt;/B&gt;&lt;/LI&gt;&lt;LI&gt;&lt;B&gt;"2022-03-07T20.47.04.000Z" = succeeded from July through October&lt;/B&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;DESCRIBE HISTORY&lt;/P&gt;&lt;P&gt;operation (most recent at top)&lt;/P&gt;&lt;P&gt;MERGE&lt;/P&gt;&lt;P&gt;MERGE&lt;/P&gt;&lt;P&gt;ADD COLUMNS --started having problems after this&lt;/P&gt;&lt;P&gt;RESTORE&lt;/P&gt;&lt;P&gt;RESTORE&lt;/P&gt;&lt;P&gt;RESTORE&lt;/P&gt;&lt;P&gt;RESTORE&lt;/P&gt;&lt;P&gt;MERGE&lt;/P&gt;&lt;P&gt;MERGE&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 21 Nov 2022 14:05:34 GMT</pubDate>
    <dc:creator>B_Seibert</dc:creator>
    <dc:date>2022-11-21T14:05:34Z</dc:date>
    <item>
      <title>java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34916#M25609</link>
      <description>&lt;P&gt;I am using Databricks Autoloader to load JSON files from ADLS Gen2 incrementally in directory listing mode. All source file names contain a timestamp. Autoloader works perfectly for a couple of days with the configuration below, then breaks the next day with the following error.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;org.apache.spark.SparkException: Job aborted due to stage failure: Task 1 in stage 7.0 failed 4 times, most recent failure: Lost task 1.3 in stage 7.0 (TID 24) (10.150.38.137 executor 0): java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative path in absolute URI: 2022-04-27T20:09:00 (The complete error message is attached.)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I deleted the checkpoint and the target Delta table, then loaded everything fresh with the option "cloudFiles.includeExistingFiles":"true". All files loaded successfully, but after a couple of incremental loads the same error occurred.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Autoloader configuration&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;{"cloudFiles.format":"json","cloudFiles.useNotifications":"false", "cloudFiles.inferColumnTypes":"true", "cloudFiles.schemaEvolutionMode":"addNewColumns", "cloudFiles.includeExistingFiles":"false"}&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Path locations are passed as below&lt;/P&gt;&lt;P&gt;raw_data_location : dbfs:/mnt/DEV-cdl-raw/data/storage-xxxxx/xxxx/&lt;/P&gt;&lt;P&gt;target_delta_table_location : dbfs:/mnt/DEV-cdl-bronze/data/storage-xxxxx/xxxx/&lt;/P&gt;&lt;P&gt;checkpoint_location : dbfs:/mnt/DEV-cdl-bronze/configuration/autoloader/storage-xxxxx/xxxx/checkpoint/&lt;/P&gt;&lt;P&gt;schema_location : dbfs:/mnt/DEV-cdl-bronze/metadata/storage-xxxxx/xxxx/&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;StreamingQuery = StreamDF.writeStream \&lt;/P&gt;&lt;P&gt; .option("checkpointLocation", checkpoint_location) \&lt;/P&gt;&lt;P&gt; .option("mergeSchema", "true") \&lt;/P&gt;&lt;P&gt; 
.queryName(f"AutoLoad_RawtoBronze_{sourceFolderName}_{sourceEntityName}") \&lt;/P&gt;&lt;P&gt; .trigger(availableNow=True) \&lt;/P&gt;&lt;P&gt; .partitionBy(targetPartitionByCol) \&lt;/P&gt;&lt;P&gt; .start(target_delta_table_location)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Can someone help me here?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2022 22:00:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34916#M25609</guid>
      <dc:creator>Prem1</dc:creator>
      <dc:date>2022-08-10T22:00:57Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34917#M25610</link>
      <description>&lt;P&gt;I think you are running into the same problem I have right now: Autoloader (or something even deeper) doesn't like ":" (colon) in file names. &lt;span class="lia-unicode-emoji" title=":confused_face:"&gt;😕&lt;/span&gt; &lt;/P&gt;&lt;P&gt;Loading the files with a simple Spark read works fine.&lt;/P&gt;</description>
      <pubDate>Wed, 17 Aug 2022 12:06:51 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34917#M25610</guid>
      <dc:creator>Alexey</dc:creator>
      <dc:date>2022-08-17T12:06:51Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34918#M25611</link>
      <description>&lt;P&gt;What I don't understand is why the failure is not consistent. It runs fine for a few runs, then stops with this error. It is a strange situation.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Aug 2022 03:44:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34918#M25611</guid>
      <dc:creator>Prem1</dc:creator>
      <dc:date>2022-08-18T03:44:27Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34919#M25612</link>
      <description>&lt;P&gt;For me, it breaks immediately on the first file with a colon in the name.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Aug 2022 07:37:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34919#M25612</guid>
      <dc:creator>Alexey</dc:creator>
      <dc:date>2022-08-18T07:37:59Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34920#M25613</link>
      <description>&lt;P&gt;Hi there @PREM KUMAR KUMMAN RAMESH​&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;We'd love to hear from you.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 08 Sep 2022 11:23:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34920#M25613</guid>
      <dc:creator>Vidula</dc:creator>
      <dc:date>2022-09-08T11:23:47Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34921#M25614</link>
      <description>&lt;P&gt;Hi @Vidula Khanna&amp;nbsp;&lt;/P&gt;&lt;P&gt;I can report from my side: I wasn't able to solve the issue with AutoLoader. For my daily job, I first run &lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;os.walk(...)&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;in Python and check whether any file names contain a colon (and some other criteria). If everything is fine, I use AutoLoader for the incremental load; otherwise I reload the data every time. &lt;/P&gt;&lt;P&gt;Luckily for us, the biggest chunk of data is fine, but I hope that this issue will be fixed some day.&lt;/P&gt;</description>
      <pubDate>Fri, 09 Sep 2022 13:39:35 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34921#M25614</guid>
      <dc:creator>Alexey</dc:creator>
      <dc:date>2022-09-09T13:39:35Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34922#M25615</link>
      <description>&lt;P&gt;Yes, for us it also runs several times before the error occurs.&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 13:53:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34922#M25615</guid>
      <dc:creator>B_Seibert</dc:creator>
      <dc:date>2022-11-21T13:53:11Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34923#M25616</link>
      <description>&lt;P&gt;@Alexey Egorov​&amp;nbsp;, can you tell us more precisely what you mean by "I reload the data"? &lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 13:56:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34923#M25616</guid>
      <dc:creator>B_Seibert</dc:creator>
      <dc:date>2022-11-21T13:56:03Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34924#M25617</link>
      <description>&lt;P&gt;Previously we have run AutoLoader many times on very similar folder names without a failure. Now we get: &lt;/P&gt;&lt;P&gt;&lt;I&gt;StreamingQueryException: Job aborted due to stage failure: Task 1 in stage 1657.0 failed 4 times, most recent failure: Lost task 1.3 in stage 1657.0 (TID 5451) (10.38.20.138 executor 17): java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative path in absolute URI: 2022-03-07T20:47:0&lt;/I&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;B&gt;"2022-03-07T20.47.04.000Z" = fails now in November&lt;/B&gt;&lt;/LI&gt;&lt;LI&gt;&lt;B&gt;"2022-03-07T20.47.04.000Z" = succeeded from July through October&lt;/B&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;DESCRIBE HISTORY&lt;/P&gt;&lt;P&gt;operation (most recent at top)&lt;/P&gt;&lt;P&gt;MERGE&lt;/P&gt;&lt;P&gt;MERGE&lt;/P&gt;&lt;P&gt;ADD COLUMNS --started having problems after this&lt;/P&gt;&lt;P&gt;RESTORE&lt;/P&gt;&lt;P&gt;RESTORE&lt;/P&gt;&lt;P&gt;RESTORE&lt;/P&gt;&lt;P&gt;RESTORE&lt;/P&gt;&lt;P&gt;MERGE&lt;/P&gt;&lt;P&gt;MERGE&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 14:05:34 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34924#M25617</guid>
      <dc:creator>B_Seibert</dc:creator>
      <dc:date>2022-11-21T14:05:34Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34925#M25618</link>
      <description>&lt;P&gt;By reloading I mean loading all the existing data in that folder. As mentioned above:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;If there are no special characters that make AutoLoader fail, we can do: &lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;``&lt;/P&gt;&lt;P&gt;autoloader = spark.readStream.format("cloudFiles") \&lt;/P&gt;&lt;P&gt;            .option("cloudFiles.format", data_format) \&lt;/P&gt;&lt;P&gt;            .option("header", "true") \&lt;/P&gt;&lt;P&gt;            .option("cloudFiles.schemaLocation", schema_location) \&lt;/P&gt;&lt;P&gt;            .option("cloudFiles.allowOverwrites", "true") \&lt;/P&gt;&lt;P&gt;            .load(path)&lt;/P&gt;&lt;P&gt;``&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;In the second case, where AutoLoader will fail (at least we know from experience that it does with a colon in the file names), we use a simple data load:&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;``&lt;/P&gt;&lt;P&gt;df = spark.read.format(data_format)\&lt;/P&gt;&lt;P&gt;            .option("header", "true") \&lt;/P&gt;&lt;P&gt;            .load(path)&lt;/P&gt;&lt;P&gt;``&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;That is why I mentioned that, luckily for us, this data folder is not that large and the reload is fast. &lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 14:33:37 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34925#M25618</guid>
      <dc:creator>Alexey</dc:creator>
      <dc:date>2022-11-21T14:33:37Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34926#M25619</link>
      <description>&lt;P&gt;For me, the simple read also throws a "java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative path in absolute URI" exception when it encounters files with ':' in the name:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;df1= spark.read.format("binaryFile").load("s3://bucket_name/*/*/*/*.bag")&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So the problem is not solved.&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 15:20:33 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34926#M25619</guid>
      <dc:creator>Andrei_Radulesc</dc:creator>
      <dc:date>2022-11-21T15:20:33Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34927#M25620</link>
      <description>&lt;P&gt;Wait, I think this is a different problem. We mount an S3 bucket into DBFS, so my path looks something like this:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;S3_BUCKET_PATH = "dbfs:/mnt/mounted_bucket_name/"&lt;/P&gt;&lt;P&gt;df = spark.read.format(format).load(S3_BUCKET_PATH)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 15:26:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34927#M25620</guid>
      <dc:creator>Alexey</dc:creator>
      <dc:date>2022-11-21T15:26:16Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34928#M25621</link>
      <description>&lt;P&gt;Here is the stacktrace:&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 15:52:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34928#M25621</guid>
      <dc:creator>Andrei_Radulesc</dc:creator>
      <dc:date>2022-11-21T15:52:49Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34929#M25622</link>
      <description>&lt;P&gt;Have either of you changed the schema? We experienced the problem after we ran ADD COLUMNS.&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 16:10:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34929#M25622</guid>
      <dc:creator>B_Seibert</dc:creator>
      <dc:date>2022-11-21T16:10:07Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34930#M25623</link>
      <description>&lt;P&gt;We did not get an error until we ran ADD COLUMNS. Has anyone else made similar DDL changes? By the way, I get the same error in Azure.&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 17:04:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34930#M25623</guid>
      <dc:creator>B_Seibert</dc:creator>
      <dc:date>2022-11-21T17:04:56Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34931#M25624</link>
      <description>&lt;P&gt;I did not change the schema. The schema is fixed in my case.&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 18:03:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34931#M25624</guid>
      <dc:creator>Andrei_Radulesc</dc:creator>
      <dc:date>2022-11-21T18:03:41Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34932#M25625</link>
      <description>&lt;P&gt;For me, it's the same error when reading through a mount point:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;schema = StructType() \&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;.add("path", StringType(), False) \&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;.add("modificationTime", StringType(), False) \&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;.add("length", IntegerType(), False) \&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;.add("content", BinaryType(), True)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;df = spark.read.format("binaryFile").schema(schema) \&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;.load("dbfs:/mnt/bucket_name/[...]/*/*.bag")&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;IllegalArgumentException: java.net.URISyntaxException: Relative path in absolute URI: 2022-11-16T22:01:49+00:00&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Sure enough, I have some files in there with a ':' character in the name. Incidentally, the Databricks architect who advised us a while back said that mount points are obsolete and don't play well with the Unity Catalog permission scheme, so I've tried to refrain from using mount points.&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 22:14:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34932#M25625</guid>
      <dc:creator>Andrei_Radulesc</dc:creator>
      <dc:date>2022-11-21T22:14:09Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34933#M25626</link>
      <description>&lt;P&gt;Hello everyone. We filed a support ticket with Databricks. This is the response I received, along with an interim solution to the problem. I hope it is useful to those who read it.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;Problem Statement:&amp;nbsp;&amp;nbsp;&lt;/B&gt;&lt;/P&gt;&lt;P&gt; org.apache.spark.SparkException: Job aborted due to stage failure: Task 1 in stage 7.0 failed 4 times, most recent failure: Lost task 1.3 in stage 7.0 (TID 24) (10.150.38.137 executor 0): java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative path in absolute URI: 2022-04-27T20:09:00 (Attached the complete error message)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;Root Cause Analysis:&amp;nbsp;&amp;nbsp;&lt;/B&gt;&lt;/P&gt;&lt;P&gt; We have an incremental listing mode that speeds up listing by not scanning prefixes we have seen before, but the incremental listing mode does not handle file names containing certain special characters, for example:&lt;/P&gt;&lt;P&gt; Also, if you upload a file with a special character into DBFS, the file gets renamed and the character is automatically replaced by _&lt;/P&gt;&lt;P&gt;A fix is currently on our roadmap, but we don't have an exact ETA.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;Solution:&amp;nbsp;&amp;nbsp;&lt;/B&gt;&lt;/P&gt;&lt;P&gt; Use the config below to mitigate the issue:&lt;/P&gt;&lt;P&gt; set cloudFiles.useIncrementalListing to false.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Jan 2023 15:17:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34933#M25626</guid>
      <dc:creator>Prem1</dc:creator>
      <dc:date>2023-01-03T15:17:24Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34934#M25627</link>
      <description>&lt;P&gt;Hi Everyone,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I'm seeing this issue as well, with the same configuration as the previous posts: Autoloader with incremental file listing turned on. The strange part is that it mostly works, even though almost all of the files we're loading have colons in the name as part of the timestamp.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;It seems to be happening more frequently now, which is becoming an issue. Having to clear a checkpoint occasionally is very different from needing to clear checkpoints every day. I'm also not comfortable clearing the checkpoint programmatically. I'm lucky that in our case I have some control over how the files get named, so removing colons from the timestamp is a possibility.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Including a few bullet points here for anyone else struggling through this.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;This issue is intermittent. I haven't seen any pattern as to what causes it to occur with certain files.&lt;/LI&gt;&lt;LI&gt;This is Databricks on Azure using ADLS Gen2. The streaming cluster is on Databricks Runtime 10.4 LTS.&lt;/LI&gt;&lt;LI&gt;When an Autoloader task fails due to this issue, it is broken permanently until the checkpoint gets cleared.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I haven't tried setting &lt;I&gt;cloudFiles.useIncrementalListing &lt;/I&gt;to false, but that also feels like a less-than-ideal fix for my purposes. I'll be following updates in this thread closely. Also, thanks to everyone who has already shared info.&lt;/P&gt;</description>
      <pubDate>Wed, 04 Jan 2023 14:56:35 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34934#M25627</guid>
      <dc:creator>jshields</dc:creator>
      <dc:date>2023-01-04T14:56:35Z</dc:date>
    </item>
    <item>
      <title>Re: java.lang.IllegalArgumentException: java.net.URISyntaxException</title>
      <link>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34935#M25628</link>
      <description>&lt;P&gt;Thanks for sharing your experience. Will setting &lt;I&gt;useIncrementalListing&lt;/I&gt; to false change your processing strategy in any way? I think it only affects performance, doesn't it? I am really not sure. This actually seems more like a bug that will be fixed one day. It would be great if someone from Databricks would comment here when it is fixed.&lt;/P&gt;</description>
      <pubDate>Mon, 16 Jan 2023 12:59:58 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/java-lang-illegalargumentexception-java-net-urisyntaxexception/m-p/34935#M25628</guid>
      <dc:creator>Alexey</dc:creator>
      <dc:date>2023-01-16T12:59:58Z</dc:date>
    </item>
  </channel>
</rss>

