<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Bug Report: SDP (DLT) with autoloader not passing through pipe delimiter/separator in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/bug-report-sdp-dlt-with-autoloader-not-passing-through-pipe/m-p/144189#M52282</link>
    <description>&lt;P&gt;I am noticing a difference between using Auto Loader in an interactive notebook and using it in a Spark Declarative Pipeline (DLT pipeline). This issue seems very similar to this other unanswered post from a few years ago:&amp;nbsp;&lt;LI-MESSAGE title="Bug report: the delimiter option does not work when run on DLT" uid="54094" url="https://community.databricks.com/t5/get-started-discussions/bug-report-the-delimiter-option-does-not-work-when-run-on-dlt/m-p/54094#U54094" discussion_style_icon_css="lia-mention-container-editor-message lia-img-icon-forum-thread lia-fa-icon lia-fa-forum lia-fa-thread lia-fa"&gt;&lt;/LI-MESSAGE&gt;&amp;nbsp;&lt;BR /&gt;The CSV I am trying to ingest uses the pipe character (|) as its separator.&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; df = (spark.readStream&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .format("cloudFiles")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("cloudFiles.format", "csv")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("delimiter", "|")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("cloudFiles.schemaLocation", f"/Volumes/{location}/schema")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .load(f"/Volumes/{path to csv}")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .writeStream&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("checkpointLocation", f"/Volumes/{location}/checkpoint")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .trigger(availableNow=True)&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .toTable(f"{table_location}")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; )&lt;BR /&gt;```&lt;BR /&gt;^ Running this in a serverless notebook works without issues.&lt;/P&gt;&lt;P&gt;``` python&lt;BR /&gt;import dlt&lt;BR /&gt;&lt;BR /&gt;@dlt.table(&lt;BR /&gt;&amp;nbsp; &amp;nbsp; 
name=f"{bronze_raw_schema}.{table_name}",&lt;BR /&gt;)&lt;BR /&gt;def load_table():&lt;BR /&gt;&amp;nbsp; &amp;nbsp; # Parentheses let the method chain span multiple lines&lt;BR /&gt;&amp;nbsp; &amp;nbsp; return (&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; spark.readStream&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .format("cloudFiles")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("cloudFiles.format", "csv")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("delimiter", "|")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("header", "true")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("cloudFiles.includeExistingFiles", "true")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .load(f"/Volumes/{path to csv}")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; )&lt;BR /&gt;```&lt;/P&gt;&lt;P&gt;When I run the above in an SDP (DLT) pipeline, I get the following error. It appears that the reader is not splitting on the pipe (|) character, so the whole header row is treated as a single column name and hits the column-name length limit.&lt;/P&gt;&lt;P&gt;com.databricks.sql.managedcatalog.UnityCatalogServiceException: [ErrorClass=INVALID_PARAMETER_VALUE] Invalid input: RPC UpdateTable Field managedcatalog.ColumnInfo.name: At columns.0: name "HiPortSystemCode|HiportCode|CashAccountCode|ValuationDate|ContractDate|CashTransactionId|DeletedIndicator|CashTransactionTypeCode|AccountingDate|EntryDate|SettlementDate|OriginalAccountingDate|SecurityCode|AnalysisTypeCode|TransferCashAccountCode|Transact..." too long. Maximum length is 255 characters.&lt;/P&gt;</description>
    <pubDate>Fri, 16 Jan 2026 00:01:42 GMT</pubDate>
    <dc:creator>ChrisLawford_n1</dc:creator>
    <dc:date>2026-01-16T00:01:42Z</dc:date>
    <item>
      <title>Bug Report: SDP (DLT) with autoloader not passing through pipe delimiter/separator</title>
      <link>https://community.databricks.com/t5/data-engineering/bug-report-sdp-dlt-with-autoloader-not-passing-through-pipe/m-p/144189#M52282</link>
      <description>&lt;P&gt;I am noticing a difference between using Auto Loader in an interactive notebook and using it in a Spark Declarative Pipeline (DLT pipeline). This issue seems very similar to this other unanswered post from a few years ago:&amp;nbsp;&lt;LI-MESSAGE title="Bug report: the delimiter option does not work when run on DLT" uid="54094" url="https://community.databricks.com/t5/get-started-discussions/bug-report-the-delimiter-option-does-not-work-when-run-on-dlt/m-p/54094#U54094" discussion_style_icon_css="lia-mention-container-editor-message lia-img-icon-forum-thread lia-fa-icon lia-fa-forum lia-fa-thread lia-fa"&gt;&lt;/LI-MESSAGE&gt;&amp;nbsp;&lt;BR /&gt;The CSV I am trying to ingest uses the pipe character (|) as its separator.&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; df = (spark.readStream&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .format("cloudFiles")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("cloudFiles.format", "csv")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("delimiter", "|")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("cloudFiles.schemaLocation", f"/Volumes/{location}/schema")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .load(f"/Volumes/{path to csv}")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .writeStream&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("checkpointLocation", f"/Volumes/{location}/checkpoint")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .trigger(availableNow=True)&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .toTable(f"{table_location}")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; )&lt;BR /&gt;```&lt;BR /&gt;^ Running this in a serverless notebook works without issues.&lt;/P&gt;&lt;P&gt;``` python&lt;BR /&gt;import dlt&lt;BR /&gt;&lt;BR /&gt;@dlt.table(&lt;BR /&gt;&amp;nbsp; &amp;nbsp; 
name=f"{bronze_raw_schema}.{table_name}",&lt;BR /&gt;)&lt;BR /&gt;def load_table():&lt;BR /&gt;&amp;nbsp; &amp;nbsp; # Parentheses let the method chain span multiple lines&lt;BR /&gt;&amp;nbsp; &amp;nbsp; return (&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; spark.readStream&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .format("cloudFiles")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("cloudFiles.format", "csv")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("delimiter", "|")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("header", "true")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .option("cloudFiles.includeExistingFiles", "true")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; .load(f"/Volumes/{path to csv}")&lt;BR /&gt;&amp;nbsp; &amp;nbsp; )&lt;BR /&gt;```&lt;/P&gt;&lt;P&gt;When I run the above in an SDP (DLT) pipeline, I get the following error. It appears that the reader is not splitting on the pipe (|) character, so the whole header row is treated as a single column name and hits the column-name length limit.&lt;/P&gt;&lt;P&gt;com.databricks.sql.managedcatalog.UnityCatalogServiceException: [ErrorClass=INVALID_PARAMETER_VALUE] Invalid input: RPC UpdateTable Field managedcatalog.ColumnInfo.name: At columns.0: name "HiPortSystemCode|HiportCode|CashAccountCode|ValuationDate|ContractDate|CashTransactionId|DeletedIndicator|CashTransactionTypeCode|AccountingDate|EntryDate|SettlementDate|OriginalAccountingDate|SecurityCode|AnalysisTypeCode|TransferCashAccountCode|Transact..." too long. Maximum length is 255 characters.&lt;/P&gt;</description>
      <pubDate>Fri, 16 Jan 2026 00:01:42 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/bug-report-sdp-dlt-with-autoloader-not-passing-through-pipe/m-p/144189#M52282</guid>
      <dc:creator>ChrisLawford_n1</dc:creator>
      <dc:date>2026-01-16T00:01:42Z</dc:date>
    </item>
    <item>
      <title>Re: Bug Report: SDP (DLT) with autoloader not passing through pipe delimiter/separator</title>
      <link>https://community.databricks.com/t5/data-engineering/bug-report-sdp-dlt-with-autoloader-not-passing-through-pipe/m-p/144346#M52312</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/121074"&gt;@ChrisLawford_n1&lt;/a&gt;, can you try doing a full refresh of this table? The old schema may still be in the schema location.&lt;/P&gt;</description>
      <pubDate>Sun, 18 Jan 2026 13:42:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/bug-report-sdp-dlt-with-autoloader-not-passing-through-pipe/m-p/144346#M52312</guid>
      <dc:creator>nikhilj0421</dc:creator>
      <dc:date>2026-01-18T13:42:23Z</dc:date>
    </item>
    <item>
      <title>Re: Bug Report: SDP (DLT) with autoloader not passing through pipe delimiter/separator</title>
      <link>https://community.databricks.com/t5/data-engineering/bug-report-sdp-dlt-with-autoloader-not-passing-through-pipe/m-p/144396#M52318</link>
      <description>&lt;P&gt;Hey,&lt;BR /&gt;Thanks,&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/30672"&gt;@nikhilj0421&lt;/a&gt;. I have now solved the issue, but not with a full refresh of the table. I had already tried that, and I had even deleted the DLT pipeline in the hope of getting a clean slate in case a lingering schema was the problem, but re-running the pipeline still failed with the same error. The solution was to change the output schema of the table/pipeline (in my case to {schema_name}_2) and run it again. I was then able to change the schema back to the original schema_name, and this time everything worked without a problem.&lt;BR /&gt;&lt;BR /&gt;Overall, I have learned that DLT pipelines must retain more information than I previously thought, even after they are deleted or fully refreshed.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Jan 2026 09:23:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/bug-report-sdp-dlt-with-autoloader-not-passing-through-pipe/m-p/144396#M52318</guid>
      <dc:creator>ChrisLawford_n1</dc:creator>
      <dc:date>2026-01-19T09:23:52Z</dc:date>
    </item>
  </channel>
</rss>

