<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to select from a very large string column of a Delta table? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-to-select-from-a-very-large-column-string-of-delta-table/m-p/32105#M23402</link>
    <description>&lt;P&gt;In one of my Delta tables, the string column "abc" holds a value that is 1753484 characters long. I get an error when selecting or transforming this column value (in the downstream application). How do I solve this?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;SELECT ID, abc, length(abc) AS len&lt;/P&gt;&lt;P&gt;FROM &amp;lt;delta_table_name&amp;gt;&lt;/P&gt;&lt;P&gt;WHERE ID = 28&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;java.lang.Exception: Results too large at com.databricks.backend.daemon.driver.OutputAggregator$.maybeApplyOutputAggregation(OutputAggregator.scala:303) at com.databricks.backend.daemon.driver.OutputAggregator$.withOutputAggregation0(OutputAggregator.scala:206) at com.databricks.backend.daemon.driver.OutputAggregator$.withOutputAggregation(OutputAggregator.scala:57) at com.databricks.backend.daemon.driver.SQLDriverLocal.executeSql(SQLDriverLocal.scala:114) at com.databricks.backend.daemon.driver.SQLDriverLocal.repl(SQLDriverLocal.scala:143) at com.databricks.backend.daemon.driver.DriverLocal.$anonfun$execute$10(DriverLocal.scala:431) at com.databricks.logging.UsageLogging.$anonfun$withAttributionContext$1(UsageLogging.scala:239) at scala.util.DynamicVariable.withValue(DynamicVariable.scala:62) at com.databricks.logging.UsageLogging.withAttributionContext(UsageLogging.scala:234) at com.databricks.logging.UsageLogging.withAttributionContext$(UsageLogging.scala:231) at com.databricks.backend.daemon.driver.DriverLocal.withAttributionContext(DriverLocal.scala:48) at com.databricks.logging.UsageLogging.withAttributionTags(UsageLogging.scala:276) at com.databricks.logging.UsageLogging.withAttributionTags$(UsageLogging.scala:269) at com.databricks.backend.daemon.driver.DriverLocal.withAttributionTags(DriverLocal.scala:48) at com.databricks.backend.daemon.driver.DriverLocal.execute(DriverLocal.scala:408) at com.databricks.backend.daemon.driver.DriverWrapper.$anonfun$tryExecutingCommand$1(DriverWrapper.scala:653) at
scala.util.Try$.apply(Try.scala:213) at com.databricks.backend.daemon.driver.DriverWrapper.tryExecutingCommand(DriverWrapper.scala:645) at com.databricks.backend.daemon.driver.DriverWrapper.getCommandOutputAndError(DriverWrapper.scala:486) at com.databricks.backend.daemon.driver.DriverWrapper.executeCommand(DriverWrapper.scala:598) at com.databricks.backend.daemon.driver.DriverWrapper.runInnerLoop(DriverWrapper.scala:391) at com.databricks.backend.daemon.driver.DriverWrapper.runInner(DriverWrapper.scala:337) at com.databricks.backend.daemon.driver.DriverWrapper.run(DriverWrapper.scala:219) at java.lang.Thread.run(Thread.java:748)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I can do the following, but it does not solve the original issue:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;df = spark.sql('''SELECT ID, abc, length(abc) FROM &amp;lt;delta table&amp;gt; WHERE ID = 28''')&lt;/P&gt;&lt;P&gt;df.show()&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;ID | abc | length&lt;/P&gt;&lt;P&gt;------------------------------------------&lt;/P&gt;&lt;P&gt;28 | {"PolicyMessage":... | 1753484&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The abc column value is displayed truncated, which I think is a notebook/browser limitation; the actual value is not truncated.&lt;/P&gt;</description>
    <pubDate>Thu, 06 Jan 2022 21:00:21 GMT</pubDate>
    <dc:creator>prasadvaze</dc:creator>
    <dc:date>2022-01-06T21:00:21Z</dc:date>
    <item>
      <title>How to select from a very large string column of a Delta table?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-select-from-a-very-large-column-string-of-delta-table/m-p/32105#M23402</link>
      <description>&lt;P&gt;In one of my Delta tables, the string column "abc" holds a value that is 1753484 characters long. I get an error when selecting or transforming this column value (in the downstream application). How do I solve this?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;SELECT ID, abc, length(abc) AS len&lt;/P&gt;&lt;P&gt;FROM &amp;lt;delta_table_name&amp;gt;&lt;/P&gt;&lt;P&gt;WHERE ID = 28&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;java.lang.Exception: Results too large at com.databricks.backend.daemon.driver.OutputAggregator$.maybeApplyOutputAggregation(OutputAggregator.scala:303) at com.databricks.backend.daemon.driver.OutputAggregator$.withOutputAggregation0(OutputAggregator.scala:206) at com.databricks.backend.daemon.driver.OutputAggregator$.withOutputAggregation(OutputAggregator.scala:57) at com.databricks.backend.daemon.driver.SQLDriverLocal.executeSql(SQLDriverLocal.scala:114) at com.databricks.backend.daemon.driver.SQLDriverLocal.repl(SQLDriverLocal.scala:143) at com.databricks.backend.daemon.driver.DriverLocal.$anonfun$execute$10(DriverLocal.scala:431) at com.databricks.logging.UsageLogging.$anonfun$withAttributionContext$1(UsageLogging.scala:239) at scala.util.DynamicVariable.withValue(DynamicVariable.scala:62) at com.databricks.logging.UsageLogging.withAttributionContext(UsageLogging.scala:234) at com.databricks.logging.UsageLogging.withAttributionContext$(UsageLogging.scala:231) at com.databricks.backend.daemon.driver.DriverLocal.withAttributionContext(DriverLocal.scala:48) at com.databricks.logging.UsageLogging.withAttributionTags(UsageLogging.scala:276) at com.databricks.logging.UsageLogging.withAttributionTags$(UsageLogging.scala:269) at com.databricks.backend.daemon.driver.DriverLocal.withAttributionTags(DriverLocal.scala:48) at com.databricks.backend.daemon.driver.DriverLocal.execute(DriverLocal.scala:408) at com.databricks.backend.daemon.driver.DriverWrapper.$anonfun$tryExecutingCommand$1(DriverWrapper.scala:653) at
scala.util.Try$.apply(Try.scala:213) at com.databricks.backend.daemon.driver.DriverWrapper.tryExecutingCommand(DriverWrapper.scala:645) at com.databricks.backend.daemon.driver.DriverWrapper.getCommandOutputAndError(DriverWrapper.scala:486) at com.databricks.backend.daemon.driver.DriverWrapper.executeCommand(DriverWrapper.scala:598) at com.databricks.backend.daemon.driver.DriverWrapper.runInnerLoop(DriverWrapper.scala:391) at com.databricks.backend.daemon.driver.DriverWrapper.runInner(DriverWrapper.scala:337) at com.databricks.backend.daemon.driver.DriverWrapper.run(DriverWrapper.scala:219) at java.lang.Thread.run(Thread.java:748)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I can do the following, but it does not solve the original issue:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;df = spark.sql('''SELECT ID, abc, length(abc) FROM &amp;lt;delta table&amp;gt; WHERE ID = 28''')&lt;/P&gt;&lt;P&gt;df.show()&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;ID | abc | length&lt;/P&gt;&lt;P&gt;------------------------------------------&lt;/P&gt;&lt;P&gt;28 | {"PolicyMessage":... | 1753484&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The abc column value is displayed truncated, which I think is a notebook/browser limitation; the actual value is not truncated.&lt;/P&gt;</description>
      <pubDate>Thu, 06 Jan 2022 21:00:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-select-from-a-very-large-column-string-of-delta-table/m-p/32105#M23402</guid>
      <dc:creator>prasadvaze</dc:creator>
      <dc:date>2022-01-06T21:00:21Z</dc:date>
    </item>
  </channel>
</rss>