<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Possible false positive warning on DLT pipeline in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/57622#M31149</link>
    <description>&lt;P&gt;It's DLT Stable, so whatever DBR that's using under the hood. Here's the JSON of the warning with UIDs etc redacted&lt;/P&gt;&lt;P&gt;{&lt;BR /&gt;"id": "redacted",&lt;BR /&gt;"sequence": {&lt;BR /&gt;"data_plane_id": {&lt;BR /&gt;"instance": "execution",&lt;BR /&gt;"seq_no": redacted&lt;BR /&gt;},&lt;BR /&gt;"control_plane_seq_no": redacted&lt;BR /&gt;},&lt;BR /&gt;"origin": {&lt;BR /&gt;"cloud": "Azure",&lt;BR /&gt;"region": "canadacentral",&lt;BR /&gt;"org_id": redacted,&lt;BR /&gt;"pipeline_id": "redacted",&lt;BR /&gt;"pipeline_type": "WORKSPACE",&lt;BR /&gt;"pipeline_name": "redacted",&lt;BR /&gt;"cluster_id": "redacted",&lt;BR /&gt;"update_id": "redacted",&lt;BR /&gt;"request_id": "redacted",&lt;BR /&gt;"uc_resource_id": "redacted"&lt;BR /&gt;},&lt;BR /&gt;"timestamp": "2024-01-17T15:15:47.639Z",&lt;BR /&gt;"message": "Notebook:/redacted/dlt_meta_pipeline used `DataFrame.collect` function that will be deprecated soon. Please fix the notebook.",&lt;BR /&gt;"level": "WARN",&lt;BR /&gt;"details": {&lt;BR /&gt;"unsupported_operation": {&lt;BR /&gt;"operation": "COLLECT_TO_DRIVER"&lt;BR /&gt;}&lt;BR /&gt;},&lt;BR /&gt;"event_type": "unsupported_operation",&lt;BR /&gt;"maturity_level": "STABLE"&lt;BR /&gt;}&lt;/P&gt;</description>
    <pubDate>Wed, 17 Jan 2024 21:57:51 GMT</pubDate>
    <dc:creator>ipreston</dc:creator>
    <dc:date>2024-01-17T21:57:51Z</dc:date>
    <item>
      <title>Possible false positive warning on DLT pipeline</title>
      <link>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/57612#M31147</link>
      <description>&lt;P&gt;I have a DLT pipeline script that starts by extracting metadata on the tables it should generate from a delta table. Each record returned from the table should be a dlt table to generate, so I use .collect() to turn each row into a list and then iterate on calling my DLT pipeline logic on it. I don't use .collect() in any dlt functions or functions that have a dlt decorator on them. When I run the pipeline I get a warning&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;Notebook:/&amp;lt;my path here&amp;gt;/dlt_meta_pipeline used `DataFrame.collect` function that will be deprecated soon. Please fix the notebook.&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://kb.databricks.com/en_US/delta-live-tables/delta-live-tables-job-fails-when-using-collect" target="_blank"&gt;Delta Live Tables job fails when using collect() - Databricks&lt;/A&gt;&lt;/P&gt;&lt;P&gt;based on the above post I think I should only be seeing this if I'm using collect to return results that I want to instantiate as a dlt managed table. Is this warning in error or do I actually have to change something?&lt;/P&gt;</description>
      <pubDate>Wed, 17 Jan 2024 17:27:40 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/57612#M31147</guid>
      <dc:creator>ipreston</dc:creator>
      <dc:date>2024-01-17T17:27:40Z</dc:date>
    </item>
    <item>
      <title>Re: Possible false positive warning on DLT pipeline</title>
      <link>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/57620#M31148</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/98350"&gt;@ipreston&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;Could you share the DBR version please? also share the full stack message.&lt;/P&gt;</description>
      <pubDate>Wed, 17 Jan 2024 21:53:42 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/57620#M31148</guid>
      <dc:creator>jose_gonzalez</dc:creator>
      <dc:date>2024-01-17T21:53:42Z</dc:date>
    </item>
    <item>
      <title>Re: Possible false positive warning on DLT pipeline</title>
      <link>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/57622#M31149</link>
      <description>&lt;P&gt;It's DLT Stable, so whatever DBR that's using under the hood. Here's the JSON of the warning with UIDs etc redacted&lt;/P&gt;&lt;P&gt;{&lt;BR /&gt;"id": "redacted",&lt;BR /&gt;"sequence": {&lt;BR /&gt;"data_plane_id": {&lt;BR /&gt;"instance": "execution",&lt;BR /&gt;"seq_no": redacted&lt;BR /&gt;},&lt;BR /&gt;"control_plane_seq_no": redacted&lt;BR /&gt;},&lt;BR /&gt;"origin": {&lt;BR /&gt;"cloud": "Azure",&lt;BR /&gt;"region": "canadacentral",&lt;BR /&gt;"org_id": redacted,&lt;BR /&gt;"pipeline_id": "redacted",&lt;BR /&gt;"pipeline_type": "WORKSPACE",&lt;BR /&gt;"pipeline_name": "redacted",&lt;BR /&gt;"cluster_id": "redacted",&lt;BR /&gt;"update_id": "redacted",&lt;BR /&gt;"request_id": "redacted",&lt;BR /&gt;"uc_resource_id": "redacted"&lt;BR /&gt;},&lt;BR /&gt;"timestamp": "2024-01-17T15:15:47.639Z",&lt;BR /&gt;"message": "Notebook:/redacted/dlt_meta_pipeline used `DataFrame.collect` function that will be deprecated soon. Please fix the notebook.",&lt;BR /&gt;"level": "WARN",&lt;BR /&gt;"details": {&lt;BR /&gt;"unsupported_operation": {&lt;BR /&gt;"operation": "COLLECT_TO_DRIVER"&lt;BR /&gt;}&lt;BR /&gt;},&lt;BR /&gt;"event_type": "unsupported_operation",&lt;BR /&gt;"maturity_level": "STABLE"&lt;BR /&gt;}&lt;/P&gt;</description>
      <pubDate>Wed, 17 Jan 2024 21:57:51 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/57622#M31149</guid>
      <dc:creator>ipreston</dc:creator>
      <dc:date>2024-01-17T21:57:51Z</dc:date>
    </item>
    <item>
      <title>Re: Possible false positive warning on DLT pipeline</title>
      <link>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/57624#M31150</link>
      <description>&lt;P&gt;The code is a fork of dlt-meta. Here's where it does the same operation:&amp;nbsp;&lt;A href="https://github.com/databrickslabs/dlt-meta/blob/2a93dd9ae42dfdb167b73629bd1acc8a256e7a49/src/dataflow_spec.py#L150" target="_blank"&gt;https://github.com/databrickslabs/dlt-meta/blob/2a93dd9ae42dfdb167b73629bd1acc8a256e7a49/src/dataflow_spec.py#L150&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 17 Jan 2024 22:02:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/57624#M31150</guid>
      <dc:creator>ipreston</dc:creator>
      <dc:date>2024-01-17T22:02:09Z</dc:date>
    </item>
    <item>
      <title>Re: Possible false positive warning on DLT pipeline</title>
      <link>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/57632#M31151</link>
      <description>&lt;P&gt;Thank you for sharing more details. In this case, this is a warning message level ("level": "WARN",), so it should be fine as long as you are not getting FATAL error level.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 18 Jan 2024 00:14:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/57632#M31151</guid>
      <dc:creator>jose_gonzalez</dc:creator>
      <dc:date>2024-01-18T00:14:14Z</dc:date>
    </item>
    <item>
      <title>Re: Possible false positive warning on DLT pipeline</title>
      <link>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/58178#M31152</link>
      <description>&lt;P&gt;Thanks for the reply. Based on that response though, it seems like the warning itself is a bug in the DLT implementation. Per &lt;A href="https://learn.microsoft.com/en-us/azure/databricks/delta-live-tables/python-ref" target="_self"&gt;the docs&lt;/A&gt;&amp;nbsp;"&lt;SPAN&gt;However, you can include these functions outside of&amp;nbsp;&lt;/SPAN&gt;table&lt;SPAN&gt;&amp;nbsp;or&amp;nbsp;&lt;/SPAN&gt;view&lt;SPAN&gt;&amp;nbsp;function definitions because this code is run once during the graph initialization phase." Is there a way to report this issue upstream? I'm not concerned about my code failing as a result of this warning, but I'd like to avoid having false positive alerts in the pipeline as it increases the risk that I'll miss an important warning amidst irrelevant ones like this.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 22 Jan 2024 16:01:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/58178#M31152</guid>
      <dc:creator>ipreston</dc:creator>
      <dc:date>2024-01-22T16:01:00Z</dc:date>
    </item>
    <item>
      <title>Re: Possible false positive warning on DLT pipeline</title>
      <link>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/68628#M33708</link>
      <description>&lt;P&gt;Are you using Take() or First() in your code?&amp;nbsp; I was using collect and get the same warning, but have since changed to Take() and still get the warning.&lt;/P&gt;</description>
      <pubDate>Wed, 08 May 2024 19:00:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/possible-false-positive-warning-on-dlt-pipeline/m-p/68628#M33708</guid>
      <dc:creator>Bal</dc:creator>
      <dc:date>2024-05-08T19:00:04Z</dc:date>
    </item>
  </channel>
</rss>

