<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Cannot import pyspark.pipelines module in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/cannot-import-pyspark-pipelines-module/m-p/135347#M50326</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/105089"&gt;@Saf4Databricks&lt;/a&gt;&amp;nbsp;- Are you trying to use it from a standalone Databricks notebook? You should only use it from within a Lakeflow Declarative Pipeline (LDP). The link you shared is about LDP. Here is an example where I used it.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="dkushari_0-1760814706612.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20842i5AE7EFE71C31D0D8/image-size/medium?v=v2&amp;amp;px=400" role="button" title="dkushari_0-1760814706612.png" alt="dkushari_0-1760814706612.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="dkushari_1-1760814715212.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20843i93899211DE0B177D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="dkushari_1-1760814715212.png" alt="dkushari_1-1760814715212.png" /&gt;&lt;/span&gt;&lt;/P&gt;
</description>
    <pubDate>Sat, 18 Oct 2025 19:12:35 GMT</pubDate>
    <dc:creator>dkushari</dc:creator>
    <dc:date>2025-10-18T19:12:35Z</dc:date>
    <item>
      <title>Cannot import pyspark.pipelines module</title>
      <link>https://community.databricks.com/t5/data-engineering/cannot-import-pyspark-pipelines-module/m-p/135293#M50322</link>
      <description>&lt;P&gt;&lt;STRONG&gt;Question&lt;/STRONG&gt;: What could be causing the following error in my Databricks notebook, and how can I fix it? I'm using the latest Free Edition of Databricks, which has runtime version 17.2 and PySpark version 4.0.0.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Error&lt;/STRONG&gt;:&lt;BR /&gt;ImportError: cannot import name 'pipelines' from 'pyspark' (/databricks/python/lib/python3.12/site-packages/pyspark/__init__.py)&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;The following line at the top of the Databricks notebook throws the error&lt;/STRONG&gt;:&lt;/P&gt;&lt;LI-CODE lang="python"&gt;from pyspark import pipelines as dp&lt;/LI-CODE&gt;&lt;P&gt;&lt;STRONG&gt;NOTE&lt;/STRONG&gt;: According to the following quote from &lt;A title="Basics of Python for pipeline development" href="https://docs.databricks.com/gcp/en/ldp/developer/python-dev#basics-of-python-for-pipeline-development" target="_blank" rel="noopener"&gt;Basics of Python for pipeline development&lt;/A&gt; from Databricks' team, we need to import the above module to create &lt;STRONG&gt;Lakeflow Declarative Pipelines&lt;/STRONG&gt;&amp;nbsp;using Python:&lt;/P&gt;&lt;P&gt;&lt;EM&gt;All Lakeflow Declarative Pipelines Python APIs are implemented in the `pyspark.pipelines` module.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;Also, as we know, PySpark is an integral and primary programming interface within the Databricks platform. So, what am I missing here that causes the error?&lt;/P&gt;</description>
      <pubDate>Sat, 18 Oct 2025 00:01:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/cannot-import-pyspark-pipelines-module/m-p/135293#M50322</guid>
      <dc:creator>Saf4Databricks</dc:creator>
      <dc:date>2025-10-18T00:01:24Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot import pyspark.pipelines module</title>
      <link>https://community.databricks.com/t5/data-engineering/cannot-import-pyspark-pipelines-module/m-p/135347#M50326</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/105089"&gt;@Saf4Databricks&lt;/a&gt;&amp;nbsp;- Are you trying to use it from a standalone Databricks notebook? You should only use it from within a Lakeflow Declarative Pipeline (LDP). The link you shared is about LDP. Here is an example where I used it.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="dkushari_0-1760814706612.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20842i5AE7EFE71C31D0D8/image-size/medium?v=v2&amp;amp;px=400" role="button" title="dkushari_0-1760814706612.png" alt="dkushari_0-1760814706612.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="dkushari_1-1760814715212.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20843i93899211DE0B177D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="dkushari_1-1760814715212.png" alt="dkushari_1-1760814715212.png" /&gt;&lt;/span&gt;&lt;/P&gt;
</description>
      <pubDate>Sat, 18 Oct 2025 19:12:35 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/cannot-import-pyspark-pipelines-module/m-p/135347#M50326</guid>
      <dc:creator>dkushari</dc:creator>
      <dc:date>2025-10-18T19:12:35Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot import pyspark.pipelines module</title>
      <link>https://community.databricks.com/t5/data-engineering/cannot-import-pyspark-pipelines-module/m-p/135353#M50328</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/38309"&gt;@dkushari&lt;/a&gt;&amp;nbsp;Thank you for responding. I'm working on the following tutorial from your team:&amp;nbsp;&lt;A title="What is change data capture (CDC)? | Databricks on AWS" href="https://docs.databricks.com/aws/en/ldp/what-is-change-data-capture" target="_blank" rel="noopener"&gt;What is change data capture (CDC)? | Databricks on AWS&lt;/A&gt;. The code in &lt;STRONG&gt;step 1&lt;/STRONG&gt; runs fine, but the code in &lt;STRONG&gt;step 2&lt;/STRONG&gt; fails at the following line at the top:&lt;/P&gt;&lt;LI-CODE lang="python"&gt;from pyspark import pipelines as dp&lt;/LI-CODE&gt;&lt;P&gt;What would you suggest I do to make this tutorial from your Databricks team work?&lt;/P&gt;</description>
      <pubDate>Sat, 18 Oct 2025 22:51:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/cannot-import-pyspark-pipelines-module/m-p/135353#M50328</guid>
      <dc:creator>Saf4Databricks</dc:creator>
      <dc:date>2025-10-18T22:51:11Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot import pyspark.pipelines module</title>
      <link>https://community.databricks.com/t5/data-engineering/cannot-import-pyspark-pipelines-module/m-p/135357#M50329</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/105089"&gt;@Saf4Databricks&lt;/a&gt;&amp;nbsp;- Yes, this is for LDP only, as you can see &lt;A href="https://docs.databricks.com/aws/en/ldp/what-is-change-data-capture#examples-of-scd-type-1-and-type-2-processing-with-lakeflow-declarative-pipelines" target="_self"&gt;here&lt;/A&gt; in the heading. And as you can see &lt;A href="https://docs.databricks.com/aws/en/ldp/what-is-change-data-capture#step-2-use-scd-type-1-to-keep-only-the-latest-data" target="_self"&gt;here&lt;/A&gt;,&amp;nbsp;points 1 and 2 are where you create and copy the notebook, but you do not run it. In point 3 you create the pipeline, and then you can run it as I showed you in my previous post.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="dkushari_1-1760829574829.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20846iF856D5FA7A958A95/image-size/medium?v=v2&amp;amp;px=400" role="button" title="dkushari_1-1760829574829.png" alt="dkushari_1-1760829574829.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="dkushari_0-1760829525424.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20845i2FC13EEC5F10516E/image-size/medium?v=v2&amp;amp;px=400" role="button" title="dkushari_0-1760829525424.png" alt="dkushari_0-1760829525424.png" /&gt;&lt;/span&gt;&lt;/P&gt;
</description>
      <pubDate>Sat, 18 Oct 2025 23:26:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/cannot-import-pyspark-pipelines-module/m-p/135357#M50329</guid>
      <dc:creator>dkushari</dc:creator>
      <dc:date>2025-10-18T23:26:49Z</dc:date>
    </item>
  </channel>
</rss>

