<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic setup justfile command in order to launch your spark application in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/setup-justfile-command-in-order-to-launch-your-spark-application/m-p/150618#M53483</link>
    <description>&lt;P&gt;Hello Guys,&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Actually, I build a just file for my project which will be execute my wheel job task using command line, but when i run my wheel task i have encountered this error.&amp;nbsp;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;from pyspark.sql.connect.expressions import PythonUDFEnvironment&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;ImportError: cannot import name 'PythonUDFEnvironment' from 'pyspark.sql.connect.expressions&lt;/FONT&gt;&lt;BR /&gt;&lt;BR /&gt;Someone know how to solve this issue?&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;justfile&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;&lt;SPAN&gt;default&lt;/SPAN&gt;:&lt;BR /&gt;    @&lt;SPAN&gt;just&lt;/SPAN&gt; --list&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;install&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;poetry&lt;/SPAN&gt; install&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;poetry-remove-pyspark&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;poetry&lt;/SPAN&gt; show pyspark # Is PySpark already installed?&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;poetry-uninstall-pyspark&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;poetry&lt;/SPAN&gt; remove pyspark&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;test-connexion-databricks&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;databricks-connect&lt;/SPAN&gt; test&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;poetry-install-pyspark&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;poetry&lt;/SPAN&gt; add "pyspark (&amp;gt;=3.5.5,&amp;lt;=4.1.1)"&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;poetry-add-pyspark-connect&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;poetry&lt;/SPAN&gt; add databricks-connect@~17.3  # Or X.Y to match your cluster version.&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;edpiqual-example&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;python&lt;/SPAN&gt; -c "from edpiqual.entrypoint import main; main()" \&lt;BR /&gt;    --ingestion_catalog_name default \&lt;BR /&gt;    --product_catalog_name workspace \&lt;BR /&gt;    --data_bundle Transactions --data_bundle_object test1\&lt;BR /&gt;    --format csv --source_type autoloader --trigger availableNow \&lt;BR /&gt;    --agreement_version 1 --start_date 2026-03-08T04:01:40.285Z \&lt;BR /&gt;    --write_mode upsert --upsert_column reference_id&lt;/PRE&gt;&lt;/DIV&gt;&lt;P&gt;&lt;BR /&gt;this is my just file&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 11 Mar 2026 17:06:23 GMT</pubDate>
    <dc:creator>seefoods</dc:creator>
    <dc:date>2026-03-11T17:06:23Z</dc:date>
    <item>
      <title>setup justfile command in order to launch your spark application</title>
      <link>https://community.databricks.com/t5/data-engineering/setup-justfile-command-in-order-to-launch-your-spark-application/m-p/150618#M53483</link>
      <description>&lt;P&gt;Hello Guys,&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Actually, I build a just file for my project which will be execute my wheel job task using command line, but when i run my wheel task i have encountered this error.&amp;nbsp;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;from pyspark.sql.connect.expressions import PythonUDFEnvironment&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;ImportError: cannot import name 'PythonUDFEnvironment' from 'pyspark.sql.connect.expressions&lt;/FONT&gt;&lt;BR /&gt;&lt;BR /&gt;Someone know how to solve this issue?&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;justfile&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;&lt;SPAN&gt;default&lt;/SPAN&gt;:&lt;BR /&gt;    @&lt;SPAN&gt;just&lt;/SPAN&gt; --list&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;install&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;poetry&lt;/SPAN&gt; install&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;poetry-remove-pyspark&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;poetry&lt;/SPAN&gt; show pyspark # Is PySpark already installed?&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;poetry-uninstall-pyspark&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;poetry&lt;/SPAN&gt; remove pyspark&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;test-connexion-databricks&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;databricks-connect&lt;/SPAN&gt; test&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;poetry-install-pyspark&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;poetry&lt;/SPAN&gt; add "pyspark (&amp;gt;=3.5.5,&amp;lt;=4.1.1)"&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;poetry-add-pyspark-connect&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;poetry&lt;/SPAN&gt; add databricks-connect@~17.3  # Or X.Y to match your cluster version.&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;edpiqual-example&lt;/SPAN&gt;:&lt;BR /&gt;    &lt;SPAN&gt;python&lt;/SPAN&gt; -c "from edpiqual.entrypoint import main; main()" \&lt;BR /&gt;    --ingestion_catalog_name default \&lt;BR /&gt;    --product_catalog_name workspace \&lt;BR /&gt;    --data_bundle Transactions --data_bundle_object test1\&lt;BR /&gt;    --format csv --source_type autoloader --trigger availableNow \&lt;BR /&gt;    --agreement_version 1 --start_date 2026-03-08T04:01:40.285Z \&lt;BR /&gt;    --write_mode upsert --upsert_column reference_id&lt;/PRE&gt;&lt;/DIV&gt;&lt;P&gt;&lt;BR /&gt;this is my just file&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 11 Mar 2026 17:06:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/setup-justfile-command-in-order-to-launch-your-spark-application/m-p/150618#M53483</guid>
      <dc:creator>seefoods</dc:creator>
      <dc:date>2026-03-11T17:06:23Z</dc:date>
    </item>
    <item>
      <title>Re: setup justfile command in order to launch your spark application</title>
      <link>https://community.databricks.com/t5/data-engineering/setup-justfile-command-in-order-to-launch-your-spark-application/m-p/150972#M53551</link>
      <description>&lt;P&gt;This error typically happens when there’s a version mismatch between your local pyspark installation and databricks-connect.&lt;BR /&gt;PythonUDFEnvironment was introduced in a specific version of the Databricks Connect SDK. If you have a standalone pyspark package installed alongside databricks-connect, it shadows the correct one bundled with the connector.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If this is the main issue: try to remove pyspark, remove only databricks-connect. Via poetry, verify if pyspark is not installed (poetry show).&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 15 Mar 2026 11:03:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/setup-justfile-command-in-order-to-launch-your-spark-application/m-p/150972#M53551</guid>
      <dc:creator>mderela</dc:creator>
      <dc:date>2026-03-15T11:03:52Z</dc:date>
    </item>
    <item>
      <title>Re: setup justfile command in order to launch your spark application</title>
      <link>https://community.databricks.com/t5/data-engineering/setup-justfile-command-in-order-to-launch-your-spark-application/m-p/152194#M53783</link>
      <description>&lt;P&gt;&lt;SPAN&gt;This &lt;/SPAN&gt;&lt;SPAN&gt;ImportError&lt;/SPAN&gt;&lt;SPAN&gt; happens because you have &lt;/SPAN&gt;&lt;STRONG&gt;both&lt;/STRONG&gt;&lt;SPAN&gt; standalone &lt;/SPAN&gt;&lt;SPAN&gt;pyspark&lt;/SPAN&gt;&lt;SPAN&gt; and &lt;/SPAN&gt;&lt;SPAN&gt;databricks-connect&lt;/SPAN&gt;&lt;SPAN&gt; installed, and they conflict with each other. &lt;/SPAN&gt;&lt;SPAN&gt;databricks-connect&lt;/SPAN&gt;&lt;SPAN&gt; bundles its own version of PySpark internally — when the standalone &lt;/SPAN&gt;&lt;SPAN&gt;pyspark&lt;/SPAN&gt;&lt;SPAN&gt; package is also present, Python imports from the wrong one, which doesn't have &lt;/SPAN&gt;&lt;SPAN&gt;PythonUDFEnvironment&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Fix:&lt;/STRONG&gt;&lt;SPAN&gt; Remove the standalone &lt;/SPAN&gt;&lt;SPAN&gt;pyspark&lt;/SPAN&gt;&lt;SPAN&gt; and only use &lt;/SPAN&gt;&lt;SPAN&gt;databricks-connect&lt;/SPAN&gt;&lt;SPAN&gt;:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;# Remove standalone pyspark first&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;poetry-remove-pyspark:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;poetry remove pyspark&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;# Install databricks-connect (which bundles compatible pyspark)&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;poetry-add-databricks-connect:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;poetry add databricks-connect@~17.3&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;# Verify no standalone pyspark is installed&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;check-deps:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;poetry show pyspark 2&amp;gt;&amp;amp;1 || echo "OK: no standalone pyspark"&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;poetry show databricks-connect&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Key rules to avoid this:&lt;/SPAN&gt;&lt;/P&gt;
&lt;OL&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;STRONG&gt;Never install `pyspark` alongside `databricks-connect`&lt;/STRONG&gt;&lt;SPAN&gt; — they conflict&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;databricks-connect&lt;/SPAN&gt;&lt;SPAN&gt; version must match your &lt;/SPAN&gt;&lt;STRONG&gt;cluster DBR version&lt;/STRONG&gt;&lt;SPAN&gt; (e.g., &lt;/SPAN&gt;&lt;SPAN&gt;~17.3&lt;/SPAN&gt;&lt;SPAN&gt; for DBR 17.3)&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;After removing pyspark, clear any cached &lt;/SPAN&gt;&lt;SPAN&gt;.pyc&lt;/SPAN&gt;&lt;SPAN&gt; files: &lt;/SPAN&gt;&lt;SPAN&gt;find . -name "*.pyc" -delete&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;&lt;SPAN&gt;If you need standalone PySpark for local-only testing (no Databricks), keep them in &lt;/SPAN&gt;&lt;STRONG&gt;separate Poetry dependency groups&lt;/STRONG&gt;&lt;SPAN&gt; and never activate both simultaneously.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 26 Mar 2026 15:31:02 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/setup-justfile-command-in-order-to-launch-your-spark-application/m-p/152194#M53783</guid>
      <dc:creator>anuj_lathi</dc:creator>
      <dc:date>2026-03-26T15:31:02Z</dc:date>
    </item>
  </channel>
</rss>

