<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Enhance Documentation for databricks-connect for python in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/enhance-documentation-for-databricks-connect-for-python/m-p/61623#M6611</link>
    <description>&lt;P&gt;Additionally, the documentation does not explain how to avoid uploading the virtual environment to the catalog when using the Databricks extension for VS Code.&lt;/P&gt;</description>
    <pubDate>Thu, 22 Feb 2024 22:44:19 GMT</pubDate>
    <dc:creator>santiagortiiz</dc:creator>
    <dc:date>2024-02-22T22:44:19Z</dc:date>
    <item>
      <title>Enhance Documentation for databricks-connect for python</title>
      <link>https://community.databricks.com/t5/get-started-discussions/enhance-documentation-for-databricks-connect-for-python/m-p/61620#M6610</link>
      <description>&lt;P&gt;Hello, I'm setting up a Python environment to work with Databricks in VS Code on Windows.&lt;/P&gt;&lt;P&gt;I followed the instructions in &lt;A title="documentation" href="https://docs.databricks.com/en/dev-tools/databricks-connect/python/install.html#install-databricks-connect-for-python" target="_blank" rel="noopener"&gt;Install Databricks Connect for Python&lt;/A&gt; and found several issues, conflicts, and errors:&lt;/P&gt;&lt;P&gt;- There is a useless tip in the &lt;A title="setup" href="https://docs.databricks.com/en/dev-tools/databricks-connect/python/install.html#set-up-the-client" target="_blank" rel="noopener"&gt;Set up the client&lt;/A&gt; section. It says that if you have the VS Code extension, you do not need to follow the remaining instructions. That makes no sense: you still need to configure and set up the client.&lt;/P&gt;&lt;P&gt;- The documentation asks you to uninstall pyspark in order to install databricks-connect, but the latter does not install pyspark. The problem is that the end of the documentation page says:&lt;BR /&gt;&lt;U&gt;"run a simple PySpark command, such as&amp;nbsp;spark.range(1,10).show(). If there are no errors, you have successfully connected."&lt;/U&gt;&lt;SPAN&gt;&lt;BR /&gt;This raises the following error:&lt;/SPAN&gt;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;Traceback (most recent call last):
  File "C:\Users\Santiago_Ortiz\EPAM\Clients\RDSA\poc\.venv\Scripts\find_spark_home.py", line 92, in &amp;lt;module&amp;gt;
    print(_find_spark_home())
          ^^^^^^^^^^^^^^^^^^
  File "C:\Users\Santiago_Ortiz\EPAM\Clients\RDSA\poc\.venv\Scripts\find_spark_home.py", line 56, in _find_spark_home
    module_home = os.path.dirname(find_spec("pyspark").origin)
                                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'NoneType' object has no attribute 'origin'
\Users\Santiago_Ortiz\EPAM\Clients\RDSA\poc\.venv/Scripts/pyspark: line 24: /bin/load-spark-env.sh: No such file or directory
Python 3.11.6 (tags/v3.11.6:8b6ee5b, Oct  2 2023, 14:57:12) [MSC v.1935 64 bit (AMD64)] on win32
Type "help", "copyright", "credits" or "license" for more information.
Could not open PYTHONSTARTUP
FileNotFoundError: [Errno 2] No such file or directory: 'C:/Program Files/Git/shell.py'&lt;/LI-CODE&gt;&lt;P&gt;Also, spark is not available as a global variable:&lt;/P&gt;&lt;LI-CODE lang="python"&gt;&amp;gt;&amp;gt;&amp;gt; spark
Traceback (most recent call last):
  File "&amp;lt;stdin&amp;gt;", line 1, in &amp;lt;module&amp;gt;&lt;/LI-CODE&gt;&lt;P&gt;The documentation is very disorganised: the steps as described cannot be reproduced by anyone to achieve the desired result.&lt;/P&gt;</description>
      <pubDate>Thu, 22 Feb 2024 22:36:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/enhance-documentation-for-databricks-connect-for-python/m-p/61620#M6610</guid>
      <dc:creator>santiagortiiz</dc:creator>
      <dc:date>2024-02-22T22:36:53Z</dc:date>
    </item>
    <item>
      <title>Re: Enhance Documentation for databricks-connect for python</title>
      <link>https://community.databricks.com/t5/get-started-discussions/enhance-documentation-for-databricks-connect-for-python/m-p/61623#M6611</link>
      <description>&lt;P&gt;Additionally, the documentation does not explain how to avoid uploading the virtual environment to the catalog when using the Databricks extension for VS Code.&lt;/P&gt;</description>
      <pubDate>Thu, 22 Feb 2024 22:44:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/enhance-documentation-for-databricks-connect-for-python/m-p/61623#M6611</guid>
      <dc:creator>santiagortiiz</dc:creator>
      <dc:date>2024-02-22T22:44:19Z</dc:date>
    </item>
  </channel>
</rss>

