<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Databricks Jobs do not run on job compute but on shared compute in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70510#M34080</link>
    <description>&lt;P&gt;Yes, I did the same. However, so we have to switch the code from local (VS Code) implementation to Databricks runs (Jobs/Workflow).&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&amp;nbsp;: Could you check this new issue?&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 23 May 2024 14:46:09 GMT</pubDate>
    <dc:creator>dbruehlmeier</dc:creator>
    <dc:date>2024-05-23T14:46:09Z</dc:date>
    <item>
      <title>Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70356#M34055</link>
      <description>&lt;P&gt;&lt;SPAN class=""&gt;Hello,&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;since last night none of our ETL jobs in Databricks are running anymore, although we have not made any code changes.&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;The identical jobs (deployed with Databricks asset bundles) run on an all-purpose cluster, but fail on a job cluster. &lt;/SPAN&gt;&lt;SPAN class=""&gt;We have not changed anything in the cluster configuration. &lt;/SPAN&gt;&lt;SPAN class=""&gt;The Databricks runtime version is also identical (14.3 LTS (includes Apache Spark 3.5.0, Scala 2.12)). &lt;/SPAN&gt;&lt;SPAN class=""&gt;We have also compared the code and double-checked the configurations.&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;What could be the reason for the jobs failing without us having made any changes? &lt;/SPAN&gt;&lt;SPAN class=""&gt;Have there been changes to Databricks that cause this?&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Error messages:&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;[NOT_COLUMN] Argument `col` should be a Column, got Column.&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;[SESSION_ALREADY_EXIST] Cannot start a remote Spark session because there is a regular Spark session already running.&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Does anyone else have problems with jobs?&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Best regards&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Robin&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 23 May 2024 06:41:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70356#M34055</guid>
      <dc:creator>RobinK</dc:creator>
      <dc:date>2024-05-23T06:41:07Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70504#M34074</link>
      <description>&lt;P&gt;Hi Robin&lt;/P&gt;&lt;P&gt;Do you use Databricks Connect creating spark Session?&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;&lt;SPAN&gt;from&lt;/SPAN&gt; &lt;SPAN&gt;databricks&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;connect&lt;/SPAN&gt; &lt;SPAN&gt;import&lt;/SPAN&gt; &lt;SPAN&gt;DatabricksSession&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;spark&lt;/SPAN&gt; &lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;DatabricksSession&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;builder&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;getOrCreate&lt;/SPAN&gt;&lt;SPAN&gt;()&lt;/SPAN&gt;&lt;/PRE&gt;&lt;DIV&gt;&lt;SPAN&gt;We are facing same issue on single user access cluster and in jobs.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Thu, 23 May 2024 14:16:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70504#M34074</guid>
      <dc:creator>dbruehlmeier</dc:creator>
      <dc:date>2024-05-23T14:16:23Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70507#M34077</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&amp;nbsp;: We exactly use your second solution. And we get same issue&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;&lt;SPAN&gt;from&lt;/SPAN&gt;&lt;SPAN&gt; databricks.connect &lt;/SPAN&gt;&lt;SPAN&gt;import&lt;/SPAN&gt;&lt;SPAN&gt; DatabricksSession&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;spark &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; DatabricksSession.builder.&lt;/SPAN&gt;&lt;SPAN&gt;getOrCreate&lt;/SPAN&gt;&lt;SPAN&gt;()&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;schema &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;StructType&lt;/SPAN&gt;&lt;SPAN&gt;([&lt;/SPAN&gt;&lt;SPAN&gt;StructField&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;'category'&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;StringType&lt;/SPAN&gt;&lt;SPAN&gt;(), &lt;/SPAN&gt;&lt;SPAN&gt;True&lt;/SPAN&gt;&lt;SPAN&gt;), &lt;/SPAN&gt;&lt;SPAN&gt;StructField&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;'weight'&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;DoubleType&lt;/SPAN&gt;&lt;SPAN&gt;(), &lt;/SPAN&gt;&lt;SPAN&gt;True&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;SPAN&gt;])&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;data_source &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;"abfss://......_index_v01_??????_????????.csv"&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;df &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; (spark.read.&lt;/SPAN&gt;&lt;SPAN&gt;format&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;"csv"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt; .&lt;/SPAN&gt;&lt;SPAN&gt;options&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;**&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;'header'&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;'true'&lt;/SPAN&gt;&lt;SPAN&gt;})&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt; .&lt;/SPAN&gt;&lt;SPAN&gt;schema&lt;/SPAN&gt;&lt;SPAN&gt;(schema)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt; .&lt;/SPAN&gt;&lt;SPAN&gt;load&lt;/SPAN&gt;&lt;SPAN&gt;(data_source))&lt;/SPAN&gt;&lt;/PRE&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;[&lt;A href="https://docs.microsoft.com/azure/databricks/error-messages/error-classes#session_already_exist" target="_blank" rel="noopener noreferrer"&gt;SESSION_ALREADY_EXIST&lt;/A&gt;] Cannot start a remote Spark session because there is a regular Spark session already running.&lt;/P&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;P&gt;&lt;FONT face="terminal,monaco" size="2"&gt;File /databricks/spark/python/pyspark/instrumentation_utils.py:47, in _wrap_function.&amp;lt;locals&amp;gt;.wrapper(*args, **kwargs) 45 start = time.perf_counter() 46 try: ---&amp;gt; 47 res = func(*args, **kwargs) 48 logger.log_success( 49 module_name, class_name, function_name, time.perf_counter() - start, signature 50 ) 51 return res&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="terminal,monaco" size="2"&gt;File /databricks/spark/python/pyspark/sql/readwriter.py:150, in DataFrameReader.schema(self, schema) 117 """Specifies the input schema. 118 119 Some data sources (e.g. JSON) can infer the input schema automatically from data. (...) 146 |-- col1: double (nullable = true) 147 """ 148 from pyspark.sql import SparkSession --&amp;gt; 150 spark = SparkSession._getActiveSessionOrCreate() 151 if isinstance(schema, StructType): 152 jschema = spark._jsparkSession.parseDataType(schema.json())&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="terminal,monaco" size="2"&gt;File /databricks/spark/python/pyspark/sql/session.py:1265, in SparkSession._getActiveSessionOrCreate(**static_conf) 1263 for k, v in static_conf.items(): 1264 builder = builder.config(k, v) -&amp;gt; 1265 spark = builder.getOrCreate() 1266 return spark&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="terminal,monaco" size="2"&gt;File /databricks/spark/python/pyspark/sql/session.py:521, in SparkSession.Builder.getOrCreate(self) 519 return RemoteSparkSession.builder.config(map=opts).getOrCreate() 520 else: --&amp;gt; 521 raise PySparkRuntimeError( 522 error_class="SESSION_ALREADY_EXIST", 523 message_parameters={}, 524 ) 526 session = SparkSession._instantiatedSession 527 if session is None or session._sc._jsc is None:&lt;/FONT&gt;&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Thu, 23 May 2024 14:37:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70507#M34077</guid>
      <dc:creator>dbruehlmeier</dc:creator>
      <dc:date>2024-05-23T14:37:53Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70508#M34078</link>
      <description>&lt;P&gt;switching from&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;STRONG&gt;spark = DatabricksSession.builder.getOrCreate()&lt;/STRONG&gt;&lt;/DIV&gt;&lt;DIV&gt;to&lt;/DIV&gt;&lt;DIV&gt;&lt;STRONG&gt;spark = SparkSession.builder.getOrCreate()&lt;/STRONG&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;solved the issue. Strange exception nonetheless.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Thu, 23 May 2024 14:39:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70508#M34078</guid>
      <dc:creator>ha2983</dc:creator>
      <dc:date>2024-05-23T14:39:28Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70510#M34080</link>
      <description>&lt;P&gt;Yes, I did the same. However, so we have to switch the code from local (VS Code) implementation to Databricks runs (Jobs/Workflow).&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&amp;nbsp;: Could you check this new issue?&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 23 May 2024 14:46:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70510#M34080</guid>
      <dc:creator>dbruehlmeier</dc:creator>
      <dc:date>2024-05-23T14:46:09Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70513#M34081</link>
      <description>&lt;P&gt;This Notebook can be used to recreate the issue:&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;import pandas as pd
from databricks.connect import DatabricksSession
from pyspark.sql.functions import current_timestamp

spark = DatabricksSession.builder.getOrCreate()


# Create a pandas DataFrame
data = {
    "Name": ["John", "Alice", "Bob"],
    "Age": [25, 30, 35],
    "City": ["New York", "San Francisco", "Los Angeles"],
}
df = pd.DataFrame(data)

# Convert pandas DataFrame to Spark DataFrame
spark_df = spark.createDataFrame(df)


spark_df = spark_df.withColumn("_loaded_at", current_timestamp())

spark_df.show()&lt;/LI-CODE&gt;&lt;P&gt;I used databricks runtime 14.3 LTS with single user access mode&lt;/P&gt;</description>
      <pubDate>Thu, 23 May 2024 14:55:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70513#M34081</guid>
      <dc:creator>ha2983</dc:creator>
      <dc:date>2024-05-23T14:55:29Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70560#M34093</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/105654"&gt;@ha2983&lt;/a&gt;&amp;nbsp; I can confirm, that I can recreate the issue with your notebook.&lt;/P&gt;&lt;P&gt;In my case the error&amp;nbsp;&lt;SPAN&gt;&lt;EM&gt;[NOT_COLUMN] Argument `col` should be a Column, got Column.&lt;/EM&gt; occurs, when calling&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;EM&gt;.withColumn("IngestionTimestamp", unix_timestamp&lt;/EM&gt;&lt;SPAN&gt;&lt;EM&gt;())&lt;/EM&gt; on a dataframe.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;I can reproduce this error using the example from&amp;nbsp;&lt;A href="https://spark.apache.org/docs/3.5.0/api/python/reference/pyspark.sql/api/pyspark.sql.functions.unix_timestamp.html" target="_blank" rel="noopener"&gt;https://spark.apache.org/docs/3.5.0/api/python/reference/pyspark.sql/api/pyspark.sql.functions.unix_timestamp.html&lt;/A&gt;&amp;nbsp;and a single user cluster (DBR 14.3 LTS):&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&lt;SPAN class=""&gt;from pyspark.sql.functions import unix_timestamp&lt;BR /&gt;&lt;BR /&gt;spark&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;conf&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;set&lt;/SPAN&gt;&lt;SPAN class=""&gt;(&lt;/SPAN&gt;&lt;SPAN class=""&gt;"spark.sql.session.timeZone"&lt;/SPAN&gt;&lt;SPAN class=""&gt;,&lt;/SPAN&gt; &lt;SPAN class=""&gt;"America/Los_Angeles"&lt;/SPAN&gt;&lt;SPAN class=""&gt;)&lt;/SPAN&gt; &lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&lt;SPAN class=""&gt;time_df&lt;/SPAN&gt; &lt;SPAN class=""&gt;=&lt;/SPAN&gt; &lt;SPAN class=""&gt;spark&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;createDataFrame&lt;/SPAN&gt;&lt;SPAN class=""&gt;([(&lt;/SPAN&gt;&lt;SPAN class=""&gt;'2015-04-08'&lt;/SPAN&gt;&lt;SPAN class=""&gt;,)],&lt;/SPAN&gt; &lt;SPAN class=""&gt;[&lt;/SPAN&gt;&lt;SPAN class=""&gt;'dt'&lt;/SPAN&gt;&lt;SPAN class=""&gt;])&lt;/SPAN&gt; &lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&lt;SPAN class=""&gt;time_df&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;select&lt;/SPAN&gt;&lt;SPAN class=""&gt;(&lt;/SPAN&gt;&lt;SPAN class=""&gt;unix_timestamp&lt;/SPAN&gt;&lt;SPAN class=""&gt;(&lt;/SPAN&gt;&lt;SPAN class=""&gt;'dt'&lt;/SPAN&gt;&lt;SPAN class=""&gt;,&lt;/SPAN&gt; &lt;SPAN class=""&gt;'yyyy-MM-dd'&lt;/SPAN&gt;&lt;SPAN class=""&gt;)&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;alias&lt;/SPAN&gt;&lt;SPAN class=""&gt;(&lt;/SPAN&gt;&lt;SPAN class=""&gt;'unix_time'&lt;/SPAN&gt;&lt;SPAN class=""&gt;))&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;collect&lt;/SPAN&gt;&lt;SPAN class=""&gt;()&lt;/SPAN&gt; &lt;SPAN class=""&gt;spark&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;conf&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;unset&lt;/SPAN&gt;&lt;SPAN class=""&gt;(&lt;/SPAN&gt;&lt;SPAN class=""&gt;"spark.sql.session.timeZone"&lt;/SPAN&gt;&lt;SPAN class=""&gt;)&lt;/SPAN&gt;&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;P&gt;&amp;gt;&amp;gt;&amp;gt;&amp;nbsp;&lt;SPAN&gt;[&lt;/SPAN&gt;&lt;A class="" href="https://docs.databricks.com/error-messages/error-classes.html#not_column_or_str" target="_blank" rel="noopener noreferrer"&gt;NOT_COLUMN_OR_STR&lt;/A&gt;&lt;SPAN&gt;]&lt;/SPAN&gt;&lt;SPAN&gt; Argument `col` should be a Column or str, got Column.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;On a shared cluster the code above works.&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/39295"&gt;@dbruehlmeier&lt;/a&gt;&amp;nbsp;we are also using vscode for local development and create our spark session like this:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;from&lt;/SPAN&gt; &lt;SPAN&gt;databricks&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;connect&lt;/SPAN&gt; &lt;SPAN&gt;import&lt;/SPAN&gt; &lt;SPAN&gt;DatabricksSession&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;spark&lt;/SPAN&gt; &lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;DatabricksSession&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;builder&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;getOrCreate&lt;/SPAN&gt;&lt;SPAN&gt;()&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Fri, 24 May 2024 04:49:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70560#M34093</guid>
      <dc:creator>RobinK</dc:creator>
      <dc:date>2024-05-24T04:49:57Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70570#M34096</link>
      <description>&lt;P&gt;Update:&lt;/P&gt;&lt;P&gt;Removing the following code from all of our notebook fixed the error:&lt;/P&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;EM&gt;&lt;SPAN class=""&gt;&lt;SPAN class=""&gt;from&lt;/SPAN&gt; &lt;SPAN class=""&gt;databricks.connect&lt;/SPAN&gt; &lt;SPAN class=""&gt;import&lt;/SPAN&gt; &lt;SPAN class=""&gt;DatabricksSession&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/EM&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;EM&gt;&lt;SPAN class=""&gt;spark&lt;/SPAN&gt; &lt;SPAN class=""&gt;=&lt;/SPAN&gt; &lt;SPAN class=""&gt;DatabricksSession&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;builder&lt;/SPAN&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;SPAN class=""&gt;getOrCreate&lt;/SPAN&gt;&lt;SPAN class=""&gt;()&lt;/SPAN&gt;&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV class=""&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class=""&gt;&lt;SPAN class=""&gt;I have found a line about SparkSessions in the change logs of databrick-connect:&amp;nbsp;&lt;A href="https://docs.databricks.com/en/release-notes/dbconnect/index.html#databricks-connect-1432-python" target="_blank" rel="noopener"&gt;https://docs.databricks.com/en/release-notes/dbconnect/index.html#databricks-connect-1432-python&lt;/A&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV class=""&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class=""&gt;&lt;SPAN class=""&gt;But this still doesn't answer the question, why the error occured in an environment, that did not change at all for us (same DBR Version, same cluster, same code).&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV class=""&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class=""&gt;&lt;SPAN class=""&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&amp;nbsp;maybe you could research if there where any internal updates?&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Fri, 24 May 2024 07:44:26 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70570#M34096</guid>
      <dc:creator>RobinK</dc:creator>
      <dc:date>2024-05-24T07:44:26Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70590#M34101</link>
      <description>&lt;P&gt;we are experiencing the exact same issues. But we do not even create the spark session explicitly. Are there any other fixes to this?&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 24 May 2024 11:15:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70590#M34101</guid>
      <dc:creator>dbx-user7354</dc:creator>
      <dc:date>2024-05-24T11:15:55Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70631#M34111</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;We are also experiencing the same error message &lt;SPAN&gt;&lt;EM&gt;[NOT_COLUMN] Argument `col` should be a Column, got Column&lt;/EM&gt;&lt;/SPAN&gt;&lt;BR /&gt;This occurs when a workflow is run as a task from another workflow, but not when said workflow is run on its own, that is not triggered by another workflow. The problem seems to be connected to the Databricks Runtime, in 14.3 LTS the workflow fails with said error, as a temporary workaround we switched the job clusters to Runtime 13.3 LTS, this seems to be working.&lt;/P&gt;&lt;P&gt;Any update on this bug is highly appreciated as it affects our production environment.&lt;/P&gt;&lt;P&gt;Best regards&lt;BR /&gt;Markus&lt;/P&gt;</description>
      <pubDate>Sat, 25 May 2024 20:12:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70631#M34111</guid>
      <dc:creator>UniBart</dc:creator>
      <dc:date>2024-05-25T20:12:49Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70992#M34202</link>
      <description>&lt;P&gt;We just had the exact same issue and it broke all our jobs in production, any update on this bug would be appreciated. We had failures in Databricks Runtime 15.1 and we fixed by moving all the jobs' clusters to 15.2&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 29 May 2024 09:21:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/70992#M34202</guid>
      <dc:creator>Attol8</dc:creator>
      <dc:date>2024-05-29T09:21:09Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/71060#M34235</link>
      <description>&lt;P&gt;I do not believe this is solved, similar to a comment over here:&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.databricks.com/t5/data-engineering/databrickssession-broken-for-15-1/td-p/70585" target="_blank" rel="noopener"&gt;https://community.databricks.com/t5/data-engineering/databrickssession-broken-for-15-1/td-p/70585&lt;/A&gt;&lt;/P&gt;&lt;P&gt;We are also seeing this error in 14.3 LTS from a simple example:&lt;/P&gt;&lt;DIV class=""&gt;&lt;DIV&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;from pyspark.sql.functions import col&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;df = spark.table('things')&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;things = df.select(col('thing_id')).collect()&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;[NOT_COLUMN_OR_STR] Argument `col` should be a Column or str, got Column.&lt;/FONT&gt;&lt;/P&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Wed, 29 May 2024 21:51:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/71060#M34235</guid>
      <dc:creator>jcap</dc:creator>
      <dc:date>2024-05-29T21:51:21Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Jobs do not run on job compute but on shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/74946#M34825</link>
      <description>&lt;P&gt;I think Databricks fixed the issue. Now it works with 14.3 LTS and single user access mode&lt;/P&gt;</description>
      <pubDate>Wed, 19 Jun 2024 09:21:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-jobs-do-not-run-on-job-compute-but-on-shared-compute/m-p/74946#M34825</guid>
      <dc:creator>dbruehlmeier</dc:creator>
      <dc:date>2024-06-19T09:21:13Z</dc:date>
    </item>
  </channel>
</rss>

