<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic workflow fails when ran using a job cluster but not in shared cluster in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/workflow-fails-when-ran-using-a-job-cluster-but-not-in-shared/m-p/95861#M39183</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;We have a workflow which calls 3 notebooks and when we run this workflow using shared cluster it runs fine, but when ran with job cluster , one of the notebooks fail.This notebook uses&amp;nbsp;sql function Lpad and looks like it errors because of it. Has anyone experienced anything similar.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Shalini&lt;/P&gt;</description>
    <pubDate>Thu, 24 Oct 2024 07:31:11 GMT</pubDate>
    <dc:creator>ShaliniC</dc:creator>
    <dc:date>2024-10-24T07:31:11Z</dc:date>
    <item>
      <title>workflow fails when ran using a job cluster but not in shared cluster</title>
      <link>https://community.databricks.com/t5/data-engineering/workflow-fails-when-ran-using-a-job-cluster-but-not-in-shared/m-p/95861#M39183</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;We have a workflow which calls 3 notebooks and when we run this workflow using shared cluster it runs fine, but when ran with job cluster , one of the notebooks fail.This notebook uses&amp;nbsp;sql function Lpad and looks like it errors because of it. Has anyone experienced anything similar.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Shalini&lt;/P&gt;</description>
      <pubDate>Thu, 24 Oct 2024 07:31:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/workflow-fails-when-ran-using-a-job-cluster-but-not-in-shared/m-p/95861#M39183</guid>
      <dc:creator>ShaliniC</dc:creator>
      <dc:date>2024-10-24T07:31:11Z</dc:date>
    </item>
    <item>
      <title>Re: workflow fails when ran using a job cluster but not in shared cluster</title>
      <link>https://community.databricks.com/t5/data-engineering/workflow-fails-when-ran-using-a-job-cluster-but-not-in-shared/m-p/96036#M39200</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/129041"&gt;@ShaliniC&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;The possible causes are:&lt;BR /&gt;1. The Databricks Runtime version of the job cluster is different than the one of the the shared cluster.&lt;/P&gt;&lt;P&gt;2. The libraries installed on the shared cluster are missing in the job cluster.&lt;BR /&gt;&lt;BR /&gt;Could you check the what are the above configuration changes and adjust the job cluster to the configuration of the shared cluster?&lt;BR /&gt;&lt;BR /&gt;If it does not help, could you please share the details of the error message?&lt;/P&gt;</description>
      <pubDate>Thu, 24 Oct 2024 18:26:39 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/workflow-fails-when-ran-using-a-job-cluster-but-not-in-shared/m-p/96036#M39200</guid>
      <dc:creator>filipniziol</dc:creator>
      <dc:date>2024-10-24T18:26:39Z</dc:date>
    </item>
    <item>
      <title>Re: workflow fails when ran using a job cluster but not in shared cluster</title>
      <link>https://community.databricks.com/t5/data-engineering/workflow-fails-when-ran-using-a-job-cluster-but-not-in-shared/m-p/96084#M39213</link>
      <description>&lt;P&gt;notebooks are executing sequentially or parallel in this workflow?&lt;/P&gt;</description>
      <pubDate>Fri, 25 Oct 2024 07:06:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/workflow-fails-when-ran-using-a-job-cluster-but-not-in-shared/m-p/96084#M39213</guid>
      <dc:creator>saurabh18cs</dc:creator>
      <dc:date>2024-10-25T07:06:21Z</dc:date>
    </item>
    <item>
      <title>Re: workflow fails when ran using a job cluster but not in shared cluster</title>
      <link>https://community.databricks.com/t5/data-engineering/workflow-fails-when-ran-using-a-job-cluster-but-not-in-shared/m-p/96941#M39366</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;The databricks runtime version and the installed libraries are the same on shared and job cluster. We had also raised a case with databricks support for the same. where they identified the error to be a syntax error in lpad function which we have used.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here is the explanation for reason for difference between clusters , which we have received from databricks&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Single access mode compute uses Classic Pyspark and Shared clusters use the secure SparkConnect interface for python API, So likely the behaviour is inconsistent between SparkConnect and Classic PySpark.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Thanks,Shalini&lt;/P&gt;</description>
      <pubDate>Thu, 31 Oct 2024 09:20:37 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/workflow-fails-when-ran-using-a-job-cluster-but-not-in-shared/m-p/96941#M39366</guid>
      <dc:creator>ShaliniC</dc:creator>
      <dc:date>2024-10-31T09:20:37Z</dc:date>
    </item>
    <item>
      <title>Re: workflow fails when ran using a job cluster but not in shared cluster</title>
      <link>https://community.databricks.com/t5/data-engineering/workflow-fails-when-ran-using-a-job-cluster-but-not-in-shared/m-p/96943#M39367</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;the error occured in the parallel run.&lt;/P&gt;&lt;P&gt;We had also raised a case with databricks support for the same. where they identified the error to be a syntax fail in lpad function which we have used.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here is the explanation for reason for difference between clusters , which we have received from databricks&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Single access mode compute uses Classic Pyspark and Shared clusters use the secure SparkConnect interface for python API, So likely the behaviour is inconsistent between SparkConnect and Classic PySpark.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Thanks,Shalini&lt;/P&gt;</description>
      <pubDate>Thu, 31 Oct 2024 09:22:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/workflow-fails-when-ran-using-a-job-cluster-but-not-in-shared/m-p/96943#M39367</guid>
      <dc:creator>ShaliniC</dc:creator>
      <dc:date>2024-10-31T09:22:47Z</dc:date>
    </item>
  </channel>
</rss>

