<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Geting error Caused by: com.databricks.NotebookExecutionException: FAILED in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/geting-error-caused-by-com-databricks-notebookexecutionexception/m-p/29606#M21329</link>
    <description>&lt;P&gt;&lt;/P&gt;&lt;P&gt;Hi @Dipak Bachhav​, not sure if you have fixed the issue, but here are few things you can check:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt; Is the path "/Users/mynotebook" correct? Maybe you are missing the dot in the beginning.&lt;/LI&gt;&lt;LI&gt;Run the notebook using dbutils.notebook.run("/Users/mynotebook") and see there are any errors.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 28 Nov 2022 18:47:09 GMT</pubDate>
    <dc:creator>sujai_sparks</dc:creator>
    <dc:date>2022-11-28T18:47:09Z</dc:date>
    <item>
      <title>Geting error Caused by: com.databricks.NotebookExecutionException: FAILED</title>
      <link>https://community.databricks.com/t5/data-engineering/geting-error-caused-by-com-databricks-notebookexecutionexception/m-p/29603#M21326</link>
      <description>&lt;P&gt;I am trying to run the below notebook through databricks but getting the below error. I have tried to update the notebook timeout and the retry mechanism but still no luck yet.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;NotebookData("/Users/mynotebook",9900, retry=3)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;]&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;res = parallelNotebooks(notebooks, 2)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;result = [f.result(timeout=9900) for f in res] # This is a blocking call.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;print(result)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Can someone please help me to sort out this issue? Thanks&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;%python&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;from concurrent.futures import ThreadPoolExecutor&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;class NotebookData:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;def __init__(self, path, timeout, parameters=None, retry=0):&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;self.path = path&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;self.timeout = timeout&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;self.parameters = parameters&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;self.retry = retry&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;def submitNotebook(notebook):&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;print("Running notebook %s" % notebook.path)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;try:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;if (notebook.parameters):&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;return dbutils.notebook.run(notebook.path, notebook.timeout, notebook.parameters)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;else:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;return dbutils.notebook.run(notebook.path, notebook.timeout)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;except Exception:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;if notebook.retry &amp;lt; 1:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;raise&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;print("Retrying notebook %s" % notebook.path)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;notebook.retry = notebook.retry - 1&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;submitNotebook(notebook)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;def parallelNotebooks(notebooks, numInParallel):&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;# This code limits the number of parallel notebooks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;with ThreadPoolExecutor(max_workers=numInParallel) as ec:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;return [ec.submit(submitNotebook, notebook) for notebook in notebooks]&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;notebooks = [&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;NotebookData("/Users/mynotebook",1200000, retry=0)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;]&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;res = parallelNotebooks(notebooks, 2)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;result = [f.result(timeout=1200000) for f in res] # This is a blocking call.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;print(result)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Error:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;Py4JJavaError: An error occurred while calling o1741._run.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;: com.databricks.WorkflowException: com.databricks.NotebookExecutionException: FAILED&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at com.databricks.workflow.WorkflowDriver.run(WorkflowDriver.scala:95)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at com.databricks.dbutils_v1.impl.NotebookUtilsImpl.run(NotebookUtilsImpl.scala:122)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at com.databricks.dbutils_v1.impl.NotebookUtilsImpl._run(NotebookUtilsImpl.scala:89)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at sun.reflect.GeneratedMethodAccessor820.invoke(Unknown Source)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at java.lang.reflect.Method.invoke(Method.java:498)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:244)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:380)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at py4j.Gateway.invoke(Gateway.java:295)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:132)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at py4j.commands.CallCommand.execute(CallCommand.java:79)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at py4j.GatewayConnection.run(GatewayConnection.java:251)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at java.lang.Thread.run(Thread.java:748)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;Caused by: com.databricks.NotebookExecutionException: FAILED&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at com.databricks.workflow.WorkflowDriver.run0(WorkflowDriver.scala:141)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	at com.databricks.workflow.WorkflowDriver.run(WorkflowDriver.scala:90)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;	... 12 more&lt;/P&gt;</description>
      <pubDate>Mon, 03 Oct 2022 14:25:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/geting-error-caused-by-com-databricks-notebookexecutionexception/m-p/29603#M21326</guid>
      <dc:creator>DipakBachhav</dc:creator>
      <dc:date>2022-10-03T14:25:48Z</dc:date>
    </item>
    <item>
      <title>Re: Geting error Caused by: com.databricks.NotebookExecutionException: FAILED</title>
      <link>https://community.databricks.com/t5/data-engineering/geting-error-caused-by-com-databricks-notebookexecutionexception/m-p/29604#M21327</link>
      <description>&lt;P&gt;Not sure what this code does, but spark executes job by job, so ThreadPoolExecutor doesn't make much sense. If you want to execute notebooks in parallel, please run them as separate jobs with a fair scheduler (so you reserve resources for each notebook - in first line sc.setLocalProperty("spark.scheduler.pool", "somename") when somename is unique for your parallel notebook execution)&lt;/P&gt;</description>
      <pubDate>Mon, 03 Oct 2022 19:42:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/geting-error-caused-by-com-databricks-notebookexecutionexception/m-p/29604#M21327</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2022-10-03T19:42:30Z</dc:date>
    </item>
    <item>
      <title>Re: Geting error Caused by: com.databricks.NotebookExecutionException: FAILED</title>
      <link>https://community.databricks.com/t5/data-engineering/geting-error-caused-by-com-databricks-notebookexecutionexception/m-p/29606#M21329</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;P&gt;Hi @Dipak Bachhav​, not sure if you have fixed the issue, but here are few things you can check:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt; Is the path "/Users/mynotebook" correct? Maybe you are missing the dot in the beginning.&lt;/LI&gt;&lt;LI&gt;Run the notebook using dbutils.notebook.run("/Users/mynotebook") and see there are any errors.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Nov 2022 18:47:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/geting-error-caused-by-com-databricks-notebookexecutionexception/m-p/29606#M21329</guid>
      <dc:creator>sujai_sparks</dc:creator>
      <dc:date>2022-11-28T18:47:09Z</dc:date>
    </item>
    <item>
      <title>Re: Geting error Caused by: com.databricks.NotebookExecutionException: FAILED</title>
      <link>https://community.databricks.com/t5/data-engineering/geting-error-caused-by-com-databricks-notebookexecutionexception/m-p/29607#M21330</link>
      <description>&lt;P&gt;Just a quibble here. It makes great sense to run notebooks in parallel and can greatly increase performance. On a relatively small cluster (4 nodes, 16 vCPUs total), I was able to cut my job's runtime in half.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;When you spawn multiple processes, it allows you to have better utilization of the cluster. In sequential notebook runs, there is often downtime for executors while the remaining tasks for a spark job are completing. Running multiple python processes in the driver node allows you to begin work on other spark tasks. &lt;/P&gt;</description>
      <pubDate>Wed, 15 Mar 2023 17:22:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/geting-error-caused-by-com-databricks-notebookexecutionexception/m-p/29607#M21330</guid>
      <dc:creator>cweymouth</dc:creator>
      <dc:date>2023-03-15T17:22:19Z</dc:date>
    </item>
  </channel>
</rss>

