<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Having a different performance while use GPU and CPU in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/having-a-different-performance-while-use-gpu-and-cpu/m-p/129211#M4249</link>
    <description>&lt;P&gt;Hi Sir,&lt;/P&gt;&lt;P&gt;Regarding your question &lt;EM&gt;"what you are using in your RF i.e. hyperparameters, depths etc."&lt;/EM&gt;, I would like to share the following points about my training process:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;Using &lt;STRONG&gt;30,000 features&lt;/STRONG&gt; for the TF-IDF matrix&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Setting &lt;STRONG&gt;n_estimators = 500&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Since I am working on a &lt;STRONG&gt;serverless environment in Databricks&lt;/STRONG&gt;, I am not quite sure what you meant by &lt;EM&gt;"CPU based cluster with minor tweak using parallelism"&lt;/EM&gt;. Could you please provide more details on this so I can better understand your point?&lt;/P&gt;&lt;P&gt;Thank you for your support.&lt;/P&gt;&lt;P&gt;Best regards,&lt;/P&gt;</description>
    <pubDate>Fri, 22 Aug 2025 08:59:57 GMT</pubDate>
    <dc:creator>dangkhai</dc:creator>
    <dc:date>2025-08-22T08:59:57Z</dc:date>
    <item>
      <title>Having a different performance while use GPU and CPU</title>
      <link>https://community.databricks.com/t5/machine-learning/having-a-different-performance-while-use-gpu-and-cpu/m-p/129186#M4247</link>
      <description>&lt;P&gt;I'm building a model that is mostly sklearn libraries, but I'm also using TF-IDF and RandomForest. In theory, they only need a CPU to work properly, but in fact, when I use a physical computer with about 32 GB of RAM, it runs very fast. There are some strange things that happen when:&lt;BR /&gt;- I run the notebook with a compute CPU - 16 GB RAM - 4 cores on databricks and the result returns about 0.15s. However, my colleague used a calculator with the same configuration and ran it for about &amp;gt; 40s. This is the first oddity, the context is the same source code, the same model, the same kind of compute but there's a difference when the model predicts as above.&lt;BR /&gt;- When I perform the above model serving with CPU 0-64 cons, the model seems to have insufficient resources to predict and does not respond back to me. I don't think my model would work with that much resources. When I switched to the GPU, it worked as expected. Checking GPU Usage or memory is absolutely zero. My model is not using GPUs but it only works well on GPUs.&lt;/P&gt;&lt;P&gt;Can anyone answer the above questions for me?&lt;/P&gt;</description>
      <pubDate>Fri, 22 Aug 2025 03:30:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/having-a-different-performance-while-use-gpu-and-cpu/m-p/129186#M4247</guid>
      <dc:creator>dangkhai</dc:creator>
      <dc:date>2025-08-22T03:30:05Z</dc:date>
    </item>
    <item>
      <title>Re: Having a different performance while use GPU and CPU</title>
      <link>https://community.databricks.com/t5/machine-learning/having-a-different-performance-while-use-gpu-and-cpu/m-p/129207#M4248</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/180571"&gt;@dangkhai&lt;/a&gt;&amp;nbsp;: I don't have much insight on your configuration used in your model. It all depends on the configuration what you are using in your RF i.e hyperparameters, depths etc. As you clearly mention that you are getting good resulted in GPU but not in CPU.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can you try running your model with CPU based cluster with minor tweak using parallelism and see if that work out?&lt;/P&gt;</description>
      <pubDate>Fri, 22 Aug 2025 08:25:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/having-a-different-performance-while-use-gpu-and-cpu/m-p/129207#M4248</guid>
      <dc:creator>BR_DatabricksAI</dc:creator>
      <dc:date>2025-08-22T08:25:22Z</dc:date>
    </item>
    <item>
      <title>Re: Having a different performance while use GPU and CPU</title>
      <link>https://community.databricks.com/t5/machine-learning/having-a-different-performance-while-use-gpu-and-cpu/m-p/129211#M4249</link>
      <description>&lt;P&gt;Hi Sir,&lt;/P&gt;&lt;P&gt;Regarding your question &lt;EM&gt;"what you are using in your RF i.e. hyperparameters, depths etc."&lt;/EM&gt;, I would like to share the following points about my training process:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;Using &lt;STRONG&gt;30,000 features&lt;/STRONG&gt; for the TF-IDF matrix&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Setting &lt;STRONG&gt;n_estimators = 500&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Since I am working on a &lt;STRONG&gt;serverless environment in Databricks&lt;/STRONG&gt;, I am not quite sure what you meant by &lt;EM&gt;"CPU based cluster with minor tweak using parallelism"&lt;/EM&gt;. Could you please provide more details on this so I can better understand your point?&lt;/P&gt;&lt;P&gt;Thank you for your support.&lt;/P&gt;&lt;P&gt;Best regards,&lt;/P&gt;</description>
      <pubDate>Fri, 22 Aug 2025 08:59:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/having-a-different-performance-while-use-gpu-and-cpu/m-p/129211#M4249</guid>
      <dc:creator>dangkhai</dc:creator>
      <dc:date>2025-08-22T08:59:57Z</dc:date>
    </item>
    <item>
      <title>Re: Having a different performance while use GPU and CPU</title>
      <link>https://community.databricks.com/t5/machine-learning/having-a-different-performance-while-use-gpu-and-cpu/m-p/129226#M4250</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/180571"&gt;@dangkhai&lt;/a&gt;&amp;nbsp;: Have the below link and see parallelism enabled during transformation.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/rafaelvalero/ParallelTextProcessing/blob/master/parallelizing_text_processing.ipynb" target="_blank"&gt;ParallelTextProcessing/parallelizing_text_processing.ipynb at master · rafaelvalero/ParallelTextProcessing&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 22 Aug 2025 09:17:58 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/having-a-different-performance-while-use-gpu-and-cpu/m-p/129226#M4250</guid>
      <dc:creator>BR_DatabricksAI</dc:creator>
      <dc:date>2025-08-22T09:17:58Z</dc:date>
    </item>
  </channel>
</rss>

