<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Genie code meets autoresearch in MVP Articles</title>
    <link>https://community.databricks.com/t5/mvp-articles/genie-code-meets-autoresearch/m-p/154672#M160</link>
    <description>&lt;P&gt;Genie code is Claude fine-tuned with Databricks knowledge. I think it works quite well. Feel free to share your input and output&lt;/P&gt;</description>
    <pubDate>Wed, 15 Apr 2026 17:25:22 GMT</pubDate>
    <dc:creator>jasonyip</dc:creator>
    <dc:date>2026-04-15T17:25:22Z</dc:date>
    <item>
      <title>Genie code meets autoresearch</title>
      <link>https://community.databricks.com/t5/mvp-articles/genie-code-meets-autoresearch/m-p/151487#M123</link>
      <description>&lt;P&gt;Databricks recently released Genie Code. It sounds very promising. We know it’s a Databricks product and we wouldn’t be surprised if I tell you it works within Databricks. In this experiment, we want to see how much effort does it take to migrate someone else’s code and onboard to Databricks. We will take the latest &lt;A href="https://github.com/karpathy/autoresearch" target="_blank" rel="noopener"&gt;autoresearch&lt;/A&gt; by Andrej Karpathy.&lt;/P&gt;&lt;P&gt;Having done a similar migration last year with Gemini (&lt;A href="https://www.tredence.com/blog/game-arena-deepmind-on-databricks" target="_blank" rel="noopener"&gt;https://www.tredence.com/blog/game-arena-deepmind-on-databricks&lt;/A&gt;), I know it will take about an evening’s work to migrate with LLMs that don’t understand Databricks, at least back in the days when skills were not available.&lt;/P&gt;&lt;P&gt;While I am confident that Genie code will live up to the expectation, I want to measure its effectiveness by the following:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;How many prompts do I need to use to achieve my goal?&lt;/LI&gt;&lt;LI&gt;How long is the prompt that I need to write? Do I need to create a very detailed project plan? Or will it just read my mind?&lt;/LI&gt;&lt;LI&gt;Will the first run be successful?&lt;/LI&gt;&lt;LI&gt;Does it recover from errors automatically?&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Let’s find out.&lt;/P&gt;&lt;P&gt;First of all, we need to clone the repo, and I am glad to do this step manually: &lt;A href="https://github.com/karpathy/autoresearch" target="_blank" rel="noopener"&gt;https://github.com/karpathy/autoresearch&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Picture1.png" style="width: 652px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/25057i0A32B917A29EA5BA/image-size/large?v=v2&amp;amp;px=999" role="button" title="Picture1.png" alt="Picture1.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The second step is to “design” a prompt. I’d say this worries me, but I tried to start with something simple and continue to improve it. Below is the prompt I used:&lt;/P&gt;&lt;P&gt;“Can you convert autoresearch using databricks FMAPI and run a research topic.&lt;/P&gt;&lt;P&gt;Make sure you track the chain of thoughts using MLflow and log the experiment.”&lt;/P&gt;&lt;P&gt;Having used many agents in the past year, I know for a fact that the agent won’t know a few things:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;The syntax for connecting to FMAPI&lt;/LI&gt;&lt;LI&gt;It is not aware of the UC version of MLflow&lt;/LI&gt;&lt;LI&gt;It definitely doesn’t know tracing, which I didn’t mention in the prompt&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;With the code and the above prompt. I hit enter. And I’d expect some clarifications and failures, which even the most senior data scientist would ask. For example:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Where is the autoresearch code located?&lt;/LI&gt;&lt;LI&gt;Where is the endpoint located?&lt;/LI&gt;&lt;LI&gt;Which function do I want to log?&lt;/LI&gt;&lt;LI&gt;Where should I log the experiment?&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;But we will let Genie Code figure it out or allow it to ask follow-up questions. Below is the output:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Picture1.png" style="width: 936px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/25058iE2D2FDECED618300/image-size/large?v=v2&amp;amp;px=999" role="button" title="Picture1.png" alt="Picture1.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;It took a while, but as we can see, everything ran automatically, and it was able to find the code and modify it.&lt;/P&gt;&lt;P&gt;The results? Yes, it was able to run an experiment and log those traces into MLflow! We can see below that it ran for 5 iterations and produced a result in the 6th step, which is clean and organized. It also did some testing by itself in the first step to ensure the logging is working.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Picture1.png" style="width: 936px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/25059iE6541B6BEA1A9536/image-size/large?v=v2&amp;amp;px=999" role="button" title="Picture1.png" alt="Picture1.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;We can open up the final step and see the research output!&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Picture1.png" style="width: 936px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/25060i86502E3EA6DEA87B/image-size/large?v=v2&amp;amp;px=999" role="button" title="Picture1.png" alt="Picture1.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;What’s more, Genie Code also summarizes the iterations to determine which ones to keep and which to drop, aligning with the ReadMe.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Picture1.png" style="width: 814px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/25061i029B0FB8EAAD2EE7/image-size/large?v=v2&amp;amp;px=999" role="button" title="Picture1.png" alt="Picture1.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Conclusion&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;We now have the answer. Genie Code is not a simple rebrand from Assistant. It’s built for deep Databricks understanding. To answer my own questions:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;How many prompts do I need to use to achieve my goal?&lt;BR /&gt;One prompt only. Did not try to improve it.&lt;/LI&gt;&lt;LI&gt;How long is the prompt that I need to write? Do I need to create a very detailed project plan? Or will it just read my mind?&lt;BR /&gt;I wrote two sentences. Very short.&lt;/LI&gt;&lt;LI&gt;Will the first run be successful?&lt;BR /&gt;Yes – I can see the output&lt;/LI&gt;&lt;LI&gt;Does it recover from errors automatically?&lt;BR /&gt;Yes – Genie code automatically recovers from error and determines the next step!&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;This is a huge step forward for any code migration and onboarding to Databricks. No prior expert knowledge is required, and a single prompt is all you need!&lt;/P&gt;</description>
      <pubDate>Fri, 20 Mar 2026 06:14:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/mvp-articles/genie-code-meets-autoresearch/m-p/151487#M123</guid>
      <dc:creator>jasonyip</dc:creator>
      <dc:date>2026-03-20T06:14:23Z</dc:date>
    </item>
    <item>
      <title>Re: Genie code meets autoresearch</title>
      <link>https://community.databricks.com/t5/mvp-articles/genie-code-meets-autoresearch/m-p/154578#M159</link>
      <description>&lt;P&gt;does your Genie Code need customization? I try the same prompts but my Genie Code seems completely idiotic to perform the task.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 14 Apr 2026 20:26:37 GMT</pubDate>
      <guid>https://community.databricks.com/t5/mvp-articles/genie-code-meets-autoresearch/m-p/154578#M159</guid>
      <dc:creator>simonhuang</dc:creator>
      <dc:date>2026-04-14T20:26:37Z</dc:date>
    </item>
    <item>
      <title>Re: Genie code meets autoresearch</title>
      <link>https://community.databricks.com/t5/mvp-articles/genie-code-meets-autoresearch/m-p/154672#M160</link>
      <description>&lt;P&gt;Genie code is Claude fine-tuned with Databricks knowledge. I think it works quite well. Feel free to share your input and output&lt;/P&gt;</description>
      <pubDate>Wed, 15 Apr 2026 17:25:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/mvp-articles/genie-code-meets-autoresearch/m-p/154672#M160</guid>
      <dc:creator>jasonyip</dc:creator>
      <dc:date>2026-04-15T17:25:22Z</dc:date>
    </item>
  </channel>
</rss>

