<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Building a Theoretical Solar Flare Intelligence System for the Databricks Free Edition Hackathon in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/building-a-theoretical-solar-flare-intelligence-system-for-the/m-p/139114#M51100</link>
    <description>&lt;P&gt;Fabulous submission&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/196686"&gt;@zoe_unifeye&lt;/a&gt;&amp;nbsp;and good luck with hackathon.&lt;/P&gt;</description>
    <pubDate>Fri, 14 Nov 2025 15:36:50 GMT</pubDate>
    <dc:creator>Raman_Unifeye</dc:creator>
    <dc:date>2025-11-14T15:36:50Z</dc:date>
    <item>
      <title>Building a Theoretical Solar Flare Intelligence System for the Databricks Free Edition Hackathon</title>
      <link>https://community.databricks.com/t5/data-engineering/building-a-theoretical-solar-flare-intelligence-system-for-the/m-p/138956#M51063</link>
      <description>&lt;P class=""&gt;I recently built a Theoretical Solar Flare Grid Impact Intelligence System for the Databricks Free Edition Hackathon 2025, and I wanted to share my journey building an end-to-end data engineering and ML solution on Databricks Free Edition.&lt;/P&gt;&lt;H2&gt;&lt;STRONG&gt;Finding the Problem to Solve&lt;/STRONG&gt;&lt;/H2&gt;&lt;P class=""&gt;This isn't often the way round most people are used to working - usually a problem presents itself and you're tasked with finding the solution. However being a hackathon with a broad scope I thought I'd flip that and find a problem to solve. As a data engineer I love to solve problems, understand the core concepts of what ever is it I'm looking at and build a full picture. It's this aspect that also draws me to physics in my spare time so I thought I'd use this hackathon as way to dive into some data I find genuinly interesting - solar flares. I'm not an expert on solar flares but with some knowledge that solar flares&amp;nbsp;can wreak havoc on electricals - especially with aging grid infrastructure, I wondered "what if we could predict these events days in advance and prepare for outages?" - et voila, the problem to solve.&lt;/P&gt;&lt;H2&gt;&lt;STRONG&gt;Buidling the Solution to the Problem&lt;/STRONG&gt;&lt;/H2&gt;&lt;H3&gt;&lt;STRONG&gt;The Soluiton&lt;/STRONG&gt;&lt;/H3&gt;&lt;P class=""&gt;A system that combines NASA space weather data with power grid monitoring data to provide predictive intelligence for grid operators using natural language query interface.&lt;/P&gt;&lt;H3&gt;&lt;STRONG&gt;Architecture Overview&lt;/STRONG&gt;&lt;/H3&gt;&lt;P class=""&gt;&lt;STRONG&gt;Delta Live Tables Pipeline (Medallion Architecture):&lt;/STRONG&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;span class="lia-unicode-emoji" title=":3rd_place_medal:"&gt;🥉&lt;/span&gt; &lt;STRONG&gt;Bronze Layer:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL class=""&gt;&lt;LI&gt;Ingested NASA space weather observations (solar flare class, intensity, timing)&lt;/LI&gt;&lt;LI&gt;Ingested power grid fault detection data (voltage, current, temperature, health scores)&lt;/LI&gt;&lt;LI&gt;Used Auto Loader for streaming data ingestion&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;&lt;span class="lia-unicode-emoji" title=":2nd_place_medal:"&gt;🥈&lt;/span&gt; &lt;STRONG&gt;Silver Layer:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL class=""&gt;&lt;LI&gt;Implemented data quality expectations with &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/97035"&gt;@Dlt&lt;/a&gt;.expect_or_drop()&lt;/LI&gt;&lt;LI&gt;Validated timestamps, flare classifications, voltage ranges, temperature limits&lt;/LI&gt;&lt;LI&gt;Enriched data with severity classifications and temporal features&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;&lt;span class="lia-unicode-emoji" title=":1st_place_medal:"&gt;🥇&lt;/span&gt; &lt;STRONG&gt;Gold Layer:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL class=""&gt;&lt;LI&gt;Created correlation tables joining solar and grid data by date&lt;/LI&gt;&lt;LI&gt;Added temporal lag features (same-day, next-day, 2-3 days later) to capture delayed geomagnetic effects&lt;/LI&gt;&lt;LI&gt;Built ML-enriched tables with predictions and probability forecasts&lt;/LI&gt;&lt;/UL&gt;&lt;H3&gt;&lt;STRONG&gt;Key Features&lt;/STRONG&gt;&lt;/H3&gt;&lt;P class=""&gt;&lt;STRONG&gt;1. Correlation Analysis:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL class=""&gt;&lt;LI&gt;Tracked how solar flare intensity correlates with grid voltage drops, temperature rises, and equipment health degradation&lt;/LI&gt;&lt;LI&gt;Implemented time-lagged features since geomagnetic storms take 24-72 hours to fully develop&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;&lt;STRONG&gt;2. ML Predictions:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL class=""&gt;&lt;LI&gt;Created scenario-based predictions: Quiet Sun → Severe X-class storms&lt;/LI&gt;&lt;LI&gt;Generated 7-day forecasts with risk levels and specific operational recommendations&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;&lt;STRONG&gt;3. Probabilistic Forecasting:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL class=""&gt;&lt;LI&gt;Calculated historical frequency of different flare classes (B, C, M, X)&lt;/LI&gt;&lt;LI&gt;Created probability forecasts for next 7 days&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;&lt;STRONG&gt;4. AI/BI Genie Integration:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL class=""&gt;&lt;LI&gt;Set up natural language query interface for grid operators&lt;/LI&gt;&lt;LI&gt;Sample queries:&lt;UL class=""&gt;&lt;LI&gt;"What happens if we get a severe X-class solar storm tomorrow?"&lt;/LI&gt;&lt;LI&gt;"Show me the most likely solar scenarios for the next 7 days"&lt;/LI&gt;&lt;LI&gt;"At what flare intensity should we activate emergency protocols?"&lt;/LI&gt;&lt;LI&gt;"Visualize daily faults and types"&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;H3&gt;Here's what the theoretical grid operators would get:&lt;/H3&gt;&lt;UL class=""&gt;&lt;LI&gt;&lt;STRONG&gt;7-day forecasts&lt;/STRONG&gt; that show probability estimates for different solar scenarios (from quiet sun to severe X-class storms)&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Clear risk thresholds&lt;/STRONG&gt; - no guessing about when to escalate from "keep an eye on it" to "activate emergency protocols"&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Specific action plans&lt;/STRONG&gt; - not vague warnings, but concrete steps like "pre-position repair crews at substations" or "alert hospitals about potential outages"&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Anomaly detection&lt;/STRONG&gt; that flags unusual patterns - days when something weird is happening that needs investigation&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Natural language queries&lt;/STRONG&gt; via Genie - operators can ask questions in plain English and get instant answers&lt;/LI&gt;&lt;/UL&gt;&lt;H3&gt;&lt;STRONG&gt;Tech Stack&lt;/STRONG&gt;&lt;/H3&gt;&lt;P class=""&gt;Built entirely on Databricks Free Edition using:&lt;/P&gt;&lt;UL class=""&gt;&lt;LI&gt;&lt;STRONG&gt;Delta Live Tables&lt;/STRONG&gt; for the pipeline orchestration&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Auto Loader&lt;/STRONG&gt; for streaming data ingestion&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;PySpark&lt;/STRONG&gt; for data transformations&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;AI/BI Genie&lt;/STRONG&gt; for natural language queries&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Python ML libraries&lt;/STRONG&gt; (RandomForest) for the predictive modeling&lt;/LI&gt;&lt;/UL&gt;&lt;H3&gt;&lt;STRONG&gt;The Journey: Finding Problems Worth Solving&lt;/STRONG&gt;&lt;/H3&gt;&lt;P class=""&gt;This hackathon gave me the freedom to work backwards - starting with fascinating data (solar flares) and discovering a problem worth solving (grid vulnerability). It's not the typical workflow, but it reminded me why I became a data engineer in the first place: curiosity about how systems work and the drive to build solutions that matter.&lt;BR /&gt;&lt;BR /&gt;You can watch the 5 minute demo that I entered into the Hackathon here:&lt;/P&gt;&lt;P class=""&gt;&lt;A href="https://youtu.be/HHkr4vfzD2M" target="_self"&gt;&lt;FONT size="3"&gt;Databricks Free Edition Hackathon: Theoretical Solar Flare Grid Impact Intelligence System&lt;/FONT&gt;&lt;/A&gt;&lt;/P&gt;&lt;P class="" data-unlink="true"&gt;&lt;SPAN&gt;Thanks for reading! &amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 13 Nov 2025 17:35:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/building-a-theoretical-solar-flare-intelligence-system-for-the/m-p/138956#M51063</guid>
      <dc:creator>zoe_unifeye</dc:creator>
      <dc:date>2025-11-13T17:35:07Z</dc:date>
    </item>
    <item>
      <title>Re: Building a Theoretical Solar Flare Intelligence System for the Databricks Free Edition Hackathon</title>
      <link>https://community.databricks.com/t5/data-engineering/building-a-theoretical-solar-flare-intelligence-system-for-the/m-p/139114#M51100</link>
      <description>&lt;P&gt;Fabulous submission&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/196686"&gt;@zoe_unifeye&lt;/a&gt;&amp;nbsp;and good luck with hackathon.&lt;/P&gt;</description>
      <pubDate>Fri, 14 Nov 2025 15:36:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/building-a-theoretical-solar-flare-intelligence-system-for-the/m-p/139114#M51100</guid>
      <dc:creator>Raman_Unifeye</dc:creator>
      <dc:date>2025-11-14T15:36:50Z</dc:date>
    </item>
  </channel>
</rss>

