<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: What are the best ways to implement transcription in podcast apps? in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/what-are-the-best-ways-to-implement-transcription-in-podcast/m-p/140193#M11059</link>
    <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/167850"&gt;@ShaneCorn&lt;/a&gt;, great question &lt;span class="lia-unicode-emoji" title=":waving_hand:"&gt;👋&lt;/span&gt;&lt;/P&gt;&lt;P&gt;When you think about transcription for a podcast app &lt;EM&gt;with Databricks&lt;/EM&gt;, it helps to break it down into a simple pattern:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Ingest &amp;amp; store the audio&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Transcribe it with a speech model&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Enrich the transcript (chapters, speakers, topics)&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Expose everything for search &amp;amp; recommendations&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Databricks works well here because you can run this end-to-end on one platform.&lt;/P&gt;</description>
    <pubDate>Mon, 24 Nov 2025 14:06:31 GMT</pubDate>
    <dc:creator>bianca_unifeye</dc:creator>
    <dc:date>2025-11-24T14:06:31Z</dc:date>
    <item>
      <title>What are the best ways to implement transcription in podcast apps?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/what-are-the-best-ways-to-implement-transcription-in-podcast/m-p/140095#M11054</link>
      <description>&lt;P&gt;I am starting this discussion for everyone who can answer my query.&lt;/P&gt;</description>
      <pubDate>Mon, 24 Nov 2025 11:23:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/what-are-the-best-ways-to-implement-transcription-in-podcast/m-p/140095#M11054</guid>
      <dc:creator>ShaneCorn</dc:creator>
      <dc:date>2025-11-24T11:23:54Z</dc:date>
    </item>
    <item>
      <title>Re: What are the best ways to implement transcription in podcast apps?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/what-are-the-best-ways-to-implement-transcription-in-podcast/m-p/140193#M11059</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/167850"&gt;@ShaneCorn&lt;/a&gt;, great question &lt;span class="lia-unicode-emoji" title=":waving_hand:"&gt;👋&lt;/span&gt;&lt;/P&gt;&lt;P&gt;When you think about transcription for a podcast app &lt;EM&gt;with Databricks&lt;/EM&gt;, it helps to break it down into a simple pattern:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Ingest &amp;amp; store the audio&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Transcribe it with a speech model&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Enrich the transcript (chapters, speakers, topics)&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Expose everything for search &amp;amp; recommendations&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Databricks works well here because you can run this end-to-end on one platform.&lt;/P&gt;</description>
      <pubDate>Mon, 24 Nov 2025 14:06:31 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/what-are-the-best-ways-to-implement-transcription-in-podcast/m-p/140193#M11059</guid>
      <dc:creator>bianca_unifeye</dc:creator>
      <dc:date>2025-11-24T14:06:31Z</dc:date>
    </item>
    <item>
      <title>Re: What are the best ways to implement transcription in podcast apps?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/what-are-the-best-ways-to-implement-transcription-in-podcast/m-p/140210#M11060</link>
      <description>&lt;P&gt;&lt;!--  StartFragment   --&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;1. Use Speech-to-Text Models via &lt;/SPAN&gt;&lt;SPAN class=""&gt;MLflow&lt;/SPAN&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Integrate open-source models like &lt;/SPAN&gt;&lt;SPAN class=""&gt;OpenAI&lt;/SPAN&gt;&lt;SPAN class=""&gt; Whisper, Hugging Face Wav2Vec2, or &lt;/SPAN&gt;&lt;SPAN class=""&gt;AssemblyAI&lt;/SPAN&gt;&lt;SPAN class=""&gt; API.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Log the model in &lt;/SPAN&gt;&lt;SPAN class=""&gt;MLflow&lt;/SPAN&gt;&lt;SPAN class=""&gt; for versioning and reproducibility.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Deploy as a Databricks Model Serving endpoint for real-time transcription.&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;2. Leverage Serverless Compute for Audio Processing&lt;/SPAN&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Use Databricks Serverless Jobs or Delta Live Tables for batch transcription of podcast episodes.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Store audio files in Unity Catalog-managed storage.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Process audio in parallel using Spark UDFs or Pandas UDFs for distributed workloads.&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;3. Optimize with Delta Lake&lt;/SPAN&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Store transcriptions in Delta tables for efficient querying and analytics.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Add metadata like speaker info, timestamps, and confidence scores.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Enable Unity Catalog governance for secure access control.&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;4. Integrate External APIs for Accuracy&lt;/SPAN&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN class=""&gt;If you need high accuracy and language support, integrate APIs like:&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Azure Cognitive Services Speech-to-Text&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Google Cloud Speech&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;AWS Transcribe&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;5. Enhance with NLP for Summarization &amp;amp; Search&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;After transcription, apply NLP models for:&lt;/SPAN&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Summarization (using Hugging Face transformers)&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Keyword extraction&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Semantic search (via Databricks Vector Search)&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;6. Streaming for Live Podcasts&lt;/SPAN&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Use Structured Streaming with Auto Loader to ingest audio chunks.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Apply real-time transcription using a deployed &lt;/SPAN&gt;&lt;SPAN class=""&gt;MLflow&lt;/SPAN&gt;&lt;SPAN class=""&gt; model or external API.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Output to Delta tables or publish to Kafka for downstream apps.&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;7. Cost &amp;amp; Performance Tips&lt;/SPAN&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Use Spot instances or Photon runtime for compute efficiency.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Compress audio before processing.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN class=""&gt;Batch process episodes during off-peak hours.&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;!--  EndFragment   --&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 24 Nov 2025 16:02:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/what-are-the-best-ways-to-implement-transcription-in-podcast/m-p/140210#M11060</guid>
      <dc:creator>nayan_wylde</dc:creator>
      <dc:date>2025-11-24T16:02:00Z</dc:date>
    </item>
  </channel>
</rss>

