<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: apply a function across multiple smaller dataframes created from one big dataframe in scala in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/apply-a-function-across-multiple-smaller-dataframes-created-from/m-p/3164#M323</link>
    <description>&lt;P&gt;Hi @Sandesh Puligundla​&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Great to meet you, and thanks for your question!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt; Let's see if your peers in the community have an answer to your question. Thanks.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 16 Jun 2023 06:10:06 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2023-06-16T06:10:06Z</dc:date>
    <item>
      <title>apply a function across multiple smaller dataframes created from one big dataframe in scala</title>
      <link>https://community.databricks.com/t5/data-engineering/apply-a-function-across-multiple-smaller-dataframes-created-from/m-p/3163#M322</link>
      <description>&lt;P&gt;The dataframe 'big_df' looks like the below&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;| id| index| timestamp|&lt;/P&gt;&lt;P&gt;|:---- |:------:| -----:|&lt;/P&gt;&lt;P&gt;| abc| 1| 11:00:00|&lt;/P&gt;&lt;P&gt;| abc| 1| 11:00:10|&lt;/P&gt;&lt;P&gt;| abc| 1| 11:00:20|&lt;/P&gt;&lt;P&gt;| abc| 1| 11:00:30|&lt;/P&gt;&lt;P&gt;| abc| 1| 11:00:40|&lt;/P&gt;&lt;P&gt;| abc| 1| 11:00:50|&lt;/P&gt;&lt;P&gt;| abc| 2| 11:01:00|&lt;/P&gt;&lt;P&gt;| abc| 2| 11:01:10|&lt;/P&gt;&lt;P&gt;| abc| 2| 11:01:20|&lt;/P&gt;&lt;P&gt;| def| 1| 23:00:00|&lt;/P&gt;&lt;P&gt;| def| 1| 23:01:00|&lt;/P&gt;&lt;P&gt;| xyz| 1| 15:00:00|&lt;/P&gt;&lt;P&gt;| xyz| 1| 15:01:00|&lt;/P&gt;&lt;P&gt;| xyz| 1| 15:02:00|&lt;/P&gt;&lt;P&gt;| xyz| 1| 15:03:00|&lt;/P&gt;&lt;P&gt;| xyz| 1| 15:04:00|&lt;/P&gt;&lt;P&gt;| xyz| 1| 15:05:00|&lt;/P&gt;&lt;P&gt;| xyz| 2| 15:06:00|&lt;/P&gt;&lt;P&gt;| xyz| 2| 15:07:00|&lt;/P&gt;&lt;P&gt;| xyz| 3| 15:10:00|&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;There is a function 'fun1' which takes a dataframe as input.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Each unique combination of columns 'id' and 'index' in big_df is a small dataframe that needs to be passed to the function fun1.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;How can this function be applied across multiple of the small dataframes in parallel?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Can it be achieved using the foreachpartition and if so how?&lt;/P&gt;</description>
      <pubDate>Tue, 13 Jun 2023 18:31:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/apply-a-function-across-multiple-smaller-dataframes-created-from/m-p/3163#M322</guid>
      <dc:creator>Sandesh87</dc:creator>
      <dc:date>2023-06-13T18:31:15Z</dc:date>
    </item>
    <item>
      <title>Re: apply a function across multiple smaller dataframes created from one big dataframe in scala</title>
      <link>https://community.databricks.com/t5/data-engineering/apply-a-function-across-multiple-smaller-dataframes-created-from/m-p/3164#M323</link>
      <description>&lt;P&gt;Hi @Sandesh Puligundla​&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Great to meet you, and thanks for your question!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt; Let's see if your peers in the community have an answer to your question. Thanks.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Jun 2023 06:10:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/apply-a-function-across-multiple-smaller-dataframes-created-from/m-p/3164#M323</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-06-16T06:10:06Z</dc:date>
    </item>
  </channel>
</rss>

