<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Where does custom state store the data in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/where-does-custom-state-store-the-data/m-p/6180#M2397</link>
    <description>&lt;P&gt;Hi @somanath Sankaran​&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you for posting your question in our community! We are happy to assist you.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers your question?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;This will also help other community members who may have similar questions in the future. Thank you for your participation and let us know if you need any further assistance!&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 12 Apr 2023 07:34:49 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2023-04-12T07:34:49Z</dc:date>
    <item>
      <title>Where does custom state store the data</title>
      <link>https://community.databricks.com/t5/data-engineering/where-does-custom-state-store-the-data/m-p/6178#M2395</link>
      <description>&lt;P&gt;There are couple of custom state functions like mapgroupswithstate,&lt;/P&gt;&lt;P&gt;ApplyinpandaswithState&lt;/P&gt;&lt;P&gt;Which has a internal state maintained is it maintained in same statestore(rocksdb) as aggregation state store function ​&lt;/P&gt;</description>
      <pubDate>Sun, 09 Apr 2023 11:34:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-does-custom-state-store-the-data/m-p/6178#M2395</guid>
      <dc:creator>Soma</dc:creator>
      <dc:date>2023-04-09T11:34:10Z</dc:date>
    </item>
    <item>
      <title>Re: Where does custom state store the data</title>
      <link>https://community.databricks.com/t5/data-engineering/where-does-custom-state-store-the-data/m-p/6179#M2396</link>
      <description>&lt;P&gt;@somanath Sankaran​&amp;nbsp;:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Yes, custom state functions like mapGroupsWithState and applyInPandasWithState use the same state store as the built-in aggregation state store. By default, this state is stored in RocksDB, which is an embedded, persistent key-value store that is optimized for storing and retrieving large amounts of data.&lt;/P&gt;&lt;P&gt;The state store is managed by the Databricks runtime and is automatically distributed across the worker nodes in the cluster. This allows the state to be shared and updated across multiple tasks running in parallel. The state is also fault-tolerant and can be recovered in case of a node failure.&lt;/P&gt;&lt;P&gt;When using custom state functions, it's important to keep in mind that the amount of state maintained by the function can have a significant impact on cluster performance and memory usage. It's important to properly configure the state timeout and eviction policies to ensure that old, unused state is regularly cleaned up to avoid running out of memory.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2023 13:03:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-does-custom-state-store-the-data/m-p/6179#M2396</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-04-10T13:03:09Z</dc:date>
    </item>
    <item>
      <title>Re: Where does custom state store the data</title>
      <link>https://community.databricks.com/t5/data-engineering/where-does-custom-state-store-the-data/m-p/6180#M2397</link>
      <description>&lt;P&gt;Hi @somanath Sankaran​&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you for posting your question in our community! We are happy to assist you.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers your question?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;This will also help other community members who may have similar questions in the future. Thank you for your participation and let us know if you need any further assistance!&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 12 Apr 2023 07:34:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-does-custom-state-store-the-data/m-p/6180#M2397</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-04-12T07:34:49Z</dc:date>
    </item>
    <item>
      <title>Re: Where does custom state store the data</title>
      <link>https://community.databricks.com/t5/data-engineering/where-does-custom-state-store-the-data/m-p/79327#M35735</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;As I understand from above response, the state store is distributed across worker nodes, which means the state store would be stored on the local storage of each worker node. Correct me I understood incorrectly?&lt;/P&gt;&lt;P&gt;So, if it is stored on local storage of worker node, then that storage is ephemeral storage which means it would be wiped out on restart of that worker node. In this case how the state is restored on restart or failure of a worker node?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jul 2024 05:52:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-does-custom-state-store-the-data/m-p/79327#M35735</guid>
      <dc:creator>PushkarDeole</dc:creator>
      <dc:date>2024-07-19T05:52:14Z</dc:date>
    </item>
  </channel>
</rss>

