@Retired_mod thanks for the response. As I understand, there are 3 options to be explored to get optimal performance out of rocksdb based state management:
1. specify a local directory 'rocksdb.localdir'
--> will you be able to guide how (through which configuration) this can be specified?
2. implement asynchronous checkpoints
--> I looked more into the details of asynchronous checkpoints through this article https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/async-checkpointing
As mentioned in the limitations, cluster resizing might not work well with asynchronous checkpointing. Since we are using auto scaling feature for our databricks cluster, does that mean that we won't be able to use asynchronous checkpointing as it will frequently resize the cluster?
3. Databricks' state rebalancing
--> will explore this more