cancel
Showing results for 
Search instead for 
Did you mean: 
Community Platform Discussions
Connect with fellow community members to discuss general topics related to the Databricks platform, industry trends, and best practices. Share experiences, ask questions, and foster collaboration within the community.
cancel
Showing results for 
Search instead for 
Did you mean: 

The databricks jdbc driver has a memory leak

gf
New Contributor

https://community.databricks.com/t5/community-platform-discussions/memory-leak/td-p/80756 My question is the same as above Unable to upload pictures, I had to dictate

Question from ResultFileDownloadMonitor. M_requestList parameters
Because is ResultFileDownloadMonitor getResultFileDownloadMonitor method, so this class into a singleton, and cannot be GC
The m_requestList variable is a HashMap, and the addDownloadTask method is invoked to add elements to the m_requestList whenever the sql query is executed
The problem is that the m_requestList object only has the logic to add and modify elements, but no logic to delete elements, which is weird
After testing, we found that there are more than 120,000 elements in the m_requestList from the dump file, which is only 3 days of data volume
Although the size of each element is small, the amount is too large, and over time there will always be problems
I checked the driver for the latest 2.6.40-patch-1 release and found that the problem still exists

Therefore, I would like to confirm the following questions:
1. Will this issue be fixed in a future release?
2. Can the stopped object in m_requestList be deleted? Can I periodically clean KV whose value is true in m_requestList using reflection?

1 ACCEPTED SOLUTION

Accepted Solutions

Walter_C
Databricks Employee
Databricks Employee

Hello @gf thanks for your question, it seems that this has been reported with Simba, but no fix has been provided yet, as a temporary workaround, you can consider using reflection to periodically clean up the m_requestList by removing KV pairs whose value is true. However, this should be done with caution and thorough testing to ensure it does not introduce new issues.

View solution in original post

1 REPLY 1

Walter_C
Databricks Employee
Databricks Employee

Hello @gf thanks for your question, it seems that this has been reported with Simba, but no fix has been provided yet, as a temporary workaround, you can consider using reflection to periodically clean up the m_requestList by removing KV pairs whose value is true. However, this should be done with caution and thorough testing to ensure it does not introduce new issues.

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group