Hi Mantu,

Let me share some insights based on our experience of me and my team on a topic detection problem.
We used semantic chunker function in LangChain with using Text-embedding-ada-002 model from openAI.
For longer text we used a different approach involving gpt4 to make the semantic split.
If you are interested in the topic I can suggest you to have a look at this Medium article where my collegue explain the process in a very clear way "Transforming Voice of Customer into Actionable Insights with BERTopic and OpenAI on Databricks"

Hope this could help, let me know your thought about it.