Edthehead
Contributor III

You can refer to this article Optimizing Large-Scale Fuzzy Matching with Apache Spark and Databricks | by Gavaragirijarani | Mediu....

As far as open-source libraries go, rapidfuzz is known to be faster than fuzzywuzzy.