for those of you who are looking for a not too complicated solution, you can use the two built in spark api soundex and levenshtein
val newDF = accountDF.join(
accountDF2,
levenshtein(accountDF("name"), accountDF2("name")) < 3 && (accountDF("i...
for those of you looking for a not very complicated solution, you can use the 2 native spark api Soundex and Levenshtein as your fuzzy matching algorithms.val joinedDF = accountDF.join( accountDF2, levenshtein(accountDF("name"), accountDF2("name")) ...