@Yosep Wijaya
When you write the image to DBFS, you should write it to /FileStore/svm.jpg
When you reference the file in html, it should be via /files/svm.jpg
If I understand you correctly, you have a large array of tokens, and you want to filter that large array against a smaller array of tokens.
You should convert these arrays into RDDs and then use the intersect() function to just return the tokens in ...