Hello,
Just in case, here is an example of the solution proposed above:
import org.apache.spark.sql.functions._
import org.apache.spark.sql.expressions._
import org.apache.spark.sql.types._
val data = Seq(("A", Seq((3,4),(5,6),(7,10))), ("B", Seq((-1,...
Hello,
Please take a look at the documentation:
https://docs.databricks.com/user-guide/dbfs-databricks-file-system.html#dbfs-command-line-interface
To delete the created folder from a notebook:
%fs rm -r foobar
To download files, please use the dbfs command-line interface:
# Get ...
Actually, the intersection transformation deduplicates its output. If you don't need that, you can slightly modify your code:
val filteredRdd = rddAll.filter(line => line.contains(token))
and send the RDD's data to your program by calling an act...
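To make the deduplication point concrete, here is a minimal sketch that compares the two approaches on a tiny in-memory RDD. The sample lines, the token value, and the local SparkSession setup are assumptions made up for illustration. The intersection call is only a guess at what the original code might have looked like.

```scala
import org.apache.spark.sql.SparkSession

// Assumption: a local session for illustration; in spark-shell or a notebook
// getOrCreate() simply returns the session that already exists.
val spark = SparkSession.builder()
  .master("local[*]")
  .appName("intersection-vs-filter")
  .getOrCreate()
val sc = spark.sparkContext

// Hypothetical sample data: note that "spark one" appears twice.
val token = "spark"
val rddAll = sc.parallelize(Seq("spark one", "spark one", "hadoop two", "spark three"))

// filter keeps every matching line, duplicates included.
val filteredRdd = rddAll.filter(line => line.contains(token))

// intersection returns only distinct elements, so the duplicate line is dropped.
val intersectedRdd = rddAll.intersection(filteredRdd)

println(filteredRdd.count())    // 3
println(intersectedRdd.count()) // 2
```

The filter version also avoids the shuffle that intersection needs, so it is usually the cheaper choice when duplicates are acceptable.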