Databricks Community

__max · 10-18-2017

Hello, Just in case, here is an example for proposed solution above: import org.apache.spark.sql.functions._ import org.apache.spark.sql.expressions._ import org.apache.spark.sql.types._ val data = Seq(("A", Seq((3,4),(5,6),(7,10))), ("B", Seq((-1,...

__max · 08-30-2017

Hello, Please, take a look at the documentation: https://docs.databricks.com/user-guide/dbfs-databricks-file-system.html#dbfs-command-line-interface To delete created folder in notebooks: %fs rm -r foobar To download files, please, use dbfs: # Get ...

__max · 06-13-2017

Actually, the intersection transformation does deduplication. If you don't need it, you can just slightly modify your code: val filteredRdd = rddAll.filter(line => line.contains(token)) and send data of the rdd to your program by calling of an act...

Databricks Community

User Stats

User Activity

Re: How do I register a UDF that returns an array of tuples in scala/spark?

Re: How can I delete folders from my DBFS?

Re: Filtering records for all values of an array in Spark