Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
We're trying to update permission in Unity Catalog using spark sql statetment spark.sql("ALTER TABLE %s.%s OWNER TO `%s`;" % (raw_schema, table_name, 'Metastore_Admins'))But we're running into following error We're facing issues like these in DBR 10....
I have seen this beforeso there is one file in the DBFS where we have to write the whitelisted file , so you have to find out where is your whitelisted file location after that you have to add that "com.databricks.....parallel" line in that file and ...
I'm trying to use pypmml in a DB notebook, but I'm getting the known `Error : Py4JError: Could not find py4j jar at` error. I've followed the solution here: https://kb.databricks.com/libraries/pypmml-fail-find-py4j-jar.html. However, this has not wor...
I've been struggling myslef with it, but after installing pypmml for spark, I can use the other library, maybe it will work for you:runtime 10.4 LTS MLinstall pypmml-spark (https://github.com/autodeployai/pypmml-spark)install pmml4s-spark (org.pmml4s...
I have an NLP application that I build on my local machine using spacy and pandas, but now I would like to scale my application to a large production dataset and utilize the benefits of sparks distributed compute. How do I import and utilize a librar...
It depends on what you mean, but if you're just trying to (say) tokenize and process data with spacy in parallel, then that's trivial. Write a 'pandas UDF' function that expresses how you want to transform data using spacy, in terms of a pandas DataF...