cancel
Showing results for 
Search instead for 
Did you mean: 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results for 
Search instead for 
Did you mean: 
Data + AI Summit 2024 - Data Science & Machine Learning

Forum Posts

dvmentalmadess
by Valued Contributor
  • 2051 Views
  • 1 replies
  • 2 kudos

Resolved! Store a secret only accessible to the current user

During an interactive notebook session, I want a user to be able to retrieve a secret specific to that user. I haven't decided on storage mechanisms, but I'm open to storage mechanisms that can scalably authorize access to a single user and that I ca...

  • 2051 Views
  • 1 replies
  • 2 kudos
Latest Reply
dvmentalmadess
Valued Contributor
  • 2 kudos

I ended up using Databricks Secrets as the storage mechanism after learning from my account rep that the limit is soft and we can request a higher scope limit. In this case, each user gets a dedicated scope and no other users have access.

  • 2 kudos
Slalom_Tobias
by New Contributor III
  • 2722 Views
  • 4 replies
  • 1 kudos

Resolved! ML Practioner | ml 09 - automl notebook | error on importing databricks.automl

executing the following code...from databricks import automlsummary = automl.regress(train_df, target_col="price", primary_metric="rmse", timeout_minutes=5, max_trials=10)generates the error...ImportError: cannot import name 'automl' from 'databricks...

  • 2722 Views
  • 4 replies
  • 1 kudos
Latest Reply
Krueger156
New Contributor II
  • 1 kudos

I'm happy to see a particularly subject.

  • 1 kudos
3 More Replies
HariK1
by New Contributor II
  • 1939 Views
  • 2 replies
  • 1 kudos

How to input initial centroids to K-Means or GMM Clustering in SparkML ?

Hi, I want to use KMeans Model or Gaussian Mixture Model algorithm for clustering using the SparkML library, in which I want to specify the initial centroids. The option of giving initial centroids is there in the Spark MLlib (RDD based APIs) however...

  • 1939 Views
  • 2 replies
  • 1 kudos
Latest Reply
HariK1
New Contributor II
  • 1 kudos

@Kaniz Fatma​ I still haven't got an answer to my question!!!

  • 1 kudos
1 More Replies
duck_butter123
by New Contributor II
  • 4965 Views
  • 4 replies
  • 7 kudos

Resolved! No Module named 'mlflow'

I new to the scalable machine learning with apache spark course. I am in the notebook ML 00a - Install Datasets it includes one cell (attached) which throws an error 'no module named 'mlflow''. It attempts to run the Classroom-Setup file. Error is th...

Classroom-Setup code + error ML 00a - Install Dataset Error
  • 4965 Views
  • 4 replies
  • 7 kudos
Latest Reply
Anonymous
Not applicable
  • 7 kudos

@Myles Pember​ I hope the suggestions above helped out! If so, please select one as 'best' for us!If you still need assistance, let us know!

  • 7 kudos
3 More Replies
Slalom_Tobias
by New Contributor III
  • 2483 Views
  • 2 replies
  • 3 kudos

Resolved! ML Practioner | ML 10 - Feature Store notebook | feature_store import error

the following code...from pyspark.sql.functions import monotonically_increasing_id, lit, expr, randimport uuidfrom databricks import feature_storefrom pyspark.sql.types import StringType, DoubleTypefrom databricks.feature_store import feature_table, ...

  • 2483 Views
  • 2 replies
  • 3 kudos
Latest Reply
Anonymous
Not applicable
  • 3 kudos

Hope that was an easy fix - @Tobias Cortese​ ! Thanks for marking the "best answer"!

  • 3 kudos
1 More Replies
mattsteinpreis
by New Contributor III
  • 4327 Views
  • 4 replies
  • 5 kudos

Getting Py4J "Could not find py4j jar" error when trying to use pypmml, solution doesn't work

I'm trying to use pypmml in a DB notebook, but I'm getting the known `Error : Py4JError: Could not find py4j jar at` error. I've followed the solution here: https://kb.databricks.com/libraries/pypmml-fail-find-py4j-jar.html. However, this has not wor...

  • 4327 Views
  • 4 replies
  • 5 kudos
Latest Reply
pawelmitrus
Contributor
  • 5 kudos

I've been struggling myslef with it, but after installing pypmml for spark, I can use the other library, maybe it will work for you:runtime 10.4 LTS MLinstall pypmml-spark (https://github.com/autodeployai/pypmml-spark)install pmml4s-spark (org.pmml4s...

  • 5 kudos
3 More Replies
Direo
by Contributor
  • 1622 Views
  • 2 replies
  • 1 kudos

Is it possible to load MLFlow artifacts and models from local diretory to databricks DBFS?

I have been working locally and created a few models and now I want to move those to databricks/DBFS. Is it possible to do that?

  • 1622 Views
  • 2 replies
  • 1 kudos
Latest Reply
Vivian_Wilfred
Databricks Employee
  • 1 kudos

Hi @Direo Direo​, can you check these docs and see if it helps-https://docs.databricks.com/applications/mlflow/access-hosted-tracking-server.html#access-the-mlflow-tracking-server-from-outside-databrickshttps://docs.databricks.com/applications/mlflow...

  • 1 kudos
1 More Replies
Deep
by New Contributor II
  • 2075 Views
  • 3 replies
  • 1 kudos

ML Model serving cluster tags?

Is there a way to add tags automatically to ML Model serving clusters? I see we can add tags to the model itself which persist but any tags I add to the cluster serving it do not after the endpoint is stopped. This would be important to track billing...

  • 2075 Views
  • 3 replies
  • 1 kudos
Latest Reply
Anonymous
Not applicable
  • 1 kudos

Hey there @Deep Kalra​ Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you....

  • 1 kudos
2 More Replies
jnjns
by New Contributor II
  • 971 Views
  • 0 replies
  • 3 kudos

Java Error for installation rasterframes

Hi all,I have followed the steps in this notebook to install rasterframes on my databricks cluster.Eventually I am able to import the following:from pyrasterframes import rf_ipython from pyrasterframes.utils import create_rf_spark_session from pyspar...

  • 971 Views
  • 0 replies
  • 3 kudos
hulma
by New Contributor II
  • 2983 Views
  • 4 replies
  • 0 kudos

Model serving keep relaunching

Hello, I tried to serve my model realtime. Model process keeps relaunching.I am getting this error in the logs, TypeError: Descriptors cannot not be created directly. If this call came from a _pb2.py file, your generated code is out of date and must ...

  • 2983 Views
  • 4 replies
  • 0 kudos
Latest Reply
Anonymous
Not applicable
  • 0 kudos

Hey there @Hulma Abdul Rahman​ Hope you are well. Just wanted to see if you were able to find an answer to your question and would you like to mark an answer as best? It would be really helpful for the other members too.Cheers!

  • 0 kudos
3 More Replies
Chiraag
by New Contributor II
  • 1103 Views
  • 1 replies
  • 1 kudos
  • 1103 Views
  • 1 replies
  • 1 kudos
Latest Reply
Noopur_Nigam
Databricks Employee
  • 1 kudos

Hi @Chiraag Lathia​ Gentle reminder on the answer provided by @Kaniz Fatma​ . Please let us know if you have more doubts or queries.

  • 1 kudos
Mikelo
by New Contributor II
  • 1272 Views
  • 1 replies
  • 3 kudos

Is there a latency difference between querying from the feature store and delta table?

Our team ran a benchmarking experiment comparing queries from the MLFlow feature store vs directly from delta table, and we found querying delta table was ~2-3 times faster than the feature store. So I’m wondering if someone else has done a similar b...

  • 1272 Views
  • 1 replies
  • 3 kudos
Latest Reply
Noopur_Nigam
Databricks Employee
  • 3 kudos

Hi @Mike Lo​ Gentle reminder on the answer provided by @Kaniz Fatma​ . Please let us know if you have more doubts or queries.

  • 3 kudos
vivoedoardo
by New Contributor II
  • 2076 Views
  • 3 replies
  • 1 kudos

How to track features used and filters in MLFlow?

Hello everyone,We are experimenting with several approaches in a Machine Learning project ( binary classification), and we would like to keep track of those using MLFlow. We are using the feature store to build, store, and retrieve the features, and ...

  • 2076 Views
  • 3 replies
  • 1 kudos
Latest Reply
NathanielN
New Contributor II
  • 1 kudos

 Thanks for the information, I will try to figure it out for more. Keep sharing such informative post keep suggesting such post.

  • 1 kudos
2 More Replies
Dhara
by New Contributor III
  • 17652 Views
  • 9 replies
  • 5 kudos

Access multiple .mdb files using Python

Hi, I wanted to access multiple .mdb access files which are stored in the Azure Data Lake Storage(ADLS) or on Databricks File System using Python. Is it possible to guide me how can I achieve it? It would be great if you can share some code snippets ...

  • 17652 Views
  • 9 replies
  • 5 kudos
Latest Reply
User16764241763
Honored Contributor
  • 5 kudos

@Dhara Mandal​ Can you please try below?# cmd 1 %pip instal pandas_access   # cmd 2 import pandas_access as mdb   db_filename = '/dbfs/FileStore/Campaign_Template.mdb'   # Listing the tables. for tbl in mdb.list_tables(db_filename): print(tbl)   ...

  • 5 kudos
8 More Replies

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group
Labels