cancel
Showing results for 
Search instead for 
Did you mean: 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results for 
Search instead for 
Did you mean: 
Data + AI Summit 2024 - Data Science & Machine Learning

Forum Posts

Orianh
by Valued Contributor II
  • 1766 Views
  • 1 replies
  • 2 kudos

Run mlflow project from a Job.

Hey Guys, I'm trying to make automated process to run ML training sessions using mlflow and databricks jobs.While developing the model on my local machine using IDE, When finished I have a template notebook that get as parameters the mlflow project p...

error
  • 1766 Views
  • 1 replies
  • 2 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 2 kudos

Hi @orian hindi​ â€‹, We haven’t heard from you since the last response, and I was checking back to see if you have a resolution yet. If you have any solution, please share it with the community as it can be helpful to others. Otherwise, we will respon...

  • 2 kudos
cheng
by New Contributor
  • 734 Views
  • 0 replies
  • 0 kudos

Prophet/PyStan compiling error in Runtime 10.4 LTS ML

We're upgrading our ML jobs from using Runtime 9.1 LTS ML to Runtime 10.4 LTS ML in Databricks. One of the libraries our jobs relying on is Prophet. From 9.1 to 10.4, both the versions of Prophet (1.0.1) and PyStan (2.19.1.1) haven't changed, however...

  • 734 Views
  • 0 replies
  • 0 kudos
DavideCagnoni
by Contributor
  • 8060 Views
  • 8 replies
  • 7 kudos

Resolved! How to use python packages from `sys.path` ( in some sort of "edit-mode") which functions on workers too?

The help of `dbx sync` states that ```for the imports to work you need to update the Python path to include this target directory you're syncing to```This works quite well whenever the package is containing only driver-level functions. However, I ran...

  • 8060 Views
  • 8 replies
  • 7 kudos
Latest Reply
Scott_B
New Contributor III
  • 7 kudos

Hi @Davide Cagnoni​. Please see my answer to this post https://community.databricks.com/s/question/0D53f00001mUyh2CAC/limitations-with-udfs-wrapping-modules-imported-via-repos-filesI will copy it here for you:If your notebook is in the same Repo as t...

  • 7 kudos
7 More Replies
zzy
by New Contributor III
  • 1574 Views
  • 2 replies
  • 2 kudos

Why is GPU accelerated node much slower than CPU node for training a random forest model on databricks?

I have a dataset about 5 million rows with 14 features and a binary target. I decided to train a pyspark random forest classifier on Databricks. The CPU cluster I created contains 2 c4.8xlarge workers (60GB, 36core) and 1 r4.xlarge (31GB, 4core) driv...

  • 1574 Views
  • 2 replies
  • 2 kudos
Latest Reply
Hubert-Dudek
Esteemed Contributor III
  • 2 kudos

In many cases, you need to adjust your code to utilize GPU.

  • 2 kudos
1 More Replies
elementalM
by New Contributor III
  • 2333 Views
  • 4 replies
  • 4 kudos

Catch-up Structured Stream hangs on last step of write job to delta sync using toTable

I'm running databricks version 10.4 on gcp. I'm running a structured stream trying to process historical files in a delta table on gcp cloud storage. This source delta table is big but maintained with OPTIMIZE.The stream repartitions which seems to b...

image
  • 2333 Views
  • 4 replies
  • 4 kudos
Latest Reply
Anonymous
Not applicable
  • 4 kudos

Hi @Dwight Branscombe​ Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you....

  • 4 kudos
3 More Replies
HemanthVak
by New Contributor II
  • 1271 Views
  • 2 replies
  • 3 kudos

How to isolate environments for different projects in a single mlflow server?

I am planning to deploy MLFlow server deployed in Azure as a centralised repositories for my machine learning experiments and runs and to store events and artifacts. I would like to have different environments or isolated environments in the same wor...

  • 1271 Views
  • 2 replies
  • 3 kudos
Latest Reply
Anonymous
Not applicable
  • 3 kudos

Hi @Hemanth Vakacharla​ Does @Debayan Mukherjee​  response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?We'd love to hear from you.Thanks!

  • 3 kudos
1 More Replies
weldermartins
by Honored Contributor
  • 17036 Views
  • 17 replies
  • 13 kudos

Resolved! Created nested struct schema SPARK - Schema Jira

Hello guys,I'm using Jira API to return "ISSUES". But to be able to use pyspark I need to create the Dataframe passing in the Schema. But I am not able to create the Schema based on the model below. Would you have any ideas?root |-- expand: string ...

  • 17036 Views
  • 17 replies
  • 13 kudos
Latest Reply
-werners-
Esteemed Contributor III
  • 13 kudos

if columns are missing, that particular data is not present in the json. I am not aware of spark skipping columns when reading json with inferschema. There is an option dropFieldIfAllNull but that is False by default.That makes me think: you might ...

  • 13 kudos
16 More Replies
data_engineer_s
by New Contributor II
  • 1144 Views
  • 2 replies
  • 0 kudos

Utilize databricks compute for model training from Pycharm IDE

I like to train my machine learning model from Pycharm IDE. But I want to utilize databricks cluster as compute power to speed up the training. Is it possible

  • 1144 Views
  • 2 replies
  • 0 kudos
Latest Reply
Anonymous
Not applicable
  • 0 kudos

Hi @Suvikram Yerramilli​ Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from yo...

  • 0 kudos
1 More Replies
Willis1
by New Contributor
  • 864 Views
  • 2 replies
  • 1 kudos

Feature Store best practice: refactoring notebook

Hello, I have a question about best practice regarding registering a feature in Databricks feature store.​Lets say that I create and register features​ during the EDA or experiment phase of a ML project. Later the model is moving to production stage ...

  • 864 Views
  • 2 replies
  • 1 kudos
Latest Reply
Anonymous
Not applicable
  • 1 kudos

Hi @Willis Harding​ Does @Kaniz Fatma​  response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?We'd love to hear from you.Thanks!

  • 1 kudos
1 More Replies
jochoa
by New Contributor
  • 1407 Views
  • 1 replies
  • 0 kudos

Resolved! Issue logging into my account

Hello, I need assistance accessing my account in data bricks community edition. I got an error that my account was locked due to recent suspicious activity. I tried to reset my password but did not get an email with password change instructions. Than...

  • 1407 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Juan Ochoa​ , Thank you for reaching out, and we’re sorry to hear about this log-in issue! We have this Community Edition login troubleshooting post on Community. Please take a look, and follow the troubleshooting steps. If the steps do not resol...

  • 0 kudos
jcapplefields88
by New Contributor II
  • 1412 Views
  • 3 replies
  • 1 kudos

Expose low latency APIs from Deltalake for mobile apps and microservices

My company is using Deltalake to extract customer insights and run batch scoring with ML models. I need to expose this data to some microservices thru gRPC and REST APIs. How to do this? I'm thinking to build Spark pipelines to extract teh data, stor...

  • 1412 Views
  • 3 replies
  • 1 kudos
Latest Reply
Noopur_Nigam
Valued Contributor II
  • 1 kudos

Hi @John Capplefield​ Gentle follow-up, please let us know if you need further help on this.

  • 1 kudos
2 More Replies
dsiu
by New Contributor II
  • 1024 Views
  • 1 replies
  • 2 kudos

CountVectorizer no longer works through Azure ML

Hello. I am trying to use the CountVectorizer module as part of our feature engineering. It works on a Databricks notebook directly, but when I try to run the code through Azure with the databricks connection, it throws an error. This isn't the first...

  • 1024 Views
  • 1 replies
  • 2 kudos
Latest Reply
Noopur_Nigam
Valued Contributor II
  • 2 kudos

Hi @Danny Siu​ Please check that you are using the latest dbconnect version corresponding to the DBR version that you are using in the databricks cluster.You can check the latest dbr version here: https://pypi.org/project/databricks-connect/#history

  • 2 kudos
studentofml
by New Contributor
  • 1015 Views
  • 1 replies
  • 0 kudos

Is Model Serving REST API available?

This is mentioned in:https://learn.microsoft.com/en-us/azure/databricks/mlflow/create-manage-serverless-model-endpointswith api call example, while in:https://learn.microsoft.com/en-us/answers/questions/892678/how-to-enable-databricks-model-serving-w...

  • 1015 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Thou Mather​ , Did you get a chance to go through this doc?

  • 0 kudos
ashrafkhan94
by New Contributor II
  • 1563 Views
  • 2 replies
  • 2 kudos

Resolved! Failure in mlflow.spark.load_model : Random Forrest pretrained model

model = mlflow.spark.load_model(model_uri=f"models:/{model_name}/{model_version}")Log:An error occurred while calling o2861.load.: org.apache.spark.SparkException: Job aborted due to stage failure: Task 4 in stage 4599.0 failed 4 times, most recent f...

  • 1563 Views
  • 2 replies
  • 2 kudos
Latest Reply
Noopur_Nigam
Valued Contributor II
  • 2 kudos

Hi @Ashraf Khan​ Did you get a chance to look into Sean's response. Please let us know if you need more help on this.

  • 2 kudos
1 More Replies

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group
Labels