cancel
Showing results for 
Search instead for 
Did you mean: 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results for 
Search instead for 
Did you mean: 
Data + AI Summit 2024 - Data Science & Machine Learning

Forum Posts

117074
by New Contributor III
  • 2462 Views
  • 2 replies
  • 1 kudos

sklearn logistic regression restarting kernel

Hello,I am trying to create a simple logistic regression model to test for the impact of PCA on my dataset. I'm getting the error in the attachment.It feels to me like a version mismatching - but I'm using a fresh cluster with no additional packages ...

  • 2462 Views
  • 2 replies
  • 1 kudos
Latest Reply
117074
New Contributor III
  • 1 kudos

Sorry for the long delay - holiday & work activities stopped from me progressing this issue. The error was actually an OOM error - fixed by configuring cluster sizes appropriately. 

  • 1 kudos
1 More Replies
Kaniz_Fatma
by Community Manager
  • 3358 Views
  • 4 replies
  • 3 kudos
  • 3358 Views
  • 4 replies
  • 3 kudos
Latest Reply
Raluka
New Contributor III
  • 3 kudos

In the world of academic and professional writing, finding reliable essay specialists can be a daunting task. However, I've come nursingpaper.com across a website that simplifies this process. It's a platform where you can connect with seasoned write...

  • 3 kudos
3 More Replies
Soumyajeet_das
by New Contributor
  • 10477 Views
  • 2 replies
  • 0 kudos

Databricks ML Professional Certification exam got suspended due to technical issue

Hi Team,I was taking online exam for Databricks Machine Learning Professional exam on 22nd September,2023(16:15 Asia/Calcutta), the exam first got suspended due to proctoring issue and was rescheduled by the proctor in the next 30 mins, then the exam...

  • 10477 Views
  • 2 replies
  • 0 kudos
Latest Reply
Cert-Team
Esteemed Contributor
  • 0 kudos

@Soumyajeet_das Thank you for filing a ticket through support! The team is working on it. Thank you for your patience.

  • 0 kudos
1 More Replies
varun-adi
by New Contributor
  • 2957 Views
  • 1 replies
  • 0 kudos

How to allocate more memory to GPU when training through databricks notebook

I am trying to train a Hubert Model, specifically the facebook/hubert-base-ls960 model on a custom speech dataset.Training parameters are below:trainer_config = {  "OUTPUT_DIR": "results",  "TRAIN_EPOCHS": 6,  "TRAIN_BATCH_SIZE": 2,  "EVAL_BATCH_SIZE...

  • 2957 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @varun-adi, Based on the provided information, there are several ways you can try to ensure more memory is available to CUDA and the process when training through the notebook: 1. **Tune the batch size**: You've already tried reducing the batch si...

  • 0 kudos
m12
by New Contributor II
  • 5271 Views
  • 3 replies
  • 2 kudos

Resolved! Enabling vector search in the workspace

Hi,I'm testing out LLM/RAG Databricks demo here: https://notebooks.databricks.com/demos/llm-rag-chatbot/index.html?_gl=1*1nj8hq2*_gcl_au*MTcxOTY0MDY4LjE2OTQ2MzgwNDU.# As part of the demo, I'm trying to create a vector search with the line below. vsc....

  • 5271 Views
  • 3 replies
  • 2 kudos
Latest Reply
Kumaran
Valued Contributor III
  • 2 kudos

Hi @m12, Thank you for posting your question in the Databricks community. The vector search feature is currently undergoing a private preview. If you wish to participate, kindly complete the form provided below for onboarding. https://docs.google.com...

  • 2 kudos
2 More Replies
adithyasanker
by New Contributor II
  • 10495 Views
  • 1 replies
  • 0 kudos

MLFlow Error

On running https://www.databricks.com/notebooks/mlops/deploy_azure_ml_model_.htmlAt the build docker image step, mlflow.azureml.build_imageI faced an error ''TypeError: join() argument must be str, bytes, or os.PathLike object, not 'dict''' .The mode...

  • 10495 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @adithyasanker, This error is typically related to Python and occurs when the os.path.join() function is provided with a dictionary as an argument, which is not a valid input type. As for your question about whether this could be an MLflow version...

  • 0 kudos
lndlzy
by New Contributor II
  • 1820 Views
  • 1 replies
  • 0 kudos

MLFlow Recipes + Feature Store

Hi everyone,I am currently exploring MLFlow recipes, is there someone here who has already tried implementing MLFlow Recipes along with Databricks Feature Store? I am curious as to how you somehow defined the ingestion steps since I am unable to thin...

  • 1820 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @lndlzy, To integrate MLflow Recipes with Databricks Feature Store, follow these steps. 1. **Define Features**: Write code to convert raw data into features and create a Spark DataFrame containing the desired features. If your workspace is enable...

  • 0 kudos
adithyasanker
by New Contributor II
  • 3397 Views
  • 2 replies
  • 0 kudos

Resolved! Mlflow Import error

I am trying to deploy the latest mlFlow registry Model to Azure ML by following the article: https://www.databricks.com/notebooks/mlops/deploy_azure_ml_model_.htmlBut during the import process at cmd:6 . I am getting an error ModulenotFoundError No m...

  • 3397 Views
  • 2 replies
  • 0 kudos
Latest Reply
adithyasanker
New Contributor II
  • 0 kudos

@Kaniz_Fatma Thank you, that solved the issue.But on proceeding with the execution, at the build image step, I faced another issue.''TypeError: join() argument must be str, bytes, or os.PathLike object, not 'dict''' .The model is registered successfu...

  • 0 kudos
1 More Replies
kashy
by New Contributor III
  • 2331 Views
  • 2 replies
  • 0 kudos

Creating or using a custom defined model with SpaCy

I want to train and use a custom model with spaCy.   I don't know how to manage and create folders that the model would be need to save and load custom models and associated files (e.g. from DBFS)It should be something like this but it doesn't accept...

  • 2331 Views
  • 2 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @kashy , To train and use a custom model with spaCy, you would need to save and load your model. However, you're correct that spaCy does not directly accept a path from DBFS. To work around this, you can save your trained model to DBFS and then l...

  • 0 kudos
1 More Replies
Roshanshekh
by New Contributor II
  • 3608 Views
  • 1 replies
  • 0 kudos

Data

To import an Excel file into Databricks, you can follow these general steps: 1. **Upload the Excel File**: - Go to the Databricks workspace or cluster where you want to work. - Navigate to the location where you want to upload the Excel file. - Click...

  • 3608 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Roshanshekh ,  Your step-by-step guide on importing an Excel file into Databricks is spot-on! This comprehensive approach is incredibly helpful for anyone looking to work with Excel data in Databricks. Your detailed code example and e...

  • 0 kudos
lndlzy
by New Contributor II
  • 1453 Views
  • 2 replies
  • 0 kudos

StackOverflow Error - FeatureLookup & fs.create_training_set

When trying to utilize feature_lookup on at least 2 feature tables and trying fs.create_training_set, I get a stackoverflow error. Can anyone help me understand why this happens? This hasn't happened before but now I get this error and I am unable to...

  • 1453 Views
  • 2 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @lndlzy, A StackOverflow error usually occurs when your program recurses too deeply. In this case, it might be due to a problem with the FeatureStoreClient.create_training_set method or how the FeatureLookup objects are defined or used. Here are ...

  • 0 kudos
1 More Replies
UmaMahesh1
by Honored Contributor III
  • 2154 Views
  • 2 replies
  • 9 kudos

How to get list of all the tabular models in a Analysis server using databricks ?

Hello community, I want to fetch the list of all the tabular models (if possible details about those models too) which are there in a SQL Analysis server using databricks. Can anyone help me out ?Use case: I want to process clear a large number of mo...

  • 2154 Views
  • 2 replies
  • 9 kudos
Latest Reply
omfspartan
New Contributor III
  • 9 kudos

Did you try Azure Analysis Services Rest API?

  • 9 kudos
1 More Replies
jonathan-dufaul
by Valued Contributor
  • 3133 Views
  • 5 replies
  • 5 kudos

Does FeatureStoreClient().score_batch support multidimentional predictions?

I have a pyfunc model that I can use to get predictions. It takes time series data with context information at each date, and produces a string of predictions. For example:The data is set up like below (temp/pressure/output are different than my inpu...

  • 3133 Views
  • 5 replies
  • 5 kudos
Latest Reply
EmilAndersson
New Contributor II
  • 5 kudos

I have the same question. I've decided to look for alternative Feature Stores as this makes it very difficult to use for time series forecasting.

  • 5 kudos
4 More Replies
bento
by New Contributor
  • 2224 Views
  • 1 replies
  • 0 kudos

Notebook Langchain ModuleNotFoundError: No module named 'langchain.retrievers.merger_retriever'

Hi,As mentioned in the title, receiving this error despite%pip install --upgrade langchainSpecific line of code:from langchain.retrievers.merger_retriever import MergerRetriever All other langchain import works when this is commented out. Same line w...

  • 2224 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @bento, â€¢ The error message "ModuleNotFoundError: No module named ’langchain.retrievers.merger_retriever’" indicates that the Python module ’langchain.retrievers.merger_retriever’ is not found in the current environment.• The code suggests that th...

  • 0 kudos
JamieCh
by New Contributor
  • 817 Views
  • 1 replies
  • 0 kudos

Pandas options

Hi All,Per this post's suggestion:https://towardsdatascience.com/a-solution-for-inconsistencies-in-indexing-operations-in-pandas-b76e10719744 I put the following code in Databricks notebook:import pandas as pd pd.set_option('mode.copy_on_write', True...

  • 817 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @JamieCh, The error you're encountering is because pandas have no option. The set_option function in pandas is used to change the default number of rows to display or to change the precision of the floating point numbers. However, 'mode.copy_on_wr...

  • 0 kudos

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group
Labels