Machine Learning

Forum Posts

bothma2
by New Contributor
  • 94 Views
  • 3 replies
  • 0 kudos

How do I select an 80/10/10 split when doing AutoML?

Headline says it all. I am doing a regression and want to select a train/validation/test split that is not 60/20/20. Does anyone know how to do this?

Latest Reply
mhiltner
New Contributor III
  • 0 kudos

You'd need to put 80% of your data with the earliest timestamp, then 10% with another one and 10% with another. 

  • 0 kudos
2 More Replies
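
A minimal sketch related to the question above, assuming the goal is simply to tag each row with a split label in 80/10/10 proportions. The function name `assign_split_labels` and the label values are hypothetical. Whether your AutoML runtime accepts such a column (for example through a `split_col` argument) is an assumption to verify against the documentation for your Databricks Runtime ML version; it is not confirmed in this thread.

```python
import random
from collections import Counter


def assign_split_labels(n_rows, fractions=(0.8, 0.1, 0.1), seed=42):
    """Return n_rows labels ("train", "validate", "test") in shuffled order."""
    n_train = round(n_rows * fractions[0])
    n_val = round(n_rows * fractions[1])
    n_test = n_rows - n_train - n_val  # the remainder goes to the test set
    labels = ["train"] * n_train + ["validate"] * n_val + ["test"] * n_test
    # A seeded shuffle keeps the assignment reproducible between runs.
    random.Random(seed).shuffle(labels)
    return labels


labels = assign_split_labels(1000)
counts = Counter(labels)
print(counts["train"], counts["validate"], counts["test"])
```

The resulting list can be attached to the dataset as an extra string column, one label per row.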
semsim
by New Contributor
  • 230 Views
  • 1 reply
  • 0 kudos

Content Type error legacy serving

Hi, I have deployed an endpoint in Databricks using legacy serving. I am using a custom pyfunc in MLflow to deploy the custom code. This code uses machine learning to parse out the table of contents in some PDF files, then returns the table of conten...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @semsim, Ensure that you’re setting the Content-Type header correctly when making requests to your model endpoint. Since you mentioned using Postman, make sure you set the header to application/json. Verify that the request body is also correctly...

  • 0 kudos
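
As a sketch of the header advice above, the same request can be built in Python with the standard library. The endpoint URL, token, and payload shape are placeholders and assumptions, not values from this thread; check the input format your model expects.

```python
import json
import urllib.request

# Hypothetical endpoint URL and token; replace with your own values.
ENDPOINT_URL = "https://example.invalid/model/my-model/1/invocations"
API_TOKEN = "<your-token>"


def build_scoring_request(payload, url=ENDPOINT_URL, token=API_TOKEN):
    """Build (but do not send) a JSON POST request for a model endpoint."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


# The payload shape is an assumption for illustration only.
request = build_scoring_request({"inputs": ["example.pdf"]})
# urllib normalizes header names, so the key is stored as "Content-type".
print(request.get_header("Content-type"))
# To actually send it: urllib.request.urlopen(request)
```

Building the request separately from sending it makes it easy to inspect the headers and body before calling the endpoint.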
Iblouse
by New Contributor
  • 202 Views
  • 1 reply
  • 1 kudos

Machine Learning Practitioner Learning Plan notebook demos

I am enrolled in the free version of the Machine Learning Practitioner Learning Plan, but I can't get the notebook demos to run on Databricks Community Edition. How can I do the demo exercises for these courses? Is there an alternative?

Latest Reply
Kaniz
Community Manager
  • 1 kudos

Hi @Iblouse, Thank you for sharing your concern on Community! To expedite your request, please list your concerns on our ticketing portal. Our support staff would be able to act faster on the resolution (our standard resolution time is 24-48 hours)...

  • 1 kudos
ThomasSvane
by New Contributor
  • 450 Views
  • 1 reply
  • 0 kudos

Machine learning accuracy depends on execution plans

I'm using Databricks for a machine learning project: a fairly standard text classification problem, where I want to use the description of an item (e.g. AXELTTNING KOLKERAMIK MM) to predict which of n product categories the item belongs to ('Bushin...

Latest Reply
-werners-
Esteemed Contributor III
  • 0 kudos

That is weird. The regression algorithm should just do a prediction on a dataframe. Such a huge difference in accuracy seems very suspicious. I would test the algorithm on a reference dataset, for which you know the accuracy beforehand. Perhaps your tr...

  • 0 kudos
Shreyash
by New Contributor II
  • 783 Views
  • 5 replies
  • 0 kudos

java.lang.ClassNotFoundException: com.johnsnowlabs.nlp.DocumentAssembler

I am trying to serve a pyspark model using an endpoint. I was able to load and register the model normally. I could also load that model and perform inference but while serving the model, I am getting the following error: [94fffqts54] ERROR StatusLog...

Machine Learning
Model serving
sparknlp
Latest Reply
Rajora
New Contributor II
  • 0 kudos

I'm having the same problem and have tried various solutions with no luck. I found some potentially relevant information on the following link: https://www.johnsnowlabs.com/serving-spark-nlp-via-api-3-3-databricks-jobs-and-mlflow-serve-apis/  In the ...

  • 0 kudos
4 More Replies
tim-mcwilliams
by New Contributor III
  • 338 Views
  • 2 replies
  • 0 kudos

Notebook cell gets hung up but code completes

Have been running into an issue when running a pymc-marketing model in a Databricks notebook. The cell that fits the model gets hung up and the progress bar stops moving, however the code completes and dumps all needed output into a folder. After the...

Latest Reply
tim-mcwilliams
New Contributor III
  • 0 kudos

Hi @Kaniz, Thanks for the feedback here as well as on the other discussion forum. I've commented on your troubleshooting tips on that board. One thing to touch upon here: "Verify that the output files are correct and contain the expected results." All o...

  • 0 kudos
1 More Replies
enkefalos-commu
by New Contributor III
  • 264 Views
  • 2 replies
  • 1 kudos

Unable to deploy phi-3 model due to packaging library

I am trying to deploy the phi-3 model in Databricks but am getting the error below while creating the serving endpoint. Please help us with this as soon as possible.

Latest Reply
enkefalos-commu
New Contributor III
  • 1 kudos

Initially, I attempted the task without including it in the requirements.txt file. Upon encountering an error, I subsequently added it to the list of dependencies.

  • 1 kudos
1 More Replies
elgeo
by Valued Contributor II
  • 1309 Views
  • 3 replies
  • 2 kudos

Table name as a parameter in SQL UDF

Hello experts, We would like to create a UDF that takes a table name as an input parameter. Please check the simple example below: CREATE OR REPLACE FUNCTION F_NAME(v_table_name STRING, v_w...

Latest Reply
alm
New Contributor III
  • 2 kudos

Did you find a solution? I'm having the same problem.

  • 2 kudos
2 More Replies
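
No solution is shown in this thread. One common workaround, offered here as an assumption rather than as the accepted answer, is to build the query text in Python after validating the table name, instead of passing the name into a SQL UDF. The helper names `qualified_table_name` and `build_count_query` are hypothetical.

```python
import re

# Conservative pattern for one unquoted identifier part (an assumption;
# loosen it only if your table names really need other characters).
_IDENTIFIER = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def qualified_table_name(name):
    """Validate a 1- to 3-part table name and return it backtick-quoted."""
    parts = name.split(".")
    if not 1 <= len(parts) <= 3 or not all(_IDENTIFIER.fullmatch(p) for p in parts):
        raise ValueError(f"Unsafe table name: {name!r}")
    return ".".join(f"`{p}`" for p in parts)


def build_count_query(table_name):
    """Return a row-count query for the validated table name."""
    return f"SELECT COUNT(*) AS n FROM {qualified_table_name(table_name)}"


print(build_count_query("main.sales.orders"))
```

In a notebook, the returned string could then be passed to `spark.sql(...)`. Validating the name first guards against SQL injection through the parameter.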
amal15
by New Contributor II
  • 233 Views
  • 2 replies
  • 0 kudos

error: not found: type XGBoostEstimator

error: not found: type XGBoostEstimator (Spark & Scala)

Latest Reply
shan_chandra
Esteemed Contributor
  • 0 kudos

@amal15 - Can you please add the following to your import statements and see if it works? ml.dmlc.xgboost4j.scala.spark.XGBoostEstimator

  • 0 kudos
1 More Replies
ccataV
by New Contributor
  • 425 Views
  • 3 replies
  • 0 kudos

databricks-vectorsearch lib install

Hello, I'm trying to create and query a vector search index, as in this example: How to create and query a Vector Search index | Databricks on AWS, but on Databricks on Azure. I have a cluster in a private network, so I need to install the suggested lib ...

Latest Reply
shan_chandra
Esteemed Contributor
  • 0 kudos

@ccataV - Databricks Vector Search is in public preview in selected regions as per the documentation below. However, since you appear to be a PVC user, you may need to reach out to Databricks support for accessibility. https://docs.databricks.com/en/gen...

  • 0 kudos
2 More Replies
Kaizen
by Contributor III
  • 429 Views
  • 2 replies
  • 0 kudos

Model Serving Latency Chart

Hi, For the model serving latency graph, what are p50 and p99? I am only serving one model on this endpoint, so I'm surprised to see two models being tracked.

Latest Reply
Kaizen
Contributor III
  • 0 kudos

If I'm not mistaken, this refers to 50% of responses and 99% of responses, and averages accordingly for the metrics? @s_park @Sujitha @Debayan

  • 0 kudos
1 More Replies
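
For context, p50 and p99 conventionally denote the 50th and 99th percentile of request latency, so they are two statistics for the same model rather than two models. How the chart computes them is not stated in this thread; the sketch below assumes the nearest-rank method, and the sample latencies are made up.

```python
def percentile(values, p):
    """Nearest-rank percentile: the smallest value with at least p% of the data at or below it."""
    if not values:
        raise ValueError("values must not be empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in the range (0, 100]")
    ordered = sorted(values)
    # Integer ceiling of p * n / 100 avoids floating-point rounding issues.
    rank = (p * len(ordered) + 99) // 100
    return ordered[rank - 1]


# Made-up latencies in milliseconds, with one slow outlier.
latencies_ms = [10, 12, 11, 13, 500]
p50 = percentile(latencies_ms, 50)
p99 = percentile(latencies_ms, 99)
print(p50, p99)
```

The median (p50) reflects the typical request, while p99 is dominated by the slowest requests, which is why the two lines on a latency chart can differ widely.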
VineethRavella
by New Contributor
  • 682 Views
  • 2 replies
  • 0 kudos

Databricks Conference

The Dolly 2.0 session involving RAGs was very informative!

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @VineethRavella, Thank you for attending DAIS 2023 and for sharing your experience on the community platform. We're glad to hear that you found the Dolly 2.0 session involving RAGs informative! We wanted to let you know that the Databricks Communit...

  • 0 kudos
1 More Replies
Prabhaker
by New Contributor III
  • 441 Views
  • 1 reply
  • 0 kudos

Data + AI Summit 2023

Great learning and fun at the 2023 summit hosted by Databricks

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Prabhaker, It's great to hear that you had such a positive experience at DAIS 2023! We're thrilled that you found the summit to be a great learning and networking opportunity. We wanted to let you know that the Databricks community team will be ...

  • 0 kudos
HangLePhuonf
by New Contributor
  • 727 Views
  • 1 reply
  • 0 kudos

MLOps

I’m here to learn more about Databricks MLOps. I’ve learnt so much about how to build and maintain production-level ML models. I will apply this knowledge to build scalable ML solutions for my company.

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @HangLePhuonf! Thank you for sharing your experience and enthusiasm for Databricks MLOps! It's great to hear that you found the sessions valuable and are planning to apply your learnings to build scalable ML solutions for your company. I wanted t...

  • 0 kudos
JamesBrown54
by New Contributor
  • 189 Views
  • 1 reply
  • 0 kudos

Retrieve data from multiple .mdb files using Python.

Hello, I'm interested in accessing several .mdb Access files stored in either Azure Data Lake Storage (ADLS) or the Databricks File System using Python. Could you provide guidance on how to accomplish this? It would be immensely helpful if you could a...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @JamesBrown54, Let’s break down the steps for accessing .mdb Access files from both Azure Data Lake Storage (ADLS) and the Databricks File System using Python. Accessing .mdb Files in Azure Data Lake Storage (ADLS): Prerequisites: An Azure ...

  • 0 kudos