Hi all! I am trying to create an endpoint for Easy OCR. I was able to create the experiment using a wrapper class with the code below: # import libraries
import mlflow
import mlflow.pyfunc
import cloudpickle
import cv2
import re
import easyocr
impo...
Hello,Am trying to create a Vector Search Index but never completes, stuck on "Provisioning pipeline compute". I am using Databricks on AWS Sydney.Looks like the DLT pipeline is failing and retrying and fails on "WAITING_FOR_RESOURCES."Has anyone ha...
Ok, this is closed with the assistance of Gurpreet. My UC Metastore didn't have an S3 bucket associated which it turned out was required for the vector search index. Adding the S3 bucket to the Metastore and re-running the pipeline resolved.Thanks!
I have saved a model in the model registry using MLFlow. How can I find the shap values for this model once I have generated predictions in batch mode? Shap tree explainer does not support the mlflow pyfunc model type. When I use mlflow.shap.log_exp...
Thank you so much for this kind of valuable post its amazing post it may helpful for each visitors. For more information go through my websites here:peacocktv.com/tv | peacocktv.com/tv/xbox
With a copy of notebook https://github.com/JohnSnowLabs/spark-nlp-workshop/blob/master/open-source-nlp/03.0.SparkNLP_Pretrained_Models.ipynb imported into Databricks, there's a lovely visualization created by the cell that you can locate by searching...
Here's a solution: use a parameter (here, `return_html = True`) to get an HTML object back, and then call `displayHTML` to actually display the object.from sparknlp_display import NerVisualizer
visualiser = NerVisualizer()
for i in text_list:
...
Dear Databricks Support Team,I am currently encountering an issue while attempting to serve a model using your platform. Whenever I initiate the model serving process, it fails, and I am unable to successfully deploy my model.Additionally, I am facin...
Hi @hardik, I understand that you’re experiencing difficulties with model serving and deployment. Let’s troubleshoot this issue step by step. Model Serving with Databricks:Databricks offers Model Serving, which exposes MLflow machine learning models ...
How does COPY_INTO work with table restore?I made some tests, and the restore method does NOT restore the key-store values of the target at the specific version, which means that the data that came after the chosen version cannot be inserted (unless ...
I signed up for this course via Databricks Academy : LLMs: Application through Production However I am getting this error when trying to download the needed datasets for the course:Installing datasets:| from "wasbs://courseware@dbacademy.blob.core.wi...
You would need to install the python library. You can either:1) Run %pip install datasets2) Put it as part of the PyPi packages to load in your cluster This should solve your issue
Hi ,When I create a job of a machine learning model and run the job I see that the cell outputs do not get updated. The model variables would have updated, however. I also need to keep the notebook updated with cell outputs always when I run the job...
Hi @chari, Certainly! It sounds like you’re encountering a common issue when running machine learning jobs in notebooks.
Let’s explore some tips to address this:
Cell Outputs Not Updating:
When you run a cell in a notebook, the output is typical...
Data science is a multidisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge and insights from data. It encompasses the entire data lifecycle, from data acquisition to data exploration, modeling, and...
The issue:None of my MLflow experiment results show up in the Experiment UI. Context:I encountered this issue recently, despite having successfully used the MLFlow UI for the past few weeks.Note: I can still access the experiment runs in a notebook, ...
Hello ! Well, fret not, my friend! I've stumbled upon my own little paradise, visit now , and let me tell you, it boasts a video collection that's nothing short of extraordinary. The performers on this platform? Seasoned pros with an unmatched level ...
Hello,I am working on a machine learning project. The dataset I am using has more than 5000000 rows. I am using PySpark, and the attached screenshot is the block I used RandomForestRegressor to train the model.It worked even though it took a pretty l...
Hi @choi_2 , Hello! It sounds like you’re dealing with some challenges while working on your machine learning project.
Handling large datasets can indeed be tricky, especially when memory constraints and processing time come into play.
Let’s ex...
Hi!I'm using MLFlow API to log and load models in Databricks. When we created a dedicated workspace for models registry, one person created multiple models, and for some reason now all models are logged with this person as the creator.This person no ...
Hi @thibault , Hello! It seems you’re dealing with a situation where the original creator of multiple models in your Databricks workspace no longer exists, and you’d like to change the creator to a service principal.
Let’s address this step by step...
I am getting the following error...{
"reason": {
"code": "BOOTSTRAP_TIMEOUT",
"parameters": {
"databricks_error_message": "[id: InstanceId(i-0e552e85c37c9da2d), status: INSTANCE_INITIALIZING, workerEnvId:WorkerEnvId(workerenv-12661843...
Hello,
Thanks for contacting Databricks Support.
From the error message: [Bootstrap Event] Can reach databricks-prod-artifacts-us-east-1.s3.amazonaws.com: [FAILED]. It suggests an issue with reaching a Databricks-related AWS S3 bucket from your env...
Hello,We need to promote models to different environments in different regions, they exist in Unity Catalog.We are setting up DataBricks metastores across different regions.We wanted to follow this approach to copy models via UC https://docs.gcp.data...