Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
Hello,Trying to create a custom serving endpoint, using artifacts argument while logging the run/model to save .jar files. These files are called during when calling .predict. JAVA runtime 8 or higher is required to run the jar file, not sure how to ...
Hi, I am using multiple feature stores in my workflow using feature lookups. In my logged pipeline, I have several stages, including Assembler, Standard Scaler, Indexer and then Model. However, I am facing an issue during inference using the `score b...
Hi All,Can we customize the mail subject and body that we receive from Azure Databricks workflow upon failure jobs? Kindly help me, if we can do so.Thanks,Moshe
I have three workspaces and the alerts sent by the jobs running are not referencing the workspace for example. So if I run the job to dev environemnt I get an alert like if the job has been executed from the prod. This si a huge issue for our admins....
Running a python function in the notebook, i am getting the following InvalidConfigurationError: You haven't configured the CLI yet! Please configure by entering `/databricks/python_shell/scripts/db_ipykernel_launcher.py configure`When i try to run...
Facing the same issuefor me the error comes up when mlflow.get_experiment_by_name is called.I am running a custom docker image built on databricksruntime/standard:13.3-LTScustom image so my packages are installed.
https://docs.databricks.com/en/machine-learning/mlops/mlops-workflow.html#7-retrainingIn this article, it is mentioned that we can trigger retraining from the alerts.Triggered. If the monitoring pipeline can identify model performance issues and send...
I am trying to find a way to locally download the model artifacts that build a chatbot chain registered with MLflow in Databricks, so that I can preserve the whole structure (chain -> model -> steps -> yaml & pkl files).There is a mention in a contri...
OK, eventually I found a solution. I write it below, whether somebody will need it. Basically, if in the download_artifacts method the local directory is an existing and accessible one in the DBFS, the process will work as expected.import os
# Con...
Hi,We are facing an mlflow.exceptions.MlflowException when mlflow is called from stream,when we load the model outside the stream, its loaded fine, while when we load it from within stream it fails with exception. to emphasize that it was working til...
We have deployed Dolly (https://huggingface.co/databricks/dolly-v2-3b) as a REST API endpoint on our infrastructure. The notebook we used to do this is included in the text below my question.The Databricks infra used had the following config - (13.2...
I had a similar problem when I used HuggingFacePipeline(pipeline=generate_text) with langchain. It worked to me when I tried to use HuggingFaceHub instead. I used the same dolly-3b model.
I´m facing this exception after use mlflow.langchain.log_model and test the logged model using the following commandprint(loaded_model.predict([{"query": "how does the performance of llama 2 compare to other local LLMs?"}]))tasks failed. Errors: {0: ...
I verified all steps @Retired_mod and the objects and structure were looking good. As far as I understood on tests. Langchain Rag features such as RetrievalQA.from_chain_type does not work well with llm = HuggingFacePipeline instantiation steps. The...
Trying to start an ML experiment on data in an extant metastore within a catalogue (SQL querys run fine on the database). I can start an ML cluster, then attempt to start an AutoML expirement but I get stuck selecting training data - there are no da...
I'm trying to upgrade Tensorflow version from 2.8 to 2.13 on Databricks notebook that is attached to a cluster with Databricks Runtime 10.4. How can I upgrade cuDNN from 8.0 to at least 8.6 to be compatible with the Tensorflow new version?
Hi @Retired_mod , Thanks for your response. When I run '!conda list cudnn' on databricks notebook, I get the following error: '/bin/bash: conda: command not found'
I am trying to finetune llama2_lora model using the xTuring library, while facing this error. (batch size is 1). I am working on a cluster having 1 Worker (28 GB Memory, 4 Cores) and 1 Driver (110 GB Memory, 16 Cores). I am facing this error: OutOfMe...
I upgraded Tensorflow on Databricks notebook using %pip command. Now when running the training job, I get this error: "DNN library initialization failed."
Hello! I am fairly new to Databricks. I'm trying to do a proof of concept with AutoML in Databricks at my organization, and the dataset I am using is a project management dataset. Here's a sample: project_idmarketgeneral_contractorproject_typepermit_...
Hello! I am fairly new to Databricks. I'm trying to do a proof of concept with AutoML in Databricks at my organization, and the dataset I am using is a project management dataset. Here's a sample: project_idmarketgeneral_contractorproject_typepermit_...
Hello,I am trying to complete the exercises of the course "Scalable Machine Learning with Apache Spark" using Databricks Community Edition, but when I run the Lab Setup I get the following error:HTTPError: 503 Server Error: Service Unavailable for ur...
I'm experiencing the same issue while using community edition for this classroom: https://github.com/databricks-academy/large-language-models. What subscription level do I upgrade to?
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.