Machine Learning

by Kaan • New Contributor

02-02-2023 7:16:24 AM

886 Views
1 replies
1 kudos

Resolved! Using databricks in multi-cloud, and querying data from the same instance.

I'm looking for a good product to use across two clouds at once for Data Engineering, Data modeling and governance. I currently have a GCP platform, but most of my data and future data goes through Azure, and currently is then transfered to GCS/BQ.Cu...

Machine Learning

Reply

886 Views
1 replies
1 kudos

02-02-2023 7:16:24 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-31-2023 9:02:46 AM

1 kudos

@Karl Andrén :Databricks is a great option for data engineering, data modeling, and governance across multiple clouds. It supports integrations with multiple cloud providers, including Azure, AWS, and GCP, and provides a unified interface to access ...

1 kudos

03-31-2023 9:02:46 AM

by Hubert-Dudek • Esteemed Contributor III

03-26-2023 1:03:54 PM

485 Views
1 replies
7 kudos

Have you heard about databricks latest open-source language model called Dolly? It’s a ChatGPT like model that uses the tatsu-lab/alpaca dataset with ...

Have you heard about databricks latest open-source language model called Dolly? It’s a ChatGPT like model that uses the tatsu-lab/alpaca dataset with examples of questions and answers. To train Dolly, you can combine this dataset (simple solution on ...

Machine Learning

Reply

485 Views
1 replies
7 kudos

03-26-2023 1:03:54 PM

View Replies

Latest Reply

Anonymous
Not applicable

03-31-2023 8:25:24 AM

7 kudos

Thanks for posting this! I am so excited about the possibilities that this can do for us. It's an exciting development in the natural language processing field, and it has the potential to be a valuable tool for businesses looking to implement chatb...

7 kudos

03-31-2023 8:25:24 AM

by alisher_pwc • New Contributor II

03-03-2023 6:13:31 AM

1579 Views
2 replies
1 kudos

Model serving with GPU cluster

Hello Databricks community!We are facing a strong need of serving some of public and our private models on GPU clusters and we have several requirements:1) We'd like to be able to start/stop the endpoints (best with scheduling) to avoid excess consum...

Machine Learning

Reply

1579 Views
2 replies
1 kudos

03-03-2023 6:13:31 AM

View Replies

Latest Reply

Vartika
Moderator

03-31-2023 12:35:12 AM

1 kudos

Hi @Alisher Akh Does @Debayan Mukherjee's answer help? If yes, would you be happy to mark the answer as best so that other members can find the solution more quickly? If not, please tell us so we can help you further. Cheers!

1 kudos

03-31-2023 12:35:12 AM

1 More Replies

by fuselessmatt • Contributor

03-01-2023 1:07:18 AM

7210 Views
5 replies
6 kudos

Resolved! What does "Command exited with code 50 mean" and how do you solve it?

Hi!We have this dbt model that generates a table with user activity in the previous days, but we get this vague error message in the Databricks SQL Warehouse.Job aborted due to stage failure: Task 3 in stage 4267.0 failed 4 times, most recent failure...

Machine Learning

Reply

7210 Views
5 replies
6 kudos

03-01-2023 1:07:18 AM

View Replies

Latest Reply

shan_chandra
Honored Contributor III

03-29-2023 10:11:30 AM

6 kudos

@Mattias P - For the executor lost failure, is it trying to bring in large data volume? can you please reduce the date range and try? or run the workload on a bigger DBSQL warehouse than the current one.

6 kudos

03-29-2023 10:11:30 AM

4 More Replies

by Ajay-Pandey • Esteemed Contributor III

01-31-2023 4:39:02 AM

838 Views
2 replies
5 kudos

Share information between tasks in a Databricks job You can use task values to pass arbitrary parameters between tasks in a Databricks job. You pass ...

Share information between tasks in a Databricks jobYou can use task values to pass arbitrary parameters between tasks in a Databricks job. You pass task values using the taskValues subutility in Databricks Utilities. The taskValues subutility provide...

Machine Learning

Reply

838 Views
2 replies
5 kudos

01-31-2023 4:39:02 AM

View Replies

Latest Reply

newforesee
New Contributor II

03-30-2023 1:51:30 AM

5 kudos

We urgently hope for this feature, but to date, we have found that it is only available in Python. Do you have any plans to support Scala?

5 kudos

03-30-2023 1:51:30 AM

1 More Replies

by apatel • New Contributor III

03-28-2023 11:30:18 AM

5246 Views
2 replies
0 kudos

Resolved! How to resolve this error "Error: cannot create global init script: default auth: cannot configure default credentials"

I'm trying to set the global init script via my Terraform deployment. I did a thorough google search and can't seem to find guidance here.I'm using a very generic call to set these scripts in my TF Deployment.terraform { required_providers { data...

Machine Learning

Reply

5246 Views
2 replies
0 kudos

03-28-2023 11:30:18 AM

View Replies

Latest Reply

apatel
New Contributor III

03-28-2023 6:05:22 PM

0 kudos

Ok in case this helps anyone else, I've managed to resolve.I confirmed in this documentation the databricks CLI is required locally, wherever this is being executed. https://learn.microsoft.com/en-us/azure/databricks/dev-tools/terraform/cluster-note...

0 kudos

03-28-2023 6:05:22 PM

1 More Replies

by Koliya • New Contributor II

12-21-2022 6:47:38 PM

10696 Views
5 replies
7 kudos

The Python process exited with exit code 137 (SIGKILL: Killed). This may have been caused by an OOM error. Check your command's memory usage.

I am running a hugging face model on a GPU cluster (g4dn.xlarge, 16GB Memory, 4 cores). I run the same model in four different notebooks with different data sources. I created a workflow to run one model after the other. These notebooks run fine indi...

Machine Learning

Reply

10696 Views
5 replies
7 kudos

12-21-2022 6:47:38 PM

View Replies

Latest Reply

fkemeth
New Contributor II

03-27-2023 2:12:49 AM

7 kudos

You might accumulate gradients when running your Huggingface model, which typically leads to out-of-memory errors after some iterations. If you use it for inference only, dowith torch.no_grad(): # The code where you apply the model

7 kudos

03-27-2023 2:12:49 AM

4 More Replies

by Tilo • New Contributor

03-20-2023 8:20:43 AM

1785 Views
3 replies
3 kudos

Resolved! MLFlow: How to load results from model and continue training

I'd like to continue / finetune training of an existing keras/tensorflow model. We use MLFlow to store the model. How can I load the wieght from an existing model to the model and continue "fit" preferable with a different learning rate.Just loading ...

Machine Learning

Reply

1785 Views
3 replies
3 kudos

03-20-2023 8:20:43 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-26-2023 11:29:26 PM

3 kudos

Hi @Tilo Wünsche Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thank...

3 kudos

03-26-2023 11:29:26 PM

2 More Replies

by NSRBX • Contributor

10-06-2022 6:38:43 AM

2511 Views
6 replies
6 kudos

Resolved! Error loading model from mlflow: java.io.StreamCorruptedException: invalid type code: 00

Hello,I'm using, in my IDE, Databricks Connect version 9.1LTS ML to connect to a databricks cluster with spark version 3.1 and download a spark model that's been trained and saved using mlflow.So it seems like it's able to find a copy the model, but ...

Machine Learning

Reply

2511 Views
6 replies
6 kudos

10-06-2022 6:38:43 AM

View Replies

Latest Reply

NSRBX
Contributor

10-17-2022 4:41:34 AM

6 kudos

Hi @Kaniz Fatma and @Shanmugavel Chandrakasu,It works after putting hadoop.dll into C:\Windows\System32 folder.I have hadoop version 3.3.1.I already had winutils.exe in the Hadoop bin folder.RegardsNath

6 kudos

10-17-2022 4:41:34 AM

5 More Replies

by Mike_sb • New Contributor III

03-15-2023 9:26:11 PM

1610 Views
7 replies
4 kudos

Resolved! I can see and run the schemas from data explorer, but don't see them in sql editor, is there something I can do to fix this?

Machine Learning

Reply

1610 Views
7 replies
4 kudos

03-15-2023 9:26:11 PM

View Replies

Latest Reply

Amit_352107
New Contributor III

03-23-2023 12:19:18 AM

4 kudos

Hi @Mike M Kindly clear cache, and your issue will be resolved

4 kudos

03-23-2023 12:19:18 AM

6 More Replies

by bruno_valero • New Contributor II

02-09-2023 12:06:18 PM

3263 Views
2 replies
1 kudos

How to download a .csv or .pkl file from databricks?

When I save files on "dbfs:/FileStore/shared_uploads/brunofvn6@gmail.com/", it doesn't appear anywhere in my workspace. I've tried to copy the path of the workspace with the right mouse button, pasted on ("my pandas dataframe").to_csv('path'), but wh...

Machine Learning

Reply

3263 Views
2 replies
1 kudos

02-09-2023 12:06:18 PM

View Replies

Latest Reply

bruno_valero
New Contributor II

02-09-2023 1:59:02 PM

1 kudos

I think I discover how to do this. Is in the label called data in the left menu of the databricks environment, in the top left of the menu there are two labels "Database Tables" and "DBFS" in which "Database Table" is the default label. So it is just...

1 kudos

02-09-2023 1:59:02 PM

1 More Replies

by Data_Analytics1 • Contributor III

02-14-2023 9:59:54 PM

664 Views
1 replies
2 kudos

Resolved! File not found error. Does OPTIMIZE deletes initial versions of the delta table?

df = (spark.readStream.format("delta")\ .option("readChangeFeed", "true")\ .option("startingVersion", 1)\ .table("CatalogName.SchemaName.TableName") )display(df)A file referenced in the transaction l...

Machine Learning

Reply

664 Views
1 replies
2 kudos

02-14-2023 9:59:54 PM

View Replies

Latest Reply

swethaNandan
New Contributor III

03-22-2023 5:08:03 PM

2 kudos

Had you run vacuum on the table? Vacuum can clean up data files marked for removal and are older than retention period.Optimize compacts files and marks the small files for removal, but does not physically remove the data files

2 kudos

03-22-2023 5:08:03 PM

by User16461610613 • New Contributor II

03-15-2023 9:38:31 AM

1170 Views
1 replies
2 kudos

Free Databricks Training on AWS, Azure, or Google Cloud Good news! You can now access free, in-depth Databricks training on AWS, Azure or Google Cloud...

Free Databricks Training on AWS, Azure, or Google CloudGood news! You can now access free, in-depth Databricks training on AWS, Azure or Google Cloud. Our on-demand training series walks through how to:Streamline data ingest and management to build ...

Machine Learning

Reply

1170 Views
1 replies
2 kudos

03-15-2023 9:38:31 AM

View Replies

Latest Reply

jose_gonzalez
Moderator

03-22-2023 2:37:20 PM

2 kudos

Thank you for sharing this!!

2 kudos

03-22-2023 2:37:20 PM

by obiamaka • New Contributor III

03-02-2023 9:07:52 AM

2124 Views
6 replies
2 kudos

Resolved! Unable to view jobs in Databricks Workflow

This has been happening for about 2 days now. Any ideas on why this occurs and how it can be resolved. I attached a screenshot.

Machine Learning

Reply

2124 Views
6 replies
2 kudos

03-02-2023 9:07:52 AM

View Replies

Latest Reply

obiamaka
New Contributor III

03-22-2023 6:20:23 AM

2 kudos

This issue got resolved on it's own, but not sure what the problem was, probably a bug from a software update?

2 kudos

03-22-2023 6:20:23 AM

5 More Replies

by zachclem • New Contributor III

03-11-2023 8:52:54 AM

2172 Views
2 replies
1 kudos

Resolved! Logging model to MLflow using Feature Store API. Getting TypeError: join() argument must be str, bytes, or os.PathLike object, not 'dict'

I'm using databricks. Trying to log a model to MLflow using the Feature Store log_model function. but I have this error: TypeError: join() argument must be str, bytes, or os.PathLike object, not 'dict' I'am using the Databricks runtime ml (10.4 LTS M...

Machine Learning

Reply

2172 Views
2 replies
1 kudos

03-11-2023 8:52:54 AM

View Replies

Latest Reply

zachclem
New Contributor III

03-14-2023 1:56:14 PM

1 kudos

I updated by Databricks Run Time from 10.4 to 12.1 and this solved the issue.

1 kudos

03-14-2023 1:56:14 PM

1 More Replies

Databricks

Forum Posts

Resolved! Using databricks in multi-cloud, and querying data from the same instance.

Have you heard about databricks latest open-source language model called Dolly? It’s a ChatGPT like model that uses the tatsu-lab/alpaca dataset with ...

Model serving with GPU cluster

Resolved! What does "Command exited with code 50 mean" and how do you solve it?

Share information between tasks in a Databricks job You can use task values to pass arbitrary parameters between tasks in a Databricks job. You pass ...

Resolved! How to resolve this error "Error: cannot create global init script: default auth: cannot configure default credentials"

The Python process exited with exit code 137 (SIGKILL: Killed). This may have been caused by an OOM error. Check your command's memory usage.

Resolved! MLFlow: How to load results from model and continue training

Resolved! Error loading model from mlflow: java.io.StreamCorruptedException: invalid type code: 00

Resolved! I can see and run the schemas from data explorer, but don't see them in sql editor, is there something I can do to fix this?

How to download a .csv or .pkl file from databricks?

Resolved! File not found error. Does OPTIMIZE deletes initial versions of the delta table?

Free Databricks Training on AWS, Azure, or Google Cloud Good news! You can now access free, in-depth Databricks training on AWS, Azure or Google Cloud...

Resolved! Unable to view jobs in Databricks Workflow

Resolved! Logging model to MLflow using Feature Store API. Getting TypeError: join() argument must be str, bytes, or os.PathLike object, not 'dict'

pdb debugger on databricks

import ml.dmlc.xgboost4j.scala.spark.{XGBoostEstim...

Query ML Endpoint with R and Curl

'error_code': 'INVALID_PARAMETER_VALUE', 'message'...

AutoMl Dataset too large