Topics with Label: Machine Learning

Forum Posts

by MCosta • New Contributor III

08-20-2021 10:23:46 AM

11809 Views
10 replies
19 kudos

Resolved! Debugging!

Hi ML folks, We are using Databricks to train deep learning models. The code, however, has a complex structure of classes. This would work fine in a perfect bug-free world like Alice in Wonderland. Debugging in Databricks is awkward. We ended up do...

Data Engineering

Latest Reply

petern
New Contributor II

03-04-2024 1:06:47 PM

19 kudos

Has this been solved yet? Is there a mature way to debug code on Databricks? I'm running into the same kind of issue. The variable explorer and pdb can be used, but it's not really the same.

9 More Replies

by PHorniak • New Contributor II

04-30-2019 4:20:35 PM

17386 Views
3 replies
4 kudos

Resolved! AttributeError: 'DataFrame' object has no attribute 'rename'

Hello, I am doing the Data Science and Machine Learning course. The Boston housing has unintuitive column names. I want to rename them, e.g. so 'zn' becomes 'Zoning'. When I run this command: df_bostonLegible = df_boston.rename({'zn':'Zoning'}, axi...

Data Engineering

Latest Reply

KrunalLathiya
New Contributor II

01-02-2024 2:50:36 AM

4 kudos

If df_boston is a DataFrame but you still face issues, try an alternative syntax: df_boston = df_boston.rename(columns={'zn': 'Zoning'}). Make sure df_boston is a proper pandas DataFrame and that you're using a recent version of pandas.
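A minimal sketch of the rename syntax suggested in this reply, assuming df_boston is a pandas DataFrame. The toy values and the second column ("rm") are made up for illustration and are not taken from the course dataset. If df_boston is actually a Spark DataFrame, which would be consistent with the AttributeError in the title, rename is not available on it and df_boston.withColumnRenamed('zn', 'Zoning') is the likely equivalent.

```python
import pandas as pd

# Toy stand-in for the Boston housing data; the values are illustrative only.
df_boston = pd.DataFrame({"zn": [18.0, 0.0, 12.5], "rm": [6.575, 6.421, 7.185]})

# rename returns a new DataFrame and leaves the original unchanged.
df_boston_legible = df_boston.rename(columns={"zn": "Zoning"})

print(list(df_boston_legible.columns))
```

Passing the mapping through the columns keyword avoids needing the axis argument at all.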

2 More Replies

by varunsaagar • New Contributor III

01-13-2023 6:01:18 AM

10248 Views
17 replies
28 kudos

Request for reattempt voucher. Databricks Certified Machine Learning Professional exam

Hi, On December 28th, I attempted the Databricks Certified Machine Learning Professional exam for the first time. Unfortunately, I ended up with a failing grade. The passing grade was 70%, and I received 68.33%. I am planning to reattempt the exam. Could you kindl...

Data Engineering

Latest Reply

girl_chan
New Contributor II

05-04-2023 6:52:55 AM

28 kudos

What is the next event where they will give a voucher?

16 More Replies

by Anonymous • Not applicable

04-10-2023 6:21:27 PM

2298 Views
2 replies
2 kudos

Hello Everyone, I'm interested to learn about the certifications you're pursuing to enhance your skills. Sharing your goals can inspire those ...

Hello Everyone, I'm interested to learn about the certifications you're pursuing to enhance your skills. Sharing your goals can inspire those who may have started their certification journey but struggled with motivation. Personally, I recently comple...

Data Engineering

Latest Reply

FJ
Contributor III

04-16-2023 7:32:24 PM

2 kudos

I'm attempting the Data Engineering professional exam at the end of the month. It's like a shot in the dark because no practice exams are available and, from what I've seen online from people who already passed it, the Advanced Data Engineering with ...

1 More Replies

by johnb1 • Contributor

03-30-2023 1:28:30 AM

2635 Views
3 replies
0 kudos

Cluster Configuration for ML Model Training

Hi! I am training a Random Forest (pyspark.ml.classification.RandomForestClassifier) on Databricks with 1,000,000 training examples and 25 features. I employ a cluster with one driver (16 GB Memory, 4 Cores), 2-6 workers (32-96 GB Memory, 8-24 Cores),...

Data Engineering

Latest Reply

Anonymous
Not applicable

03-31-2023 7:11:15 PM

0 kudos

Hi @John B, hope everything is going great. Just wanted to check in to see if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us so we can...

2 More Replies

by Rishabh-Pandey • Esteemed Contributor

03-09-2023 8:19:11 AM

1620 Views
2 replies
5 kudos

"Hey everyone, it seems like there's some confusion about enhanced autoscaling in Databricks lately. If you're feeling lost or unsure abo...

"Hey everyone, it seems like there's some confusion about enhanced autoscaling in Databricks lately. If you're feeling lost or unsure about how it works, don't worry - you're not" Enhanced autoscaling is a feature in Databricks that enables dynamic sc...

Data Engineering

Latest Reply

Ajay-Pandey
Esteemed Contributor III

03-09-2023 8:07:41 PM

5 kudos

Very informative. Thanks for sharing!

1 More Replies

by Deiry • New Contributor III

11-15-2022 1:53:55 PM

989 Views
1 replies
3 kudos

Hi I'm Deiry 😊 I'm 25 (almost 26) years old, I'm a Databricks expert 😎 Or at least that's my goal. I work at Celerik....

Hi, I'm Deiry. I'm 25 (almost 26) years old. I'm a Databricks expert, or at least that's my goal. I work at Celerik. My goal is to be a certified Machine Learning professional, so here we go.

Data Engineering

Latest Reply

NhatHoang
Valued Contributor II

11-15-2022 11:35:20 PM

3 kudos

Very confident, go ahead. :D

by Sri_H • New Contributor III

07-27-2022 7:53:09 AM

1848 Views
2 replies
1 kudos

Databricks Academy - Access to training recording attended during Data & AI Summit 2022

Hi All, I attended a 2-day ML training during the Data & AI Summit 2022 and received an email from the events team (ataaisummit@typeaevents.com) saying that the recordings for the training and related material will be available in my Databricks Academy...

Data Engineering

Latest Reply

Anonymous
Not applicable

08-03-2022 11:10:43 AM

1 kudos

Hi @Sri H ! I am checking on this for you - hang tight! I'll try and get an update asap from the Academy Team.

1 More Replies

by sannycse • New Contributor II

03-30-2022 11:54:53 AM

4042 Views
4 replies
6 kudos

Resolved! read the csv file as shown in description

Data Engineering

Latest Reply

User16764241763
Honored Contributor

04-13-2022 8:56:47 AM

6 kudos

@SANJEEV BANDRU You can simply do this. Just change the file path:
CREATE TEMPORARY VIEW readcsv USING CSV OPTIONS (path "dbfs:/docs/test.csv", header "true", delimiter "|", mode "FAILFAST");
select ProjectNo, collect_list(EmployeeNo) Employees from re...

3 More Replies

by adnanzak • New Contributor II

02-20-2022 5:43:29 PM

3132 Views
3 replies
0 kudos

Resolved! Deploy Databricks Machine Learning Models On Power BI

Hi Guys. I've implemented a Machine Learning model on Databricks and have registered it with a Model URL. I wanted to enquire if I could use this model on Power BI. Basically the model predicts industries based on client demographics. Ideally I would...

Data Engineering

Latest Reply

adnanzak
New Contributor II

02-21-2022 2:36:20 PM

0 kudos

Thank you @Werner Stinckens and @Joseph Kambourakis for your replies.

2 More Replies

by MichaelO • New Contributor III

01-28-2022 1:49:44 PM

13014 Views
2 replies
2 kudos

Resolved! Transfer files saved in filestore to either the workspace or to a repo

I built a machine learning model:
    lr = LinearRegression()
    lr.fit(X_train, y_train)
which I can save to the filestore by:
    filename = "/dbfs/FileStore/lr_model.pkl"
    with open(filename, 'wb') as f:
        pickle.dump(lr, f)
Ideally, I wanted to save the model ...

Data Engineering

Latest Reply

Hubert-Dudek
Esteemed Contributor III

02-01-2022 7:25:47 AM

2 kudos

Workspace and Repos are not fully available via DBFS, as they have separate access rights. It is better to use MLflow for your models, as it is like Git but for ML. I think that with MLOps you can then also put your model into Git.

1 More Replies

by NextIT • New Contributor

01-15-2022 10:16:07 PM

784 Views
0 replies
0 kudos

www.nextitvision.com

Data Engineering

by Joseph_B • Databricks Employee

12-20-2021 9:03:12 AM

2051 Views
1 replies
0 kudos

How should I tune hyperparameters when fitting models for every item?

My dataset has an "item" column which groups the rows into many groups. (Think of these groups as items in a store.) I want to fit 1 ML model per group. Should I tune hyperparameters for each group separately? Or should I tune them for the entire...

Data Engineering

Latest Reply

Joseph_B
Databricks Employee

12-20-2021 9:31:16 AM

0 kudos

For the first question ("which option is better?"), you need to answer that via your understanding of the problem domain. Do you expect similar behavior across the groups (items)? If so, that's a +1 in favor of sharing hyperparameters. And vice versa....

by Joseph_B • Databricks Employee

06-24-2021 1:29:49 PM

1977 Views
1 replies
0 kudos

How can I use Databricks to "automagically" distribute scikit-learn model training?

Is there a way to automatically distribute training and model tuning across a Spark cluster, if I want to keep using scikit-learn?

Data Engineering

Latest Reply

Joseph_B
Databricks Employee

06-24-2021 1:42:11 PM

0 kudos

It depends on what you mean by "automagically." If you want to keep using scikit-learn, there are ways to distribute parts of training and tuning with minimal effort. However, there is no "magic" way to distribute training an individual model in scik...

by User15787040559 • Databricks Employee

06-22-2021 3:31:30 PM

3549 Views
1 replies
0 kudos

What's the difference between Normalization and Standardization?

Normalization typically means rescaling the values into the range [0, 1]. Standardization typically means rescaling the data to have a mean of 0 and a standard deviation of 1 (unit variance).
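The two definitions can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the helper names normalize and standardize and the sample data are hypothetical, and the population standard deviation is used for standardization (using the sample standard deviation would be an equally valid choice).

```python
from statistics import mean, pstdev


def normalize(values):
    """Min-max scaling: map the values linearly onto the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("min-max scaling is undefined for constant data")
    return [(v - lo) / (hi - lo) for v in values]


def standardize(values):
    """Z-score scaling: subtract the mean, divide by the population std."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        raise ValueError("standardization is undefined for constant data")
    return [(v - mu) / sigma for v in values]


data = [2.0, 4.0, 6.0, 8.0]  # made-up sample values
normalized = normalize(data)      # smallest value maps to 0, largest to 1
standardized = standardize(data)  # result has mean 0 and std 1

print(normalized)
print(standardized)
```

In practice, scikit-learn's MinMaxScaler and StandardScaler apply the same two transformations column by column.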

Data Engineering

Latest Reply

User16826994223
Honored Contributor III

06-22-2021 10:37:08 PM

0 kudos

Normalization typically means rescaling the values into the range [0, 1]. Standardization typically means rescaling the data to have a mean of 0 and a standard deviation of 1 (unit variance). A link that explains it better: https://towardsdatascience.com...
