Topics with Label: Pandas dataframe

Forum Posts

by halfwind22 • New Contributor III

10-11-2021 1:42:37 AM

6874 Views
11 replies
12 kudos

Resolved! Unable to write CSV files to Azure Blob using pandas to_csv()

I am using a Python function to read some data from a GET endpoint and write it as a CSV file to an Azure Blob location. My GET endpoint takes 2 query parameters, param1 and param2. So initially, I have a dataframe paramDf that has two columns param1 and ...

Data Engineering

Latest Reply

halfwind22
New Contributor III

10-12-2021 6:38:33 AM

12 kudos

@Hubert Dudek I can't issue a Spark command on an executor node; it throws an error because foreach distributes the processing.

10 More Replies
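
The accepted solution is not visible in this listing. As a minimal sketch of one pattern consistent with the reply above, the parameter rows can be looped over on the driver with plain pandas rather than inside a distributed foreach. The helper names `fetch_rows` and `write_all` are hypothetical, the GET call is stubbed out, and a temporary local directory stands in for the Azure Blob location.

```python
import tempfile
from pathlib import Path

import pandas as pd


def fetch_rows(param1, param2):
    """Hypothetical stand-in for the GET call; returns a list of records."""
    return [{"param1": param1, "param2": param2, "value": f"{param1}-{param2}"}]


def write_all(param_df, out_dir):
    """Write one CSV per parameter pair, iterating on the driver only."""
    out_dir = Path(out_dir)
    written = []
    for row in param_df.itertuples(index=False):
        data = pd.DataFrame(fetch_rows(row.param1, row.param2))
        target = out_dir / f"{row.param1}_{row.param2}.csv"
        data.to_csv(target, index=False)  # plain pandas write, no Spark call
        written.append(target)
    return written


# Toy parameter dataframe (assumed shape: two columns, param1 and param2).
param_df = pd.DataFrame({"param1": ["a", "b"], "param2": [1, 2]})

with tempfile.TemporaryDirectory() as tmp:
    paths = write_all(param_df, tmp)
    names = sorted(p.name for p in paths)
    first = pd.read_csv(paths[0])
```

This only suits a parameter dataframe small enough to process sequentially; the target path would need to be replaced with whatever location the cluster can actually write to.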

by MartinB • Contributor III

09-11-2021 3:34:17 AM

6321 Views
4 replies
3 kudos

Resolved! Interoperability Spark ↔ Pandas: can't convert Spark dataframe to Pandas dataframe via df.toPandas() when it contains datetime value in distant future

Hi, I have multiple datasets in my data lake that feature valid_from and valid_to columns indicating the validity of rows. If a row is currently valid, this is indicated by valid_to=9999-12-31 00:00:00. Example: Loading this into a Spark dataframe works fine...

Data Engineering

Latest Reply

shan_chandra
Honored Contributor III

10-06-2021 7:42:15 AM

3 kudos

Currently, out-of-bounds timestamps are not supported in PyArrow/pandas. Please refer to the associated JIRA issue below. https://issues.apache.org/jira/browse/ARROW-5359?focusedCommentId=17104355&page=com.atlassian.jira.plugin.system.issuetabpanels%3...

3 More Replies
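
For context, a 64-bit nanosecond timestamp cannot represent dates later than 2262-04-11, which is why a sentinel such as 9999-12-31 is out of bounds. The sketch below shows one possible workaround that is not taken from the thread: replace out-of-range values with a null before conversion. The helper name `to_pandas_safe` and the choice of `None` as the replacement are assumptions; in Spark the same idea would be applied to the column before calling toPandas().

```python
from datetime import datetime

# Upper limit of a 64-bit nanosecond timestamp (numpy datetime64[ns]),
# truncated to microseconds because datetime cannot hold nanoseconds.
NS_TIMESTAMP_MAX = datetime(2262, 4, 11, 23, 47, 16, 854775)


def to_pandas_safe(value):
    """Return value unchanged if it fits nanosecond precision, else None."""
    if value is None or value > NS_TIMESTAMP_MAX:
        return None
    return value


# Toy valid_to values: one ordinary date and the far-future sentinel.
valid_to = [datetime(2021, 9, 11), datetime(9999, 12, 31)]
cleaned = [to_pandas_safe(v) for v in valid_to]
```

Mapping the sentinel to a null changes its meaning from "valid forever" to "unknown", so downstream logic has to treat a missing valid_to as open-ended.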

by brij • New Contributor III

09-23-2021 2:11:13 AM

3180 Views
8 replies
3 kudos

Resolved! Databricks snowflake dataframe.toPandas() taking more space and time

I have two identical tables (same rows and schema). One table resides in an Azure SQL Server database and the other is in a Snowflake database. We have some existing code that we want to migrate from Azure SQL to Snowflake, but when we try to create a panda...

Data Engineering

Latest Reply

Anonymous
Not applicable

10-01-2021 12:17:50 PM

3 kudos

@Brijan Elwadhi - That's wonderful. Thank you for sharing your solution.

7 More Replies

by Jack • New Contributor II

09-14-2021 11:07:01 AM

1374 Views
1 reply
0 kudos

Resolved! Creating Pandas Data Frame of Features After Applying Variance Reduction

I am building a classification model using the following data frame of 120,000 records (sample of 5 records shown): Using this data, I have built the following model: from sklearn.model_selection import train_test_split from sklearn.feature_extraction....

Data Engineering

Latest Reply

Dan_Z
Honored Contributor

09-14-2021 11:28:14 AM

0 kudos

This is more of a scikit-learn question than a Databricks question. But poking around, I think VT_reduced.get_support() is probably what you are looking for: https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.VarianceThreshold....
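
A minimal sketch of how get_support() might be used to rebuild a labelled data frame after variance thresholding. The toy columns `a`, `b`, `c` and the variable names are hypothetical; the original poster's data and fitted object are not shown in this listing.

```python
import pandas as pd
from sklearn.feature_selection import VarianceThreshold

# Toy feature matrix: column "a" is constant, so its variance is zero.
X = pd.DataFrame({"a": [0, 0, 0, 0], "b": [1, 2, 3, 4], "c": [5, 5, 6, 6]})

selector = VarianceThreshold(threshold=0.0)
reduced = selector.fit_transform(X)  # array without the dropped columns

# get_support() returns a boolean mask over the original columns.
kept = X.columns[selector.get_support()]

# Reattach the surviving column names and the original index.
X_reduced = pd.DataFrame(reduced, columns=kept, index=X.index)
```

The mask comes from the fitted selector, so the same `kept` labels can be reused for any later matrix transformed by that selector.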

by SindhuG • New Contributor

07-29-2021 11:55:03 AM

592 Views
1 reply
0 kudos

Hi All, I need to extract rows of dates from a dataframe based on a list of values (e.g. dates) located in a CSV file. Can anyone please help me? I have tried the groupby function but am not able to get the expected result. Thanks in advance.

My dataframe looks like this. df = Date column2 column3 Machine 1-jan-2020 A 2-jan-2020 --- A 18-jan-2020 A 11-jan-2020 B 12-jan-2020 B 6-feb-2020 C 7-feb-2020 --- C 14-feb-2020 C. The date details CSV file looks like this: D = Machine Selected Date A 15-jan-2020 C 12-f...

Data Engineering

Latest Reply

Kaniz
Community Manager

09-02-2021 5:54:30 AM

0 kudos

Hi @SindhuG! My name is Kaniz, and I'm a technical moderator here. Great to meet you, and thanks for your question! Let's see if your peers on the Forum have an answer to your question first. Otherwise, I will follow up shortly with a response.
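
The question's expected output is cut off in this listing, so the following is only a sketch under an assumed rule: for each machine listed in the lookup file, keep the rows dated on or before that machine's selected date. The sample rows are abbreviated from the question, the selected date for machine C is made up for illustration, and the lookup is built inline rather than read from a CSV file. A merge is used here instead of groupby.

```python
import pandas as pd

# Abbreviated sample of the main dataframe (Date and Machine columns only).
df = pd.DataFrame({
    "Date": ["1-jan-2020", "18-jan-2020", "11-jan-2020", "6-feb-2020", "14-feb-2020"],
    "Machine": ["A", "A", "B", "C", "C"],
})

# Lookup of selected dates per machine; the value for C is hypothetical.
selected = pd.DataFrame({
    "Machine": ["A", "C"],
    "Selected Date": ["15-jan-2020", "10-feb-2020"],
})

# Parse the day-month-year strings into real datetimes before comparing.
df["Date"] = pd.to_datetime(df["Date"], format="%d-%b-%Y")
selected["Selected Date"] = pd.to_datetime(selected["Selected Date"], format="%d-%b-%Y")

# Attach each machine's selected date, dropping machines not in the lookup.
merged = df.merge(selected, on="Machine", how="inner")

# Assumed rule: keep rows on or before the selected date.
result = merged[merged["Date"] <= merged["Selected Date"]]
kept_dates = result["Date"].dt.strftime("%Y-%m-%d").tolist()
```

If the intended rule is different (for example, an exact match or the nearest date), only the final comparison needs to change.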
