Topics with Label: Pandas

Forum Posts

Sorted by:

by DiCamps • New Contributor II

09-29-2022 4:25:15 AM

1832 Views
1 replies
3 kudos

Resolved! Installing pyspark.pandas

Hello guys,I'm trying to migrate a python project from Pandas to Pandas API on Spark, on Azure Databricks using MLFlow on a conda env.The thing is I'm getting the next error:Traceback (most recent call last): File "/databricks/mlflow/projects/x/data_...

Machine Learning

1832 Views
1 replies
3 kudos

09-29-2022 4:25:15 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

09-29-2022 5:40:36 AM

3 kudos

it should be yes.can you elaborate on how you create your notebook (and the conda env you talk about)?

3 kudos

09-29-2022 5:40:36 AM

by Vik1 • New Contributor II

01-21-2022 9:16:42 AM

2404 Views
4 replies
2 kudos

Resolved! Cluster setup for ML work for Pandas in Spark, and vanilla Python.

My setup:Worker type: Standard_D32d_v4, 128 GB Memory, 32 Cores, Min Workers: 2, Max Workers: 8Driver type: Standard_D32ds_v4, 128 GB Memory, 32 CoresDatabricks Runtime Version: 10.2 ML (includes Apache Spark 3.2.0, Scala 2.12)I ran a snowflake quer...

Machine Learning

2404 Views
4 replies
2 kudos

01-21-2022 9:16:42 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-22-2022 7:23:05 AM

2 kudos

Hey there @Vivek Ranjan Checking in. If Joseph's answer helped, would you let us know and mark the answer as best? It would be really helpful for the other members to find the solution more quickly.Thanks!

2 kudos

04-22-2022 7:23:05 AM

3 More Replies

by Dan_Z • Honored Contributor

10-22-2021 9:06:35 AM

324 Views
0 replies
0 kudos

spark.apache.org

mapInPandas is one of the most powerful Spark functions. It uses an arrow-like in-memory data structure to split up Spark Data Frames into chunks and feeding them to a function that takes a Pandas DF as input and output. Check it out here:https://spa...

Machine Learning

324 Views
0 replies
0 kudos

10-22-2021 9:06:35 AM