Why a new project instead of putting this in Apache Spark itself

User16826994223 — Fri, 25 Jun 2021 12:42:34 GMT

Re: Why a new project instead of putting this in Apache Spark itself

User16826994223 — Fri, 25 Jun 2021 12:42:58 GMT

1. We want a venue in which we can rapidly iterate and make new releases. The overhead of making a release as a separate project is minuscule (in the order of minutes). A release on Spark takes a lot longer (in the order of days)

2. Koalas takes a different approach that might contradict Spark’s API design principles, and those principles cannot be changed lightly given the large user base of Spark. A new, separate project provides an opportunity for us to experiment with new design principles.

topic Re: Why a new project instead of putting this in Apache Spark itself in Data Engineering

Why a new project instead of putting this in Apache Spark itself

Re: Why a new project instead of putting this in Apache Spark itself