Thanks, @Louis_Frolio! this is very helpful...

I especially liked the framing around the role of the raw layer. That makes the decision much clearer:

Option A: if Delta is enough as the recovery mechanism
Option B: if file-level auditability and reprocessing from original payloads are required

To me, that reinforces the idea that the Python Data Source API is mostly about standardizing extraction, auth, pagination, retries, and schema handling at the edge, not replacing ingestion architecture decisions.

Really appreciate you building on this.