Any tips if the data is more than a few GB? Obviously the concern is that a call to coalesce will bring all the data into driver memory.