On 4/9/15 7:47 PM, Simon Riggs wrote:
> Having a function-based implementation allows stratified sampling or
> other approaches suited directly to the user's data.
How would you implement stratified sampling with this function
interface? You'd need to pass the stratification criteria into the
function somehow. But those would be column names or expressions.
> I don't think it's reasonable to force all methods to offer both limits
> on numbers of rows or percentages. They may not be applicable.
Examples?
In a stratified sample I would still ask for X percent from each stratum
or Y rows from each stratum.
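
To make that concrete, here is a rough sketch in Python (not PostgreSQL code; the name stratified_sample and its parameters are made up for illustration). It assumes the stratification criterion can be handed in as a key expression, and it shows that both "X percent per stratum" and "Y rows per stratum" remain meaningful limits:

```python
import random
from collections import defaultdict

def stratified_sample(rows, key, percent=None, count=None, seed=None):
    """Draw a sample from each stratum of `rows`.

    `key` maps a row to its stratum (the "stratification criteria").
    Exactly one of `percent` (0-100) or `count` must be given.
    Hypothetical helper for illustration only.
    """
    if (percent is None) == (count is None):
        raise ValueError("specify exactly one of percent or count")

    rng = random.Random(seed)

    # Group the rows by stratum.
    strata = defaultdict(list)
    for row in rows:
        strata[key(row)].append(row)

    sample = []
    for members in strata.values():
        if percent is not None:
            # X percent of this stratum, rounded to a whole row count.
            n = round(len(members) * percent / 100)
        else:
            # Y rows from this stratum, or all of them if it is smaller.
            n = min(count, len(members))
        sample.extend(rng.sample(members, n))
    return sample

rows = ([{"region": "a", "id": i} for i in range(100)]
        + [{"region": "b", "id": i} for i in range(20)])

by_percent = stratified_sample(rows, key=lambda r: r["region"], percent=10, seed=1)
by_count = stratified_sample(rows, key=lambda r: r["region"], count=5, seed=1)
```

Note that the key argument is exactly the piece the function interface has no obvious way to receive: it is a column name or expression, not a number.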