A Simulated Retest Method for Estimating Classification Reliability