How to optimize sample in active learning: Dispersion, an optimum criterion for classification ?
Comment optimiser les exemples en apprentissage actif : la dispersion, un critère optimal en classification ?
Résumé
We want generate learning data appropriated to classification problems. First, we show that theorical results about low discrepancy sequences in regression problems are not adequate for classification problems. Then, we show with theorical and experimental arguments that minimising the dispersion of the sample is a relevant strategy to optimize performance of classification learning.