Learning algorithms usually depend on one or several parameters that must be chosen carefully. In this talk we address the design of penalties for an optimal choice of such regularization parameters in non-parametric regression. First, we consider the problem of selecting among several linear estimators, which includes model selection for linear regression, the choice of a regularization parameter in kernel ridge regression or spline smoothing, and the choice of a kernel in multiple kernel learning. We propose a new penalization procedure that first consistently estimates the variance of the noise, based upon the concept of minimal penalty, previously introduced in the context of model selection. Plugging this variance estimate into Mallows' C_L penalty is then proved to yield an algorithm satisfying an oracle inequality. Second, when data are heteroscedastic, we show that dimensionality-based penalties are suboptimal for model selection in least-squares regression, so the shape of the penalty itself has to be estimated. Resampling is used to build penalties robust to heteroscedasticity, without requiring prior information on the noise level. For instance, V-fold penalization is shown to improve on V-fold cross-validation for a fixed computational cost.
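To make the first procedure concrete, here is a minimal sketch in Python of the two-step idea described in the abstract: estimate the noise variance from the minimal-penalty jump, then plug it into Mallows' C_L penalty to select among linear estimators. The function name, the grid of constants, the jump-detection rule, and the toy kernel-ridge example are all illustrative assumptions, not material from the talk; the penalty shape 2 tr(A) - tr(A Aᵀ) for a linear estimator with hat matrix A comes from the minimal-penalty literature for linear estimators.

```python
import numpy as np

def select_with_minimal_penalty(y, hat_matrices, grid=None):
    """Hypothetical sketch: minimal-penalty variance estimate + Mallows' C_L selection.

    hat_matrices: list of n x n matrices A such that the estimator is A @ y.
    Returns the index of the selected estimator and the variance estimate.
    """
    # Residual sum of squares and "degrees of freedom" tr(A) of each linear estimator.
    rss = np.array([np.sum((y - A @ y) ** 2) for A in hat_matrices])
    df = np.array([np.trace(A) for A in hat_matrices])
    # Minimal-penalty shape for a linear estimator: 2 tr(A) - tr(A A^T).
    pen_min = np.array([2 * np.trace(A) - np.trace(A @ A.T) for A in hat_matrices])

    if grid is None:
        grid = np.linspace(1e-3, 10.0, 500)  # candidate values of the constant C (assumed range)

    # Step 1: for each C, record the complexity selected by minimizing RSS + C * pen_min.
    # The selected df drops sharply when C crosses the noise variance; the location of
    # that jump is used as the variance estimate.
    selected_df = np.array([df[np.argmin(rss + C * pen_min)] for C in grid])
    jump = np.argmax(-np.diff(selected_df))  # largest drop in selected complexity
    sigma2_hat = grid[jump + 1]

    # Step 2: Mallows' C_L penalty with the plugged-in variance estimate.
    crit = rss + 2.0 * sigma2_hat * df
    return int(np.argmin(crit)), float(sigma2_hat)

# Toy usage: kernel ridge regression hat matrices for a grid of regularization parameters.
rng = np.random.default_rng(0)
n = 100
x = np.sort(rng.uniform(0, 1, n))
y = np.sin(4 * np.pi * x) + 0.3 * rng.standard_normal(n)
K = np.exp(-((x[:, None] - x[None, :]) ** 2) / 0.01)  # Gaussian kernel Gram matrix
hats = [np.linalg.solve(K + lam * np.eye(n), K) for lam in np.logspace(-6, 1, 30)]
best, sigma2 = select_with_minimal_penalty(y, hats)
```

The design choice here mirrors the abstract: the variance is never assumed known, it is read off from the minimal-penalty jump, and only then is the (optimal) Mallows-type penalty applied. In practice one would refine the grid and the jump-detection heuristic.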
Information
- Yannick Mahe (ymahe)
- Université Paris 1 Panthéon - Sorbonne (production)
- Sylvain Arlot (speaker)
- 21 July 2017 00:00
- Course / MOOC / SPOC
- English