Weighted Empirical Risk Minimization: Transfer Learning based on Importance Sampling - Institut Polytechnique de Paris Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Weighted Empirical Risk Minimization: Transfer Learning based on Importance Sampling

Résumé

We consider statistical learning problems, when the distribution P of the training observations Z 1 ,. .. , Z n differs from the distribution P involved in the risk one seeks to minimize (referred to as the test distribution) but is still defined on the same measurable space as P and dominates it. In the unrealistic case where the likelihood ratio Φ(z) = dP/dP (z) is known, one may straightforwardly extends the Empirical Risk Minimization (ERM) approach to this specific transfer learning setup using the same idea as that behind Importance Sampling, by minimizing a weighted version of the empirical risk functional computed from the 'biased' training data Z i with weights Φ(Z i). Although the importance function Φ(z) is generally unknown in practice, we show that, in various situations frequently encountered in practice, it takes a simple form and can be directly estimated from the Z i 's and some auxiliary information on the statistical population P. By means of linearization techniques, we then prove that the generalization capacity of the approach aforementioned is preserved when plugging the resulting estimates of the Φ(Z i)'s into the weighted empirical risk. Beyond these theoretical guarantees, numerical results provide strong empirical evidence of the relevance of the approach promoted in this article.
Fichier principal
Vignette du fichier
ES2020-120.pdf (1.42 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03559387 , version 1 (06-02-2022)

Identifiants

  • HAL Id : hal-03559387 , version 1

Citer

Stéphan Clémençon, Robin Vogel, Mastane Achab, Charles Tillier. Weighted Empirical Risk Minimization: Transfer Learning based on Importance Sampling. 28th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning ( ESANN 2020), 2020, Bruges (on line), Belgium. ⟨hal-03559387⟩
42 Consultations
49 Téléchargements

Partager

Gmail Facebook X LinkedIn More