Identifiability of Priors from Bounded Sample Sizes  with Applications to Transfer Learning

Liu Yang; Steve Hanneke; Jaime Carbonell

Identifiability of Priors from Bounded Sample Sizes with Applications to Transfer Learning

Liu Yang, Steve Hanneke, Jaime Carbonell

Proceedings of the 24th Annual Conference on Learning Theory, PMLR 19:789-806, 2011.

Abstract

We explore a transfer learning setting, in which a finite sequence of target concepts are sampled independently with an unknown distribution from a known family. We study the total number of labeled examples required to learn all targets to an arbitrary specified expected accuracy, focusing on the asymptotics in the number of tasks and the desired accuracy. Our primary interest is formally understanding the fundamental benefits of transfer learning, compared to learning each target independently from the others. Our approach to the transfer problem is general, in the sense that it can be used with a variety of learning protocols. The key insight driving our approach is that the distribution of the target concepts is identifiable from the joint distribution over a number of random labeled data points equal the Vapnik-Chervonenkis dimension of the concept space. This is not necessarily the case for the joint distribution over any smaller number of points. This work has particularly interesting implications when applied to active learning methods.

Cite this Paper

BibTeX


@InProceedings{pmlr-v19-yang11a,
  title = 	 {Identifiability of Priors from Bounded Sample Sizes  with Applications to Transfer Learning},
  author = 	 {Yang, Liu and Hanneke, Steve and Carbonell, Jaime},
  booktitle = 	 {Proceedings of the 24th Annual Conference on Learning Theory},
  pages = 	 {789--806},
  year = 	 {2011},
  editor = 	 {Kakade, Sham M. and von Luxburg, Ulrike},
  volume = 	 {19},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Budapest, Hungary},
  month = 	 {09--11 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v19/yang11a/yang11a.pdf},
  url = 	 {https://proceedings.mlr.press/v19/yang11a.html},
  abstract = 	 {We explore a transfer learning setting, in which a finite sequence of target concepts are sampled independently with an unknown distribution from a known family. We study the total number of labeled examples required to learn all targets to an arbitrary specified expected accuracy, focusing on the asymptotics in the number of tasks and the desired accuracy.  Our primary interest is formally understanding the fundamental benefits of transfer learning, compared to learning each target independently from the others. Our approach to the transfer problem is general, in the sense that it can be used with a variety of learning protocols.  The key insight driving our approach is that the distribution of the target concepts is identifiable from the joint distribution over a number of random labeled data points equal the Vapnik-Chervonenkis dimension of the concept space.  This is not necessarily the case for the joint distribution over any smaller number of points. This work has particularly interesting implications when applied to active learning methods.}
}

Endnote

%0 Conference Paper
%T Identifiability of Priors from Bounded Sample Sizes  with Applications to Transfer Learning
%A Liu Yang
%A Steve Hanneke
%A Jaime Carbonell
%B Proceedings of the 24th Annual Conference on Learning Theory
%C Proceedings of Machine Learning Research
%D 2011
%E Sham M. Kakade
%E Ulrike von Luxburg	
%F pmlr-v19-yang11a
%I PMLR
%P 789--806
%U https://proceedings.mlr.press/v19/yang11a.html
%V 19
%X We explore a transfer learning setting, in which a finite sequence of target concepts are sampled independently with an unknown distribution from a known family. We study the total number of labeled examples required to learn all targets to an arbitrary specified expected accuracy, focusing on the asymptotics in the number of tasks and the desired accuracy.  Our primary interest is formally understanding the fundamental benefits of transfer learning, compared to learning each target independently from the others. Our approach to the transfer problem is general, in the sense that it can be used with a variety of learning protocols.  The key insight driving our approach is that the distribution of the target concepts is identifiable from the joint distribution over a number of random labeled data points equal the Vapnik-Chervonenkis dimension of the concept space.  This is not necessarily the case for the joint distribution over any smaller number of points. This work has particularly interesting implications when applied to active learning methods.

RIS


TY  - CPAPER
TI  - Identifiability of Priors from Bounded Sample Sizes  with Applications to Transfer Learning
AU  - Liu Yang
AU  - Steve Hanneke
AU  - Jaime Carbonell
BT  - Proceedings of the 24th Annual Conference on Learning Theory
DA  - 2011/12/21
ED  - Sham M. Kakade
ED  - Ulrike von Luxburg	
ID  - pmlr-v19-yang11a
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 19
SP  - 789
EP  - 806
L1  - http://proceedings.mlr.press/v19/yang11a/yang11a.pdf
UR  - https://proceedings.mlr.press/v19/yang11a.html
AB  - We explore a transfer learning setting, in which a finite sequence of target concepts are sampled independently with an unknown distribution from a known family. We study the total number of labeled examples required to learn all targets to an arbitrary specified expected accuracy, focusing on the asymptotics in the number of tasks and the desired accuracy.  Our primary interest is formally understanding the fundamental benefits of transfer learning, compared to learning each target independently from the others. Our approach to the transfer problem is general, in the sense that it can be used with a variety of learning protocols.  The key insight driving our approach is that the distribution of the target concepts is identifiable from the joint distribution over a number of random labeled data points equal the Vapnik-Chervonenkis dimension of the concept space.  This is not necessarily the case for the joint distribution over any smaller number of points. This work has particularly interesting implications when applied to active learning methods.
ER  -

APA


Yang, L., Hanneke, S. & Carbonell, J.. (2011). Identifiability of Priors from Bounded Sample Sizes  with Applications to Transfer Learning. Proceedings of the 24th Annual Conference on Learning Theory, in Proceedings of Machine Learning Research 19:789-806 Available from https://proceedings.mlr.press/v19/yang11a.html.

Related Material

Download PDF