A-Optimality for Active Learning of Logistic Regression Classifiers

Schein, Andrew I; Ungar, Lyle

Files

aactive.pdf (151.79 KB)

Related Collections

Technical Reports (CIS)

Subject

computer science
active learning
logistic regression

Permalink

https://repository.upenn.edu/handle/20.500.14332/7103

Abstract

Over the last decade there has been growing interest in pool-based active learning techniques, where instead of receiving an i.i.d. sample from a pool of unlabeled data, a learner may take an active role in selecting examples from the pool. Queries to an oracle (a human annotator in most applications) provide label information for the selected observations, but at a cost. The challenge is to end up with a model that provides the best possible generalization error at the least cost. Popular methods such as uncertainty sampling often work well, but sometimes fail badly. We take the A-optimality criterion used in optimal experimental design, and extend it so that it can be used for pool-based active learning of logistic regression classifiers. A-optimality has attractive theoretical properties, and empirical evaluation confirms that it offers a more robust approach to active learning for logistic regression than alternatives.
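To make the contrast in the abstract concrete, the sketch below compares two pool-based query rules for a binary logistic regression model: uncertainty sampling, and a textbook A-optimality rule that picks the pool point minimizing the trace of the inverse Fisher information. This is a minimal illustration under stated assumptions, not the algorithm from the report: the function names, the `ridge` term, and the use of the plain trace of the inverse Fisher information are all assumptions made here, and the report's own criterion may differ in detail.

```python
import numpy as np


def sigmoid(z):
    """Logistic function."""
    return 1.0 / (1.0 + np.exp(-z))


def fisher_information(X, w, ridge=1e-3):
    """Fisher information of binary logistic regression at weights w.

    The ridge term is an assumption of this sketch; it keeps the matrix
    invertible when the labeled set does not span every feature.
    """
    p = sigmoid(X @ w)
    var = p * (1.0 - p)  # per-example Bernoulli variance
    return (X * var[:, None]).T @ X + ridge * np.eye(X.shape[1])


def uncertainty_query(X_pool, w):
    """Index of the pool point whose predicted probability is closest to 0.5."""
    p = sigmoid(X_pool @ w)
    return int(np.argmin(np.abs(p - 0.5)))


def a_optimal_query(X_labeled, X_pool, w, ridge=1e-3):
    """Index of the pool point that minimizes trace(F_new^{-1}).

    For logistic regression the Fisher information does not depend on the
    label, so each candidate can be scored before the oracle is queried.
    """
    F = fisher_information(X_labeled, w, ridge)
    scores = []
    for x in X_pool:
        p = sigmoid(x @ w)
        F_new = F + p * (1.0 - p) * np.outer(x, x)
        scores.append(np.trace(np.linalg.inv(F_new)))
    return int(np.argmin(scores))


# Hypothetical toy data: the labeled set carries no information about feature 1.
w = np.array([1.0, 0.0])
X_labeled = np.array([[1.0, 0.0], [2.0, 0.0]])
X_pool = np.array([[0.1, 0.0], [3.0, 1.0]])

print("uncertainty sampling picks:", uncertainty_query(X_pool, w))
print("A-optimality picks:", a_optimal_query(X_labeled, X_pool, w))
```

On this toy pool the two rules disagree. Uncertainty sampling prefers the point nearest the decision boundary, while the A-optimality score prefers the point that reduces parameter variance along the feature the labeled set has not yet covered.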

Publication date

2004-01-01

Comments

University of Pennsylvania Department of Computer and Information Science Technical Report No. MS-CIS-04-07.

Collection

Reports
