Skip to main navigation Skip to search Skip to main content

Weakly supervised discriminative localization and classification: A joint learning process

  • Minh Hoai Nguyen
  • , Lorenzo Torresani
  • , Fernando De La Torre
  • , Carsten Rother

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

151 Scopus citations

Abstract

Visual categorization problems, such as object classification or action recognition, are increasingly often approached using a detection strategy: a classifier function is first applied to candidate subwindows of the image or the video, and then the maximum classifier score is used for class decision. Traditionally, the subwindow classifiers are trained on a large collection of examples manually annotated with masks or bounding boxes. The reliance on time-consuming human labeling effectively limits the application of these methods to problems involving very few categories. Furthermore, the human selection of the masks introduces arbitrary biases (e.g. in terms of window size and location) which may be suboptimal for classification. In this paper we propose a novel method for learning a discriminative subwindow classifier from examples annotated with binary labels indicating the presence of an object or action of interest, but not its location. During training, our approach simultaneously localizes the instances of the positive class and learns a subwindow SVM to recognize them. We extend our method to classification of time series by presenting an algorithm that localizes the most discriminative set of temporal segments in the signal. We evaluate our approach on several datasets for object and action recognition and show that it achieves results similar and in many cases superior to those obtained with full supervision.

Original languageEnglish
Title of host publication2009 IEEE 12th International Conference on Computer Vision, ICCV 2009
Pages1925-1932
Number of pages8
DOIs
StatePublished - 2009
Event12th International Conference on Computer Vision, ICCV 2009 - Kyoto, Japan
Duration: Sep 29 2009Oct 2 2009

Publication series

NameProceedings of the IEEE International Conference on Computer Vision

Conference

Conference12th International Conference on Computer Vision, ICCV 2009
Country/TerritoryJapan
CityKyoto
Period09/29/0910/2/09

Fingerprint

Dive into the research topics of 'Weakly supervised discriminative localization and classification: A joint learning process'. Together they form a unique fingerprint.

Cite this