Skip to main navigation Skip to search Skip to main content

WorkingHands: A hand-tool assembly dataset for image segmentation and activity mining

  • Roy Shilkrot
  • , Supreeth Narasimhaswamy
  • , Saif Vazir
  • , Minh Hoai
  • Stony Brook University
  • Tulip Interfaces Inc.

Research output: Contribution to conferencePaperpeer-review

9 Scopus citations

Abstract

Computer vision in manufacturing is a decades long effort into automatic inspection and verification of the work pieces, while visual recognition focusing on the human operators is becoming ever prominent. Semantic segmentation is an exemplary vision task that is key to enabling crucial assembly applications such as completion time tracking and manual process verification. However, focus on segmentation of human hands while performing complex tasks such as manual assembly is still lacking. Segmenting hands from tools, work pieces, background and other body parts is difficult because of self-occlusions and intricate hand grips and poses. In this paper we introduce WorkingHands, a dataset of pixel-level annotated images of hands performing 13 different tool-based assembly tasks, from both real-world captures and virtual-world renderings, with RGB+D images from a high-resolution range camera and ray casting engine. Moreover, using the dataset, we can learn a generic Hand-Task Descriptor that is useful for retrieving hand images and video performing similar operations across different non-annotated datasets.

Original languageEnglish
StatePublished - 2020
Event30th British Machine Vision Conference, BMVC 2019 - Cardiff, United Kingdom
Duration: Sep 9 2019Sep 12 2019

Conference

Conference30th British Machine Vision Conference, BMVC 2019
Country/TerritoryUnited Kingdom
CityCardiff
Period09/9/1909/12/19

Fingerprint

Dive into the research topics of 'WorkingHands: A hand-tool assembly dataset for image segmentation and activity mining'. Together they form a unique fingerprint.

Cite this