Skip to main navigation Skip to search Skip to main content

Cognitive-Biometric Recognition from Language Usage: A Feasibility Study

Research output: Contribution to journalArticlepeer-review

24 Scopus citations

Abstract

We propose a novel cognitive biometrics modality based on written language-usage of an individual. This is a feasibility study using the Internet-scale blogs, with tens of thousands of authors to create a cognitive fingerprint for an individual. Existing cognitive biometric modalities involve learning from obtrusive sensors placed on human body. Our modality is based on the characteristic pattern of how individuals express their thoughts through written language. The problems of cognitive authentication (1:1 comparison of genuine versus impostor) and identification (1:n search) are formulated. We detail the algorithms to learn a classifier to distinguish between genuine and impostor classes (for authentication) and multiple classes (for identification). We conclude that a cognitive fingerprint can be successfully learnt, using stylistic (writing style), semantic (themes), and syntactic (grammatical) features extracted from blogs. Our methodology shows promising results (with 79% as the area under the ROC (AUC) in case of authentication). For identification, the individual class accuracies are up to 90%. We performed stricter tests to see how our system performs for unseen user, and report the accuracies of 72% (genuine) and 71% (impostor). Such a study lays the groundwork for building alternative cognitive systems. The modality, presented here, is easy to obtain, unobtrusive and needs no additional hardware.

Original languageEnglish
Pages (from-to)134-143
Number of pages10
JournalIEEE Transactions on Information Forensics and Security
Volume12
Issue number1
DOIs
StatePublished - Jan 2017

Keywords

  • Soft biometrics
  • class imbalance
  • large-scale biometric datasets
  • multi-class classification
  • novel cognitive biometrics

Fingerprint

Dive into the research topics of 'Cognitive-Biometric Recognition from Language Usage: A Feasibility Study'. Together they form a unique fingerprint.

Cite this