Skip to main navigation Skip to search Skip to main content

Empath: A framework for evaluating entity-level sentiment analysis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

Sentiment analysis is the fundamental component in text-driven monitoring or forecasting systems, where the general sentiment towards real-world entities (e.g., people, products, organizations) are analyzed based on the sentiment signals embedded in a myriad of web text available today. Building such systems involves several practically important problems, from data cleansing (e.g., boilerplate removal, web-spam detection), and sentiment analysis at individual mention-level (e.g., phrase, sentence-, document-level) to the aggregation of sentiment for each entity-level (e.g., person, company) analysis. Most previous research in sentiment analysis however, has focused only on individual mention-level analysis, and there has been relatively less work that copes with other practically important problems for enabling a large-scale sentiment monitoring system. In this paper, we propose Empath, a new framework for evaluating entity-level sentiment analysis. Empath leverages objective measurements of entities in various domains such as people, companies, countries, movies, and sports, to facilitate entity-level sentiment analysis and tracking. We demonstrate the utility of Empath for the evaluation of a large-scale sentiment system by applying it to various lexicons using Lydia, our own large scale text-analytics tool, over a corpus consisting of more than a terabyte of newspaper data. We expect that Empath will encourage research that encompasses end-to-end pipelines to enable a large-scale text-driven monitoring and forecasting systems.

Original languageEnglish
Title of host publication2011 8th International Conference and Expo on Emerging Technologies for a Smarter World, CEWIT 2011
DOIs
StatePublished - 2011
Event2011 8th International Conference and Expo on Emerging Technologies for a Smarter World, CEWIT 2011 - Hauppauge, NY, United States
Duration: Nov 2 2011Nov 3 2011

Publication series

Name2011 8th International Conference and Expo on Emerging Technologies for a Smarter World, CEWIT 2011

Conference

Conference2011 8th International Conference and Expo on Emerging Technologies for a Smarter World, CEWIT 2011
Country/TerritoryUnited States
CityHauppauge, NY
Period11/2/1111/3/11

Fingerprint

Dive into the research topics of 'Empath: A framework for evaluating entity-level sentiment analysis'. Together they form a unique fingerprint.

Cite this