NRC Word-Emotion Association Lexicon (aka EmoLex)


The NRC Emotion Lexicon is a list of English words and their associations with eight basic emotions (anger, fear, anticipation, trust, surprise, sadness, joy, and disgust) and two sentiments (negative and positive). The annotations were manually done by crowdsourcing.
Email: saif.mohammad@nrc-cnrc.gc.ca


To access the NRC Word-Emotion Association Lexicon, select the appropriate license below:

Please see the Emotion Lexicons: Ethics and Data Statement before using the lexicon.

This study has been approved by the NRC Research Ethics Board (NRC-REB) under protocol number 2009-94. REB review seeks to ensure that research projects involving humans as participants meet Canadian standards of ethics.

You may also be interested in these companion lexicons: NRC Valence, Arousal, and Dominance Lexcion and NRC Emotion Intensity Lexicon
(The full list of word-emotion, word-sentiment, and word-colour lexicons is available here.)


Ten years back, with some anticipation and lots of excitement, Peter Turney and I introduced the NRC Word-Emotion Association Lexicon. So grateful that, over the years, so many people have put their hopes and trust in it. Such joy to see them boldly shine a light on the human condition — the positives and the negatives; not shying away even from sadness, fear, and anger. This blog bost puts a spotlight on ten favorites (includes fun video and audio clips as well as links to papers and popular press articles):

 


Summary Details of the NRC Emotion Lexicon



Association Lexicon

Version

# of Terms Categories Association Scores Method of Creation Papers
Word-Emotion and Word-Sentiment Association Lexicon

NRC Word-Emotion Association Lexicon
(also called EmoLex)

README

 

0.92

(2010)

14,182 unigrams (words)

sentiments: negative, positive
emotions: anger, anticipation, disgust, fear, joy, sadness, surprise, trust
0 (not associated) or 1 (associated)

Manual: By crowdsourcing on Mechanical Turk.

Domain: General

Crowdsourcing a Word-Emotion Association Lexicon, Saif Mohammad and Peter Turney, Computational Intelligence, 29 (3), 436-465, 2013.
Paper (pdf)    BibTeX

Emotions Evoked by Common Words and Phrases: Using Mechanical Turk to Create an Emotion Lexicon
, Saif Mohammad and Peter Turney, In Proceedings of the NAACL-HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text, June 2010, LA, California.
Abstract    Paper (pdf)    Presentation

~25,000 senses*

not associated, weakly, moderately, or strongly associated

* The sense-level annotations provided by individual annotators for the eight emotions can be downloaded by clicking here.
Access various other word-emotion, word-sentiment, and word-colour lexicons here.

These third party packages faculitate the use of the NRC Emotion Lexicon:


NRC Emotion Lexicon in Other Languages

The NRC Emotion Lexicon has affect annotations for English words. Despite some cultural differences, it has been shown that a majority of affective norms are stable across languages. Thus we provide versions of the lexicon in over one hundred languages by translating the English terms using Google Translate (November 2017). Note that some translations by Google Translate may be incorrect or they may simply be transliterations of the original English terms. If you have a list of corrections for any language, we will be happy to hear from you.

Go here to obtain the lexicon in over one hundred languages. (The package also includes an older version of the distribution that had translations for just over 40 languages using Google Translate in 2015.) See README of the NRC Emotion Lexicon for more details about the lexicon.


An Interactive Visualizer


Impact

Some notable ways in which the NRC Emotion Lexicon has made impact include:
  • First of its kind: It was the first word-emotion association lexicon, with entries for eight basic emotions as well as positive and negative sentiment. It still remains the largest such lexicon. Prior work largely focused on positive and negative sentiment. While earlier work focused on words that *denotate* emotion, this work included the larger set of words that are associated with or connotate an emotion.

    • Quality Control: Careful attention was paid to ensure appropriate annotations including the use of a separate word choice question to make sure annotators knew the word and to guide them to the desired sense of the word for which annotations were solicited.

  • Impact on NLP: The lexicon impacted work in sentiment and emotion analysis in NLP. Notably, facilitating work beyond just the positive-negative affect dimension. The lexicon has been used for word-, sentence-, tweet-, and document-level sentiment and emotion analysis, abusive language detection, personality trait identification, stance detection, etc. The lexicon is especially useful in unsupervised settings and when training data is limited or not available. However, even with the onset of deep learning methods, many top systems in shared tasks (such as SemEval-2018 Task 1 Affect in Tweets) continue to benefit from the lexicon by using it to initialize their embeddings and adding additional lexicon-derived features.

  • Impact on fields beyond NLP:

    • Work on Well-Being and Health Disorders: Used in work on understanding pandemic response, feelings towards influenza vaccinations, depression detection, hate speech detection, identifying cyber-bullying, etc. Proceedings of workshops such as CL-psych and i2b2 describe systems that use the NRC Emotion Lexicon.

    • Psychology, Behavioural Science, Psycolinguistics, Fairness, and Social Science: Used in work on understanding how people express emotions, relationships between word characteristics (such as length and concreteness) with its associated emotion,  gender attitudes, as well as the role of emotions in the spread of information, especially news, fake news, and viral videos. The highly cited paper "The spread of true and false news online" uses the lexicon to determine associations of emotions with fake news and its virality.

    • Digital Humanities and Computational Literature (detecting narrative arcs in novels and fairy tales). Notable works include:

    • Art:
      • on creations such as the Wishing Wall, that that were displayed in: 
        • Barbican Centre, London, UK 
        • Tekniska Museet, Stockholm, Sweden (Oct 14 Aug 15) 
        • Onassis Cultural Centre, Athens (19th Oct15 10th Jan16) 
        • Zorlu Centre in Istanbul (16th Feb 12th June16)

      • on work like generating music that captures the flow of emotions in novels; with some music eventually being played at the Louvre: 
        • TIME, May 7, 2014: This Is What Classic Novels Sound Like When a Computer Turns Them Into Piano Music.
    • Human-Computer Interaction (virtual assistants, physiotherapy robots, etc.)

    • Ethics and Fairness (work on comparing attitudes towards men and women at work)

    • Data Science: see example data science projects highlighted in popular press (bottom of this page). Also see: Chatty maps. Fast Company, March 25, 2016: An Emotional Map Of The City, As Captured Through Its Sounds.

  • Citations:

  • Democratization of Emotion Analysis: The large lexicon allowed for the use of simple methods to track trends in emotions. This democratized emotion analysis and it was used not just by computer scientists, but also by journalists, psychologists, social scientists, and amateur data enthusiasts to detect trends in emotions in everything from the Brexit discourse, election tweets, Radiohead songs, abusive language, reddit posts, and more.

 
Terms of Use

 

If you are interested in our lexicon, please see these terms of use:

  • The lexicons mentioned in this page are available for direct download and can be used freely for research purposes.
  • The papers listed next to the lexicons provide details of the creation and use. If you use a lexicon, then please cite the associated papers.
  • If interested in commercial use of any of these lexicons, send email to the contact.
  • If you use a lexicon in a product or application, then please credit the authors and NRC appropriately. Also, if you send us an email, we will be thrilled to know about how you have used the lexicon.
  • Rather than redistributing the data, please direct interested parties to this page.
  • National Research Council Canada (NRC) disclaims any responsibility for the use of the lexicons listed here and does not provide technical support. However, the contact listed above will be happy to respond to queries and clarifications.
 
 
Feedback


We will be happy to hear from you, especially if:

  • you give us feedback regarding these lexicons.
  • you tell us how you have (or plan to) use the lexicons.
  • you are interested in having us analyze your data for sentiment, emotion, and other affectual information.
  • you are interested in a collaborative research project. We also regularly hire graduate students for research internships.

Email: saif.mohammad@nrc-cnrc.gc.ca

Twitter: