LINGUIST List 28.1377

Mon Mar 20 2017

Calls: Computational Linguistics/Denmark

Editor for this issue: Kenneth Steimel <>

Date: 17-Mar-2017
From: Saif Mohammad <>
Subject: Shared task on Emotion Intensity
E-mail this message to a friend

Full Title: Shared task on Emotion Intensity

Date: 08-Sep-2017 - 08-Sep-2017
Location: Copenhagen, Denmark
Contact Person: Saif Mohammad
Meeting Email: < click here to access email >
Web Site:

Linguistic Field(s): Computational Linguistics

Call Deadline: 10-Jun-2017

Meeting Description:

Part of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA-2017), which is to be held in conjunction with EMNLP-2017.

Background and Significance:

Existing emotion datasets are mainly annotated categorically without an indication of degree of emotion. Further, the tasks are almost always framed as classification tasks (identify 1 among n emotions for this sentence). In contrast, it is often useful for applications to know the degree to which an emotion is expressed in text. In this task, systems have to automatically determine the intensity of emotions in tweets.


Given a tweet and an emotion X, determine the intensity or degree of emotion X felt by the speaker -- a real-valued score between 0 and 1. The maximum possible score 1 stands for feeling the maximum amount of emotion X (or having a mental state maximally inclined towards feeling emotion X). The minimum possible score 0 stands for feeling the least amount of emotion X (or having a mental state maximally away from feeling emotion X). The tweet along with the emotion X will be referred to as an instance. Note that the absolute scores have no inherent meaning -- they are used only as a means to convey that the instances with higher scores correspond to a greater degree of emotion X than instances with lower scores.


Training, development, and test datasets are provided for four emotions: joy, sadness, fear, and anger. For example, the anger training dataset has tweets along with a real-valued score between 0 and 1 indicating the degree of anger felt by the speaker. The test data includes only the tweet text. Gold emotion intensity scores will be released after the evaluation period.


For each emotion, systems are evaluated by calculating the Pearson Correlation Coefficient with Gold ratings. The correlation scores across all four emotions will be averaged to determine the bottom-line competition metric by which the submissions will be ranked.

The official evaluation script (which also acts as a format checker) is available for download. You may want to run it on the training set to determine your progress, and eventually on the test set to check the format of your submission.

Web Hosting of the Competition:

The entire competition will be hosted on CodaLab Competitions ( A direct link to the Emotion Intensity CodaLab competition is here:

(CodaLab has been used in many research evaluation competitions in the past such as Microsoft COCO Image Captioning Challenge and SemEval-2017.)

Call for Papers:

Participants will be given the opportunity to write a system-description paper that describes their system, resources used, results, and analysis. This paper will be part of the official WASSA-2017 proceedings. The paper is to be four pages long plus two pages at most for references. The papers are to follow the format and style files provided by EMNLP-2017.


Training data ready: Data for anger, fear, and joy are already available; data for sadness will be made available in the second half of February 2017
Evaluation period starts: May 02, 2017
Evaluation period ends: May 14, 2017
Results posted: May 21, 2017
Workshop paper submission deadline: June 10, 2017
Author notifications : July 9, 2017
Camera ready submissions due: July 23, 2017

Baseline Weka System for Determining Emotion Intensity:

You are free to build a system from scratch using any available software packages and resources, as long as they are not against the spirit of fair competition. In order to assist testing of ideas, we also provide a baseline emotion intensity system that you can build on. The use of this system is completely optional. The system is available here:

Organizers of the shared task:

Saif M. Mohammad
National Research Council Canada

Felipe Bravo-Marquez
The University of Waikato

Alexandra Balahur
European Commission, Brussels

Saif M. Mohammad,

Page Updated: 20-Mar-2017