MASHAEIR: Bootstrapping a Multi-Dialect Fine-Grained Emotion Thesaurus for Arabic Using Twitter

Article

Last updated: 04 Jan 2025

Abstract

User-generated content on social media sites such as Twitter and Facebook provides a rich source of
information about people's emotions towards products, issues, people and major events. Accordingly, the focus of much
research has shifted from negative-positive sentiment classification to the recognition of more fine-grained emotions.
However, research on and resources for fine-grained emotion identification in Arabic texts are still lacking. To fill this
gap, this paper introduces MASHAEIR (an Arabic word that means 'emotions'), a corpus-based, multi-dialect, fine-grained
emotion thesaurus for Arabic. MASHAEIR was bootstrapped using 'big data' from Arabic Twitter from January 2007 to
July 2015. The thesaurus is enriched with (i) different types of single-word and multi-word terms expressing emotions,
(ii) Arabic dialectal variations in the expression of emotions and (iii) scores that reflect the intensity of the emotions
conveyed by these terms. The paper also presents a simple evaluation of the thesaurus's coverage on a sample Twitter
corpus. MASHAEIR is intended as an outline of a large-scale, easy-to-update emotion thesaurus for Arabic
that could be enriched in the future with further information, such as gender and age preferences in expressing
emotions.

DOI

10.21608/ejle.2015.60192

Keywords

Arabic Sentiment Analysis, Emotion Thesaurus, social media

Authors

First Name

Khaled

Last Name

Elghamry

Affiliation

Faculty of Alsun, Ain Shams University

Email

elghamryk@gmail.com

City

Cairo, Egypt

Volume

2

Article Issue

2

Related Issue

9133

Issue Date

2015-09-01

Receive Date

2015-06-19

Publish Date

2015-09-01

Page Start

10

Page End

21

Print ISSN

2356-8208

Online ISSN

2356-8216

Link

https://ejle.journals.ekb.eg/article_60192.html

Detail API

https://ejle.journals.ekb.eg/service?article_code=60192

Order

2

Type

Original Article

Type Code

1,039

Publication Type

Journal

Publication Title

The Egyptian Journal of Language Engineering

Publication Link

https://ejle.journals.ekb.eg/

Details

Type

Article

Created At

22 Jan 2023