Beta
8245

REAL TIME FIRST STORY DETECTION IN TWITTER USING A MODIFIED TF-IDF ALGORITHM

Article

Last updated: 22 Jan 2023

Subjects

-

Tags

-

Abstract

Twitter is a social micro blogging, it has its own feature that it enables to tweet only a maximum of 140 characters per tweet. Even with this small number of characters per tweet, analyzing the tweets for billions of users faces the challenges of real-time data processing. One of the important aspects of social behavior is that we can detect the significance of the events and the way the people reacted to them. In this paper, we focus on First Story Detection (FSD) that means we can detect bursts of tweets that refer to a particular topic. First story is defined as the first document from a given series of documents to discuss a specific event, which occurred at a particular time and place. TF-IDF denotes to term frequency–inverse document frequency is an algorithm traditionally used in most of Text similarity applications like FSD. In this paper, we embedded a modified version of TF-IDF algorithm to enhance the accuracy of a pre-implemented open source for FSD that uses Storm platform to benefit from its scalability, efficiency and robustness in analyzing the tweets in real time. The empirical results show significant enhancements in the accuracy of the detection without noticeable effect on performance

DOI

10.21608/ijicis.2017.8245

Keywords

Real Time, Similarity Algorithms, social media, Information Retrieval, Big Data

Authors

First Name

Samar

Last Name

elbedwehy

MiddleName

-

Affiliation

Computer Science Department Faculty of Computer and Information Sciences, Mansoura University - Egypt

Email

samarelbedwehy@mans.edu.eg

City

-

Orcid

-

First Name

M

Last Name

Alrahmawy

MiddleName

-

Affiliation

Computer Science Department Faculty of Computer and Information Sciences, Mansoura University - Egypt

Email

mrahmawy@mans.edu.eg

City

-

Orcid

-

First Name

Taher

Last Name

Hamza

MiddleName

-

Affiliation

Computer Science Department Faculty of Computer and Information Sciences, Mansoura University - Egypt

Email

taher_hamza@yahoo.com

City

-

Orcid

-

Volume

17

Article Issue

3

Related Issue

1602

Issue Date

2017-07-01

Receive Date

2018-06-27

Publish Date

2017-07-01

Page Start

11

Page End

31

Print ISSN

1687-109X

Online ISSN

2535-1710

Link

https://ijicis.journals.ekb.eg/article_8245.html

Detail API

https://ijicis.journals.ekb.eg/service?article_code=8245

Order

2

Type

Original Article

Type Code

494

Publication Type

Journal

Publication Title

International Journal of Intelligent Computing and Information Sciences

Publication Link

https://ijicis.journals.ekb.eg/

MainTitle

-

Details

Type

Article

Created At

22 Jan 2023