Theses and Dissertations

Classification and Keyword Identification of COVID 19 Misinformation on Social Media: A Framework for Semantic Analysis

Grace Y. Smith

Date of Award

3-2022

Document Type

Thesis

Degree Name

Master of Science

Department

Department of Mathematics and Statistics

First Advisor

Christine M. Schubert Kabban, PhD

Abstract

The growing surge of misinformation among COVID-19 communication can pose great hindrance to truth, magnify distrust in policy makers and/or degrade authorities’ credibility, and it can even harm public health. Classification of textual context on social media data relating to COVID-19 is an effective tool to combat misinformation on social media platforms. In this research, Twitter data was leveraged to 1) develop classification methods to detect misinformation and identify Tweet sentiment with respect to COVID-19 and 2) develop a human-in-the-loop interactive framework to enable identification of keywords associated with social context, here, being misinformation regarding COVID-19. 1) Six fusion-based classification models were built fusing three classical machine learning algorithms. The best performing models were selected to detect misinformation and to classify sentiment. We found the public reacted more positively towards COVID-19 misinformation and positive sentiment increased in August 2020 relative to April 2020 for all but political or biased related misinformation. 2) The most semantically similar keywords were chosen via distribution representations of topics and recommended by optimal ROC curves. The interactive system recommended 21 and 22 keywords related to conspiracy and unreliable misinformation, respectively and are most semantically similar to the user inquiry “COVID start lab.”

AFIT Designator

AFIT-ENC-MS-22-M-002

DTIC Accession Number

AD1166828

Recommended Citation

Smith, Grace Y., "Classification and Keyword Identification of COVID 19 Misinformation on Social Media: A Framework for Semantic Analysis" (2022). Theses and Dissertations. 5365.
https://scholar.afit.edu/etd/5365

Download

Included in

Numerical Analysis and Computation Commons

COinS

Theses and Dissertations

Classification and Keyword Identification of COVID 19 Misinformation on Social Media: A Framework for Semantic Analysis

Date of Award

Document Type

Degree Name

Department

First Advisor

Abstract

AFIT Designator

DTIC Accession Number

Recommended Citation

Included in

Search

Browse

Author Corner

Theses and Dissertations

Classification and Keyword Identification of COVID 19 Misinformation on Social Media: A Framework for Semantic Analysis

Author

Date of Award

Document Type

Degree Name

Department

First Advisor

Abstract

AFIT Designator

DTIC Accession Number

Recommended Citation

Included in

Share

Search

Browse

Author Corner