Faculty Publications

Leveraging Large Language Models for Word Sense Disambiguation

Jung H. Yae, Air Force Institute of Technology
Nolan C. Skelly, Air Force Institute of Technology
Neil C. Ranly, Air Force Institute of Technology
Phillip M. LaCasse, Air Force Institute of TechnologyFollow

Document Type

Article

Publication Date

12-19-2024

Abstract

Natural language processing (NLP) is difficult because human language contains ambiguity. The same word can have a different meaning depending on the context and may result in different interpretations given biases held by a NLP technique. Correctly interpreting this ambiguity is not simply an important task in its own right but is a key enabler to major NLP activities such as machine translation and question answering. This research proposes three techniques to evaluate a large language models’(LLMs) ability to perform word sense disambiguation (WSD) and explores the efficacy of seven generative LLMs. The first technique assesses whether LLMs can, given a context sentence, select the correct word sense from a menu of options. The second asks LLMs, without options provided, to state whether or not a provided word sense is correct. The third technique presents the LLMs with context and an unseen word, assessing whether the LLMs can infer from context the sense of a word that it has not seen during training. Results demonstrate a strong relationship between model size and performance. Applications of WSD are demonstrated as part of an information extraction pipelines supporting sentiment analysis and as part of an LLM-evaluation suite to support machine learning operations.

Comments

The license for this article changed 31 January 2025. This change was issued through a correction posted to the journal at https://doi.org/10.1007/s00521-025-11082-z

© The Author(s) 2024

The "Link to Full Text" on this page opens the article at the publisher website.

This is an Open Access article published by Springer and distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License, which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way. CC BY-NC-ND 4.0

All data used in support of this research are drawn from publicly available corpora and are accessible at https://github/neil-r/study_wsd

DOI

10.1007/s00521-024-10747-5

Source Publication

Neural Computing and Applications (ISSN 0941-0643 | eISSN 1433-3058)

Recommended Citation

Yae, J. H., Skelly, N. C., Ranly, N. C., & LaCasse, P. M. (2025). Leveraging large language models for word sense disambiguation. Neural Computing and Applications, 37:4093-4110. https://doi.org/10.1007/s00521-024-10747-5

Link to Full Text

COinS

Faculty Publications

Leveraging Large Language Models for Word Sense Disambiguation

Document Type

Publication Date

Abstract

Comments

DOI

Source Publication

Recommended Citation

Search

Browse

Author Corner

Faculty Publications

Leveraging Large Language Models for Word Sense Disambiguation

Authors

Document Type

Publication Date

Abstract

Comments

DOI

Source Publication

Recommended Citation

Share

Search

Browse

Author Corner