Detecting Questions in Online Communities: A Machine Learning Approach

Küçük Resim Yok

Tarih

2024

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Institute of Electrical and Electronics Engineers Inc.

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

The proliferation of online forums and communities has greatly facilitated knowledge sharing and user support but has also introduced the significant challenge of managing redundant and semantically similar questions. Traditional keyword-based methods have proven inadequate in addressing this issue due to the inherent complexities of natural language, where the same idea can be expressed in numerous ways. This study investigates the use of advanced machine learning algorithms - Logistic Regression, Random Forest, and Gradient Boosting (XGBoost) - to detect semantically similar questions. By employing the Quora Question Pairs dataset, the performance of these models is evaluated using metrics such as accuracy, precision, recall, and F1-score. This research not only provides a comparative analysis of these machine learning models but also suggests a framework for improving information retrieval and user experience in online forums. The study highlights the potential for future integration of deep learning models and advanced semantic understanding techniques to further enhance the detection of semantically similar questions. © 2024 IEEE.

Açıklama

IEEE MP Section; Institution of Electronics and Telecommunications Engineers (IETE)
16th IEEE International Conference on Computational Intelligence and Communication Networks, CICN 2024 -- 22 December 2024 through 23 December 2024 -- Indore -- 206392

Anahtar Kelimeler

Machine Learning, Natural Language Processing, Sentiment Analysis, Word Embeddings

Künye