Topic Modeling Using LDA and BERT Techniques: Teknofest Example

Yükleniyor...
Küçük Resim

Tarih

2021

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Institute of Electrical and Electronics Engineers Inc.

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

This paper is a natural language processing study and includes models used in natural language processing. In this paper, topic modeling, which is one of the sub-fields of natural language processing, has been studied. In order to make topic modeling, the data set was obtained by using the data scraping method, which has been very popular in recent years, over social media. The dataset is related to Teknofest competitions. The dataset was created by utilizing the Selenium library, one of the popular libraries used for the data scraping method. In order to be able to analyze on the prepared data set and to ensure the consistency of the clustering process, the text to be used before the analysis was preprocessed. After text preprocessing, clustering was performed on the data set with natural language processing techniques such as BERT and LDA . © 2021 IEEE

Açıklama

6th International Conference on Computer Science and Engineering, UBMK 2021 -- 15 September 2021 through 17 September 2021 -- 176826

Anahtar Kelimeler

Bert, Data Scraping, LDA, Teknofest, Topic Modeling, Clustering algorithms, Modeling languages, Bert, Clustering process, Data scraping, Data set, LDA, Social media, Sub fields, Teknofest, Text preprocessing, Topic Modeling, Natural language processing systems

Kaynak

Proceedings - 6th International Conference on Computer Science and Engineering, UBMK 2021

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye