Topic Modeling Using LDA and BERT Techniques: Teknofest Example
Yükleniyor...
Dosyalar
Tarih
2021
Yazarlar
Dergi Başlığı
Dergi ISSN
Cilt Başlığı
Yayıncı
Institute of Electrical and Electronics Engineers Inc.
Erişim Hakkı
info:eu-repo/semantics/closedAccess
Özet
This paper is a natural language processing study and includes models used in natural language processing. In this paper, topic modeling, which is one of the sub-fields of natural language processing, has been studied. In order to make topic modeling, the data set was obtained by using the data scraping method, which has been very popular in recent years, over social media. The dataset is related to Teknofest competitions. The dataset was created by utilizing the Selenium library, one of the popular libraries used for the data scraping method. In order to be able to analyze on the prepared data set and to ensure the consistency of the clustering process, the text to be used before the analysis was preprocessed. After text preprocessing, clustering was performed on the data set with natural language processing techniques such as BERT and LDA . © 2021 IEEE
Açıklama
6th International Conference on Computer Science and Engineering, UBMK 2021 -- 15 September 2021 through 17 September 2021 -- 176826
Anahtar Kelimeler
Bert, Data Scraping, LDA, Teknofest, Topic Modeling, Clustering algorithms, Modeling languages, Bert, Clustering process, Data scraping, Data set, LDA, Social media, Sub fields, Teknofest, Text preprocessing, Topic Modeling, Natural language processing systems
Kaynak
Proceedings - 6th International Conference on Computer Science and Engineering, UBMK 2021