Web Application for Sentiment Analysis of Thai Viewers on YouTube

Authors

  • Supaporn Simcharoen KMUTNB
  • Jakapat Jodduangchan
  • Papop Sangeamsak
  • Kongphop Sri-on
  • Bunnapon Takumwan

Keywords:

Sentiment Analysis, Thai Word Segmentation, Social Media, YouTube

Abstract

This research presents the development and application of sentiment analysis techniques for Thai-language comments on YouTube, using a dataset categorized into three types of sentiment: positive, negative, and neutral. The research begins with text preparation and preprocessing, such as Thai word segmentation and removing stopwords to clean the data. Text modeling techniques and the VADER tool were employed to analyze the sentiment of the comments. After processing, a Word Cloud was generated to visualize frequently occurring words in positive and negative sentiment comments, along with graphs depicting the sentiment distribution within the dataset. The analysis results reveal patterns of sentiment distribution across various comments, which can be utilized to study online user behavior. Besides, data from a trending video featuring a pygmy hippopotamus named "Moo Deng" was used as a case study.

References

Zhang, L., & Liu, B. (2017). Sentiment Analysis and Opinion Mining. In C. Sammut & G. I. Webb (Eds.), Encyclopedia of

Machine Learning and Data Mining (pp. 1152–1161). Springer US.

Saad, S., & Saberi, B. (2017). Sentiment Analysis or Opinion Mining: A Review. International Journal on Advanced Science, Engineering and Information Technology, 7(5), 1660 -1666.

Statista (2023). YouTube – Statistics & Facts. Retrieved from https://www.statista.com

DataReportal (2022). Digital 2022: Thailand. Retrieved from https://datareportal.com

YouTube Marketing (2022). Consumer Behavior Insights on YouTube. Retrieved from https://www.thinkwithgoogle.com

Ongkrutraksa, W. (2021). The Influences of Marketing Communications in YouTube on Behavior of Generation Y and Z Consumer. Journal of Public Relations and Advertising, 14(1), 1-12.

Phatthiyaphaibun, W., Chaovavanich, K., Polpanumas, C., Suriyawongkul, A., Lowphansirikul, L., Chormai, P., Limkonchotiwat, P., Suntorntip, T., & Udomcharoenchaikit, C.(2023). PyThaiNLP: Thai Natural Language Processing in Python. In L. Tan, D. Milajevs, G. Chauhan, J. Gwinnup, & E. Rippeth (Eds.), Proceedings of the 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS 2023) (pp. 25-36).Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.nlposs-1.4

Hutto C. J. & Gilbert E. (2014). VADER: A Parsimonious Rulebased Model for Sentiment Analysis of Social Media Text. Proceedings of the International AAAI Conference on Web and Social Media, 8(1), 216 – 225. https://doi.org/10.1609/icwsm.v8i1.14550

Jurafsky D. & Martin J.H. (2025) . Speech and Language Processing. (3rd ed.). Pearson Education.

Mikolov T., Chen K., Corrado G., & Dean J. (2013). Efficient Estimation of Word Representations in Vector Space. arXiv preprint arXiv:1301.3781. https://doi.org/10.48550/arXiv.1301.3781

Greyling, L., & Rossouw, J. (2022). Twitter sentiment and stock market movements: The predictive power of social media. VoxEU.org. 6.

Devlin J., Chang M.-W., Lee K., & Toutanova K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 NAACL-HLT.pp.4171 – 4186.

Jenkarn, N. (2020). Thai-textual Cyberbullying Detection using Support Vector Machines. Science Technology and Innovation Journal,1(1).

Downloads

Published

2025-06-23

How to Cite

1.
Simcharoen S, Jodduangchan J, Sangeamsak P, Sri-on K, Takumwan B. Web Application for Sentiment Analysis of Thai Viewers on YouTube. Acad. J. Sci. Appl. Sci. [internet]. 2025 Jun. 23 [cited 2025 Dec. 13];9(17):e3879. available from: https://ph03.tci-thaijo.org/index.php/ajsas/article/view/3879