SENTIMENT ANALYSIS COMPARE LINEAR REGRESSION AND DECISION TREE REGRESSION ALGORITHM TO DETERMINE FILM RATING ACCURACY

  • Rivaldo Sitanggang Universitas Prima Indonesia
  • Daniel Ryan Hamonangan Sitompul Universitas Prima Indonesia
  • Stiven Hamonangan Sinurat Universitas Prima Indonesia
  • Ruben, Andreas Situmorang Universitas Prima Indonesia
  • Denis Jusuf Ziegel Universitas Prima Indonesia
  • Julfikar Rahmad Universitas Prima Indonesia
  • Evta Indra Universitas Prima Indonesia
Keywords: Movie rating, IMDb, Linear Regression, Decision Tree Regression, Machine Learning.

Abstract

Rating assessment in a film is the most important thing because it describes
the satisfaction of film lovers with the films they have watched. With
technological advances like now, we can easily find out the rating of a film
by using a platform to accommodate the audience's review results, namely
the Internet Movie Database (Imdb). The Machune Learning model that has
been created can determine whether the film we watch is good based on
ratings and reviews from moviegoers who share their experiences in
watching similar films. Based on the results of the analysis of the two
algorithms Linear Regression and Dicision Tree Regression, the best
accuracy results from the Decision Tree Regression algorithm are 95.47%

Downloads

Download data is not yet available.

References

N. A. Mayangky, D. N. Kholifah, I. Balla, and I. J. Thira, “Pengaruh Rating Film Terhadap Jumlah Audience Yang Menonton Film,” Indones. J. Softw. Eng., vol. 5, no. 2, pp. 113–120, 2019, doi: 10.31294/ijse.v5i2.6963.
[2] N. Armstrong and K. Yoon, “Movie Rating Prediction,” Citeseer, pp. 507–511, 2009, [Online]. Available: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.87.1964&rep=rep1&type=pdf.
[3] W. R. Bristi, Z. Zaman, and N. Sultana, “Predicting IMDb Rating of Movies by Machine Learning Techniques,” 2019 10th Int. Conf. Comput. Commun. Netw. Technol. ICCCNT 2019, pp. 1–5, 2019, doi: 10.1109/ICCCNT45670.2019.8944604.
[4] R. Dhir and A. Raj, “Movie Success Prediction using Machine Learning Algorithms and their Comparison,” ICSCCC 2018 - 1st Int. Conf. Secur. Cyber Comput. Commun., pp. 385–390, 2018, doi: 10.1109/ICSCCC.2018.8703320.
[5] A. G. Sooai and M. Laniwati, “Ekstraksi Ulasan Sentimen Film dari Twitter dengan Naïve Bayes pada Situs Web Media Sosial Penggemar Film,” J. Intell. Syst. Comput., vol. 3, no. 1, pp. 49–54, 2021, doi: 10.52985/insyst.v3i1.186.
[6] P. S. M. Suryani, L. Linawati, and K. O. Saputra, “Penggunaan Metode Naïve Bayes Classifier pada Analisis Sentimen Facebook Berbahasa Indonesia,” Maj. Ilm. Teknol. Elektro, vol. 18, no. 1, p. 145, 2019, doi: 10.24843/mite.2019.v18i01.p22.
[7] P. Antinasari, R. S. Perdana, and M. A. Fauzi, “Analisis Sentimen Tentang Opini Film Pada Dokumen Twitter Berbahasa Indonesia Menggunakan Naive Bayes Dengan Perbaikan Kata Tidak Baku,” J. Pengemb. Teknol. Inf. dan Ilmu Komput., vol. 1, no. 12, pp. 1718–1724, 2017, [Online]. Available: http://j-ptiik.ub.ac.id.
[8] R. Yanto, “Implementasi Data Mining Estimasi Ketersediaan Lahan Pembuangan Sampah menggunakan Algoritma Simple Linear Regression,” J. RESTI (Rekayasa Sist. dan Teknol. Informasi), vol. 2, no. 1, pp. 361–366, 2018, doi: 10.29207/resti.v2i1.282.
[9] M. Alfi, R. Reynaldhi, and Y. Sibaroni, “Analisis Sentimen Review Film pada Twitter menggunakan Metode Klasifikasi Hybrid Naïve Bayes dan Decision Tree,” vol. 8, no. 5, pp. 10127–10137, 2021.
[10] J. Prima et al., “PREDIKSI HARGA MOBIL MENGGUNAKAN ALGORITMA REGRESSI DENGAN HYPER-PARAMETER TUNING,” vol. 4, no. 2, pp. 1–5, 2021.
[11] J. M. Hellerstein and U. C. Berkeley, “DataCleaning-ucb(2),” United Nations Econ. Comm. Eur., pp. 1–42, 2008, [Online]. Available: http://db.cs.berkeley.edu/jmh/cleaning-unece.pdf%5Cnpapers2://publication/uuid/DC7173AB-6B26-4B8B-AEC3-4C7E65CEEFED.
[12] J. Prima et al., “PREDIKSI WATER QUALITY INDEX ( WQI ) MENGGUNAKAN ALGORITMA REGRESSI DENGAN HYPER-PARAMETER TUNING,” vol. 5, no. 1, pp. 44–50, 2021.
[13] J. Prima et al., “ANALISIS PERBANDINGAN SENTIMEN CORONA VIRUS DISEASE- 2019 ( COVID19 ) PADA TWITTER MENGGUNAKAN METODE LOGISTIC REGRESSION DAN SUPPORT VECTOR MACHINE ( SVM ),” vol. 5, no. 2, 2022.
[14] N. S. Fathullah, Y. A. Sari, and P. P. Adikara, “Analisis Sentimen Terhadap Rating dan Ulasan Film dengan menggunakan Metode Klasifikasi Naïve Bayes dengan Fitur Lexicon-Based,” J. Pengemb. Teknol. Inf. dan Ilmu Komput., vol. 4, no. 2, pp. 590–593, 2020, [Online]. Available: http://j-ptiik.ub.ac.id.
[15] A. Saleh, E. Indra, and M. Harahap, “Kombinasi Jaringan Learning Vector Quantization Dan Normalized Cross Correlation Pada Pengenalan Wajah,” J. Sist. Inf. dan Ilmu Komput. Prima(JUSIKOM PRIMA), vol. 3, no. 2, pp. 13–20, 2020, doi: 10.34012/jusikom.v3i2.851.
[16] Tamba, S.P., 2022. Penerapan Data Mining Algoritma Apriori Dalam Menentukan Stok Bahan Baku Pada Restoran Nelayan Menggunakan Metode Association Rule. Jurnal Sistem Informasi dan Ilmu Komputer Prima (JUSIKOM PRIMA), 5(2), pp.97-102.
Published
2022-06-23
How to Cite
Sitanggang, R., Sitompul, D. R. H., Sinurat, S. H., Situmorang, R. A., Ziegel, D. J., Rahmad, J., & Indra, E. (2022). SENTIMENT ANALYSIS COMPARE LINEAR REGRESSION AND DECISION TREE REGRESSION ALGORITHM TO DETERMINE FILM RATING ACCURACY. INFOKUM, 10(02), 880-890. Retrieved from http://infor.seaninstitute.org/index.php/infokum/article/view/427