Research Trends on Sentiment Analysis and Imbalanced Data Handling in Fake Review Detection: A Systematic Literature Review

Authors

DOI:

https://doi.org/10.21015/vtse.v14i2.2328

Abstract

Fake reviews are deceptive evaluations that mislead customers rather than reflect genuine customer experiences. These reviews can damage the business's reputation by deceiving the customers, which then causes them to make poor decisions about what to buy and diminishes the trust that e-commerce platforms can have. Detecting fake reviews is crucial for e-commerce platforms to maintain their integrity, protect consumers, and uphold business reputations. Despite its importance, there is a paucity of comprehensive research addressing fake review detection through the lenses of Sentiment Analysis (SA) and imbalanced data handling. To bridge this gap, a systematic literature review uses Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. This review analyzed 43 studies from the Scopus and Web of Science databases, covering the period from 2019 to 2024. Three primary themes emerged: SA levels, detection methods, and techniques for handling imbalanced data, which further branched into 28 sub-themes. The analysis revealed key trends such as a predominant focus on document-level SA, the application of machine learning approaches, and data resampling techniques to address imbalanced datasets. The review underscored the necessity for more research on aspect-level analysis and the development of combinational approaches, such as hybrid models, to enhance the accuracy and reliability of fake review detection. These insights provide valuable guidance for researchers, data scientists, and developers seeking to advance the field.

References

M. Liu, Y. Shang, Q. Yue, and J. Zhou, "Detecting fake reviews using multidimensional representations with fine-grained aspects plan," IEEE Access, vol. 9, pp. 3765–3773, 2021, doi: 10.1109/ACCESS.2020.3047947.

E. Elmurngi and A. Gherbi, "Detecting fake reviews through sentiment analysis using machine learning techniques," IARIA/Data Analytics, pp. 65–72, 2017.

R. P. Kashti and P. S. Prasad, "Enhancing NLP techniques for fake review detection," International Research Journal of Engineering and Technology, vol. 6, pp. 241–245, 2019.

R. Na, D. Ruoyu, and S. Chunhua, "GADM: Manual fake review detection for O2O commercial platforms," Computers & Security, vol. 88, 2019, doi: 10.1016/j.cose.2019.101657.

W. H. Asaad, R. Allami, and Y. H. Ali, "Fake review detection using machine learning," Revue d'Intelligence Artificielle, vol. 37, no. 5, pp. 1159–1166, 2023, doi: 10.18280/ria.370507.

R. S. Jagdale, V. S. Shirsat, and S. N. Deshmukh, Sentiment Analysis on Product Reviews Using Machine Learning Techniques. Singapore: Springer, 2019, doi: 10.1007/978-981-13-0617-4.

D. Kumar, A. Gupta, V. K. Gupta, and A. Gupta, "Aspect-based sentiment analysis using machine learning and deep learning approaches," International Journal of Recent Innovation Trends in Computing and Communication, vol. 11, no. 5, pp. 118–138, 2023, doi: 10.17762/ijritcc.v11i5s.6636.

N. Kumar et al., "Fake product review detection and sentiment analysis," International Research Journal of Modern Engineering Technology and Science, vol. 5, no. 4, pp. 2444–2447, 2023.

K. Kumar, K. S. George, D. Bhatt, and O. P. Paul, "A brief study on fake review detection methods on e-commerce websites," International Journal of Research in Applied Science & Engineering Technology, vol. 11, no. 9, pp. 721–726, 2023, doi: 10.22214/ijraset.2023.52743.

H. Al Najada and X. Zhu, "iSRD: Spam review detection with imbalanced data distributions," in Proc. IEEE IRI, 2014, pp. 553–560, doi: 10.1109/IRI.2014.7051938.

D. Tranfield, D. Denyer, and P. Smart, "Towards a methodology for developing evidence-informed management knowledge by means of systematic review," British Journal of Management, vol. 14, no. 3, pp. 207–222, 2003.

C. U. Pae, "Why systematic review rather than narrative review?," Psychiatry Investigation, vol. 12, no. 3, pp. 417–419, 2015, doi: 10.4306/pi.2015.12.3.417.

P. Furley and N. Goldschmied, "Systematic vs. narrative reviews in sport and exercise psychology: Is either approach superior to the other?," Frontiers in Psychology, vol. 12, art. no. 685082, 2021, doi: 10.3389/fpsyg.2021.685082.

J. Sukhera, "Narrative reviews: Flexible, rigorous, and practical," Journal of Graduate Medical Education, vol. 14, no. 4, pp. 414–417, 2022, doi: 10.4300/JGME-D-22-00480.1.

B. Kitchenham and S. M. Charters, "Guidelines for performing systematic literature reviews in software engineering," EBSE Technical Report, ver. 2.3, vol. 1, pp. 1–54, 2007.

D. Moher et al., "Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement," Systematic Reviews, vol. 4, pp. 1–9, 2015.

R. Sarkis-Onofre, F. Catalá-López, E. Aromataris, and C. Lockwood, "How to properly use the PRISMA Statement," Systematic Reviews, vol. 10, no. 1, pp. 13–15, 2021, doi: 10.1186/s13643-021-01671-z.

M. J. Page et al., "The PRISMA 2020 statement: an updated guideline for reporting systematic reviews," BMJ, vol. 372, 2021, doi: 10.1136/bmj.n71.

G. Terry, N. Hayfield, V. Clarke, and V. Braun, Thematic Analysis, 2nd ed., 2017, pp. 17–37.

S. Dawadi, "Thematic analysis approach: a step-by-step guide for ELT research practitioners," Journal of NELTA, vol. 25, no. 1–2, pp. 62–71, 2020.

S. C. Lee, D. G. Lee, and Y. S. Seo, "Determining the best feature combination through text and probabilistic feature analysis for GPT-2-based mobile app review detection," Applied Intelligence, vol. 54, no. 2, pp. 1219–1246, 2024, doi: 10.1007/s10489-023-05201-3.

M. Tabany and M. Gueffal, "Sentiment analysis and fake Amazon reviews classification using SVM supervised machine learning model," Journal of Advanced Information Technology, vol. 15, no. 1, pp. 49–58, 2024, doi: 10.12720/jait.15.1.49-58.

Z. Hadi and A. Sunyoto, "Detecting fake reviews using n-gram model and chi-square," in Proc. 6th Int. Conf. Inf. Commun. Technol. (ICOIACT 2023), 2023, pp. 454–458, doi: 10.1109/ICOIACT59844.2023.10455895.

S. Akshara, S. Shiva, S. Kubireddy, T. Arun, and V. V. S. L. Kanthety, "A small comparative study of machine learning algorithms in the detection of fake reviews of Amazon products," in Proc. Int. Conf. Contemporary Comput. Inform. (IC3I 2023), 2023, pp. 2258–2263, doi: 10.1109/IC3I59117.2023.10398096.

M. Ramu, C. J. Raj, A. Nithish, C. Boggula, G. Ganesh Babu, and K. I. Srikanth Goud, "Applying deep learning methods on spam review detection," in Proc. 7th Int. Conf. Comput. Methodol. Commun. (ICCMC 2023), 2023, pp. 162–169, doi: 10.1109/ICCMC56507.2023.10083900.

D. Ganesh et al., "Implementation of novel machine learning methods for analysis and detection of fake reviews in social media," in Proc. 2nd Int. Conf. Sustain. Comput. Data Commun. Syst. (ICSCDS 2023), 2023, pp. 243–250, doi: 10.1109/ICSCDS56580.2023.10104856.

N. Rajesh et al., "Fake reviews detection based on sentiment analysis using ML classifiers," in Proc. IEEE 1st Int. Conf. Ambient Intell. Knowl. Inform. Ind. Electron. (AIKIIE 2023), 2023, doi: 10.1109/AIKIIE60097.2023.10390289.

N. Deshai and B. B. Rao, "Deep learning hybrid approaches to detect fake reviews and ratings," Journal of Scientific and Industrial Research, vol. 82, no. 1, pp. 120–127, 2023, doi: 10.56042/jsir.v82i1.69937.

S. Ashraf, F. Rehman, H. Sharif, H. Kim, H. Arshad, and H. Manzoor, "Fake reviews classification using deep learning," in *Proc. Int. Multi-disciplinary Conf. Emerg. Res. Trends (IMCERT 2023)*, 2023, pp. 1–8, doi: 10.1109/IMCERT57083.2023.10075156.

Arini, K. S. Harira, and Y. Durachman, "Extraction features using lexicon embedding on convolutional neural network for fake review detection," in Proc. 11th Int. Conf. Cyber IT Service Manag. (CITSM 2023), 2023, pp. 1–6, doi: 10.1109/CITSM60085.2023.10455445.

W. Choi, K. Nam, M. Park, S. Yang, S. Hwang, and H. Oh, "Fake review identification and utility evaluation model using machine learning," Frontiers in Artificial Intelligence, vol. 5, 2023, doi: 10.3389/frai.2022.1064371.

C. Sasikala, S. Ramya, S. RajeshKumar, K. B. Ramachandramoorthy, S. VijayaKumar, and K. Umapathi, "Fake review detection and classification using improved convolutional neural network on Amazon dataset," in Proc. 3rd Int. Conf. Pervasive Comput. Soc. Netw. (ICPCSN 2023), 2023, pp. 398–403, doi: 10.1109/ICPCSN58827.2023.00071.

A. Bader, Y. Suhweil, D. Shaheen, B. Alhijawi, and S. Abu-Soud, "Detecting ChatGPT-generated fake reviews using supervised machine learning," in Proc. 14th Int. Conf. Inf. Commun. Syst. (ICICS 2023), 2023, pp. 1–5, doi: 10.1109/ICICS60529.2023.10330528.

R. A. Duma, Z. Niu, A. S. S. Nyamawe, J. Tchaye-Kondi, and A. A. Yusuf, "A deep hybrid model for fake review detection by jointly leveraging review text, overall ratings, and aspect ratings," Soft Computing, vol. 27, no. 10, pp. 6281–6296, May 2023, doi: 10.1007/s00500-023-07897-4.

G. Bathla, P. Singh, R. K. Singh, E. Cambria, and R. Tiwari, "Intelligent fake reviews detection based on aspect extraction and analysis using deep learning," Neural Computing and Applications, vol. 34, pp. 20213–20229, Nov. 2022, doi: 10.1007/s00521-022-07531-8.

H. Cai, K. Yu, Y. Zhou, and X. Wu, "Co-attention based feature fusion network for spam review detection on Douban," Neural Processing Letters, vol. 54, no. 6, pp. 5251–5271, 2022, doi: 10.1007/s11063-022-10859-w.

J. Hou and A. Zhu, "Fake online review recognition algorithm and optimisation research based on deep learning," Applied Mathematics and Nonlinear Sciences, vol. 7, no. 2, pp. 861–874, 2022, doi: 10.2478/amns.2021.2.00170.

N. Deshai and B. B. Rao, "A detection of unfairness online reviews using deep learning," Journal of Theoretical and Applied Information Technology, vol. 100, no. 13, pp. 4738–4779, 2022.

K. Wu and F. Poursardar, "Towards undeceived: fake reviews detection models comparison," in Proc. IEEE 23rd Int. Conf. Inf. Reuse Integr. Data Sci. (IRI 2022), 2022, pp. 41–42, doi: 10.1109/IRI54793.2022.00021.

Y. Mei, "Spam detection with integrated review text and reviewer behavior," in Proc. IEEE Int. Conf. Adv. Electr. Eng. Comput. Appl. (AEECA 2022), 2022, pp. 1466–1474, doi: 10.1109/AEECA55500.2022.9919064.

R. Mothukuri, A. Aasritha, K. C. Maremalla, K. N. Pokala, and G. K. Perumalla, "Fake review detection using unsupervised learning," in Proc. Int. Conf. Sustain. Comput. Data Commun. Syst. (ICSCDS 2022), 2022, pp. 119–125, doi: 10.1109/ICSCDS53736.2022.9760908.

Y. Liu, L. Wang, T. Shi, and J. Li, "Detection of spam reviews through a hierarchical attention architecture with N-gram CNN and Bi-LSTM," Information Systems, vol. 103, 2022, doi: 10.1016/j.is.2021.101865.

J. Li, N. Li, L. Yang, and P. Zhang, "Identifying review spam with an unsupervised approach based on topic abuse," in ACM Int. Conf. Proc. Ser., 2022, pp. 350–356, doi: 10.1145/3532213.3532265.

P. M. Kumar, S. S. Harrsha, K. Abhiram, M. Kavitha, and M. Kalyani, "Role of machine learning in fake review detection," in Proc. 6th Int. Conf. Electron. Commun. Aerosp. Technol. (ICECA 2022), 2022, pp. 1026–1030, doi: 10.1109/ICECA55336.2022.10009174.

B. Saxena, S. Goyal, A. Kumari, and A. Agarwal, "Boosting accuracy of fake review prediction using synthetic minority oversampling technique," in Proc. IEEE Int. Conf. Comput. Commun. Intell. Syst. (ICCCIS 2022), 2022, pp. 156–161, doi: 10.1109/ICCCIS56430.2022.10037717.

F. Setievi, J. Natalia, T. R. Tjhang, I. S. Edbert, and D. Suhartono, "A comparative study of supervised machine learning algorithms for fake review detection," in Proc. 5th Int. Seminar Res. Inf. Technol. Intell. Syst. (ISRITI 2022), 2022, pp. 306–312, doi: 10.1109/ISRITI56927.2022.10052860.

A. Jakupov, J. Mercadal, B. Zeddini, and J. Longhi, "Analyzing deceptive opinion spam patterns: the topic modeling approach," in Proc. Int. Conf. Tools Artif. Intell. (ICTAI 2022), 2022, pp. 1251–1261, doi: 10.1109/ICTAI56018.2022.00190.

D. U. Vidanagama, A. T. P. Silva, and A. S. Karunananda, "Ontology-based sentiment analysis for fake review detection," Expert Systems with Applications, vol. 206, Nov. 2022, doi: 10.1016/j.eswa.2022.117869.

M. H. Al-Adhaileh and F. W. Alsaade, "Detecting and analysing fake opinions using artificial intelligence algorithms," Intelligent Automation and Soft Computing, vol. 32, no. 1, pp. 643–655, 2022, doi: 10.32604/iasc.2022.021225.

D. Purwitasari, A. A. Zaqiyah, and C. Fatichah, "Word-embedding model for evaluating text generation of imbalanced spam reviews," in Proc. Int. Conf. Adv. Comput. Sci. Inf. Syst. (ICACSIS 2021), 2021, doi: 10.1109/ICACSIS53237.2021.9631315.

M. Bhundiya and M. Trivedi, "An efficient spam review detection using active deep learning classifiers," in Proc. Int. Conf. Artif. Intell. Mach. Vis. (AIMV 2021), 2021, pp. 1–4, doi: 10.1109/AIMV53313.2021.9670966.

A. Toplu and H. Liu, "Designing a deceptive comment detection platform with a rule-based artificial intelligent architecture," in Proc. IEEE Int. Conf. Ind. Eng. Eng. Manag. (IEEM 2021), 2021, pp. 1442–1445, doi: 10.1109/IEEM50564.2021.9672994.

L. Shi, S. Xie, Y. Tao, L. Wei, and Y. Gao, "Fake review identification method based on topic model and att-BiLSTM," in ACM Int. Conf. Proc. Ser., 2021, pp. 204–208, doi: 10.1145/3483845.3483881.

V. P. Sumathi, S. M. Pudhiyavan, M. Saran, and V. N. Kumar, "Fake review detection of e-commerce electronic products using machine learning techniques," in Proc. Int. Conf. Adv. Electr. Electron. Commun. Comput. Autom. (ICAECA 2021), 2021, doi: 10.1109/ICAECA52838.2021.9675684.

G. S. Budhi, R. Chiong, and Z. Wang, "Resampling imbalanced data to detect fake reviews using machine learning classifiers and textual-based features," Multimedia Tools and Applications, vol. 80, pp. 13079–13097, 2021, doi: 10.1007/s11042-020-10299-5.

R. Hassan and M. R. Islam, "A supervised machine learning approach to detect fake online reviews," in Proc. 23rd Int. Conf. Comput. Inf. Technol. (ICCIT 2020), 2020, doi: 10.1109/ICCIT51783.2020.9392727.

M. Z. Asghar, A. Ullah, S. Ahmad, and A. Khan, "Opinion spam detection framework using hybrid classification scheme," Soft Computing, vol. 24, no. 5, pp. 3475–3498, 2020, doi: 10.1007/s00500-019-04107-y.

A. Sihombing and A. C. M. Fong, "Fake review detection on Yelp dataset using classification techniques in machine learning," in Proc. 4th Int. Conf. Contemp. Comput. Inform. (IC3I 2019), 2019, pp. 64–68, doi: 10.1109/IC3I46837.2019.9055644.

P. Hajek and A. Barushka, "A comparative study of machine learning methods for detection of fake online consumer reviews," in ACM Int. Conf. Proc. Ser., 2019, pp. 18–22, doi: 10.1145/3383902.3383909.

Y. Chai and W. Li, "Towards deep learning interpretability: a topic modeling approach," in Proc. 40th Int. Conf. Inf. Syst. (ICIS 2019), 2019.

Y. Shuqin and F. Jing, "Fake reviews detection based on text feature and behavior feature," in *Proc. IEEE Int. Conf. High Perform. Comput. Commun. Smart City Data Sci. Syst. (HPCC/SmartCity/DSS 2019)*, 2019, pp. 2007–2012, doi: 10.1109/HPCC/SmartCity/DSS.2019.00277.

S. Behdenna, F. Barigou, and G. Belalem, "Document-level sentiment analysis: a survey," EAI Endorsed Transactions on Context-Aware Systems and Applications, vol. 4, no. 13, 2018, doi: 10.4108/eai.14-3-2018.154339.

A. Farkhod, A. Abdusalomov, F. Makhmudov, and Y. I. Cho, "LDA-based topic modeling sentiment analysis using topic/document/sentence (TDS) model," Applied Sciences, vol. 11, no. 23, 2021, doi: 10.3390/app112311091.

Y. Li, C. Yin, S. Zhong, and X. Pan, "Multi-instance multi-label learning networks for aspect-category," arXiv preprint arXiv:2010.02656, 2020.

Q. Jiang, L. Chen, R. Xu, X. Ao, and M. Yang, "A challenge dataset and effective models for aspect-based sentiment analysis," in *Proc. 2019 Conf. Empir. Methods Nat. Lang. Process. 9th Int. Jt. Conf. Nat. Lang. Process. (EMNLP-IJCNLP 2019)*, 2019, pp. 6280–6285, doi: 10.18653/v1/D19-1654.

R. Taqiuddin, F. A. Bachtiar, and W. Purnomo, "Opinion spam classification on Steam review using support vector machine with lexicon-based features," Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control, vol. 6, no. 4, 2021, doi: 10.22219/kinetik.v6i4.1323.

J. Su and S. Yu, "Enhancing aspect-based sentiment analysis with capsule network," IEEE Access, vol. 8, pp. 100551–100561, 2020, doi: 10.1109/ACCESS.2020.2997675.

Y. Prakash and D. K. Sharma, "A survey on aspect-based sentiment analysis for online purchased product feedback," in Proc. 17th INDIACom; 10th Int. Conf. Comput. Sustain. Glob. Dev. (INDIACom 2023), 2023, pp. 1342–1347.

M. Ziaulla and A. Biradar, "Exploring sentiments in text: a survey of implicit and explicit aspect-based sentiment analysis," in Int. Jt. Conf. Adv. Comput. Intell., Singapore: Springer Nature Singapore, 2022, pp. 89–97, doi: 10.1007/978-981-97-0180-3.

L. Rossi, A. Karimi, and A. Prati, "Improving BERT performance for aspect-based sentiment analysis," arXiv preprint arXiv:2010.11731, 2020.

T. T. Khoei, H. O. Slimane, and N. Kaabouch, "Deep learning: systematic review, models, challenges, and research directions," Neural Computing and Applications, vol. 35, no. 31, pp. 23103–23124, 2023, doi: 10.1007/s00521-023-08957-4.

H. Benarafa, M. Benkhalifa, and M. Akhloufi, "An enhanced SVM model for implicit aspect identification in sentiment analysis," International Journal of Advanced Computer Science and Applications, vol. 14, no. 5, pp. 42–53, 2023, doi: 10.14569/IJACSA.2023.0140505.

Y. Feng, M. Zhou, and X. Tong, "Imbalanced classification: a paradigm-based review," Statistical Analysis and Data Mining: The ASA Data Science Journal, vol. 14, no. 5, pp. 383–406, 2021.

M. Khairy, T. M. Mahmoud, and T. Abd-El-Hafeez, "The effect of rebalancing techniques on the classification performance in cyberbullying datasets," Neural Computing and Applications, vol. 36, no. 3, pp. 1049–1065, 2024, doi: 10.1007/s00521-023-1730456.

T. Brychcín, M. Konkol, and J. Steinberger, "UWB: machine learning approach to aspect-based sentiment analysis," in Proc. 8th Int. Workshop Semant. Eval. (SemEval 2014), 2014, pp. 817–822.

Downloads

Published

2026-04-26

How to Cite

Abdul Rahim, L. A., Abu Samah, K. A. F., Dzulkalnine, M. F., Abdul Jalil, U. M., Jono, M. N. H. H., & Kamarudin, N. ‘Azwa. (2026). Research Trends on Sentiment Analysis and Imbalanced Data Handling in Fake Review Detection: A Systematic Literature Review. VFAST Transactions on Software Engineering, 14(2), 96–115. https://doi.org/10.21015/vtse.v14i2.2328