Predictive Modelling on Competitor Analysis Performance by using Generalised Linear Models and Machine Learning Approach

Authors

  • Noryanti Muhammad Centre for Mathematical Sciences, Universiti Malaysia Pahang Al-Sultan Abdullah, Lebuhraya Persiaran Tun Khalil Yaakob, 26300 Kuantan, Pahang, Malaysia
  • Mohamad Nadzman Mohd Amin Centre for Artificial Intelligence & Data Science, Universiti Malaysia Pahang Al-Sultan Abdullah, Lebuhraya Persiaran Tun Khalil Yaakob, 26300 Kuantan, Pahang, Malaysia
  • Rose Adzreen Adnan Credence 1, Jalan Damansara, Damansara Kim, 60000 Kuala Lumpur, Malaysia

DOI:

https://doi.org/10.37934/araset.45.1.5159

Keywords:

Competitive analysis, Generalised linear model, Machine learning

Abstract

Competitive analysis in digital and technology is trending in the business field. However, the field of digital and technology in the business world is vast and challenging to analyse. The purpose of this research is first to identify the success factor which represent the company performance. Second, is to identify the significant services provided by the company to their business user. Then, based on the first and second objectives, a predictive modelling is developed to produce the best solution to their business user. The research is implementing a case study from Telecommunication Company and using data science life cycle methodology. The statistical modelling that is used to develop the competitor’s analysis model is generalised linear model (GLM) which integrated with machine learning approach. Furthermore, the synthetic data set is created by using Gamma Distribution, Gaussian Distribution and Poisson Distribution due to some data from the case study is confidential. The synthetic data set is based on existing real data which are from Telecommunication Company sentiment analysis data, were used to investigate the performance of the proposed model. The machine learning technique is used to get the accuracy of the significant GLM which has been developed. The accuracy is tested by using the error rates of the machine learning technique which are Root Mean Square Error (RMSE), Mean Absolute Error (MAE) and R-squared. This research discovered that the business solution to the significant service for the business user and discovered the best statistical model to be used for the business solution. The results show that the Gumbel distribution is the best fit model for the synthetic dataset where the values of RMSE is 1.0574, MAE is 0.9168 and R-squared is 0.3994, and the significant success factor that has been identified by using the GLM is advertising success factor. The model developed can be improved with another type of data set and different sizes of data. Hence, further studies and real-world data are required for better validation.

Downloads

Download data is not yet available.

Author Biographies

Noryanti Muhammad, Centre for Mathematical Sciences, Universiti Malaysia Pahang Al-Sultan Abdullah, Lebuhraya Persiaran Tun Khalil Yaakob, 26300 Kuantan, Pahang, Malaysia

noryanti@ump.edu.my

Mohamad Nadzman Mohd Amin, Centre for Artificial Intelligence & Data Science, Universiti Malaysia Pahang Al-Sultan Abdullah, Lebuhraya Persiaran Tun Khalil Yaakob, 26300 Kuantan, Pahang, Malaysia

nadzman98@gmail.com

Rose Adzreen Adnan, Credence 1, Jalan Damansara, Damansara Kim, 60000 Kuala Lumpur, Malaysia

roseadzreen.adnan@credence.tech

Published

2024-04-11

Issue

Section

Articles