Predictive Modelling on Competitor Analysis Performance by using Generalised Linear Models and Machine Learning Approach
DOI:
https://doi.org/10.37934/araset.45.1.5159Keywords:
Competitive analysis, Generalised linear model, Machine learningAbstract
Competitive analysis in digital and technology is trending in the business field. However, the field of digital and technology in the business world is vast and challenging to analyse. The purpose of this research is first to identify the success factor which represent the company performance. Second, is to identify the significant services provided by the company to their business user. Then, based on the first and second objectives, a predictive modelling is developed to produce the best solution to their business user. The research is implementing a case study from Telecommunication Company and using data science life cycle methodology. The statistical modelling that is used to develop the competitor’s analysis model is generalised linear model (GLM) which integrated with machine learning approach. Furthermore, the synthetic data set is created by using Gamma Distribution, Gaussian Distribution and Poisson Distribution due to some data from the case study is confidential. The synthetic data set is based on existing real data which are from Telecommunication Company sentiment analysis data, were used to investigate the performance of the proposed model. The machine learning technique is used to get the accuracy of the significant GLM which has been developed. The accuracy is tested by using the error rates of the machine learning technique which are Root Mean Square Error (RMSE), Mean Absolute Error (MAE) and R-squared. This research discovered that the business solution to the significant service for the business user and discovered the best statistical model to be used for the business solution. The results show that the Gumbel distribution is the best fit model for the synthetic dataset where the values of RMSE is 1.0574, MAE is 0.9168 and R-squared is 0.3994, and the significant success factor that has been identified by using the GLM is advertising success factor. The model developed can be improved with another type of data set and different sizes of data. Hence, further studies and real-world data are required for better validation.