COMPARISON BAGGING AND SUPPORT VECTOR MACHINE FOR CLASSIFICATION SOFTWARE REQUIREMENT

Klaus Rajendra Wastu

Abstract


Software Requirements Specifications is a document that describes the requirements that occur in the development of a software system. The category of requirements is defined in two types: Functional Requirements (FR) and Non-Functional Requirements (NFR). Software Requirements Engineering is critical in successfully designing a piece of software. Many studies have examined the classification of software requirements using machine learning, but none have compared bagging algorithms with Support Vector Machine (SVM). This study compares text feature extraction techniques with machine learning algorithms Bagging and Support Vector Machine to solve the Software Requirement Classification problem. Using vectorization techniques from word2vec: Continuous Bag of Words and Skip-gram can help produce the best model performance for Bagging and SVM models. In this study, the data used is expansion data from the PROMISE repository, namely PROMISE_exp, the repository is a collection of software requirements data that has been labeled. To measure performance, this study uses an evaluation matrix, namely precision, recall and f1-score. As a result, the two models that have been trained using the Continuous Bag of Words and skip-gram vectorization techniques will be compared to determine the more optimal model for classifying software requirements from the promise_exp repository.


Keywords


CUDA; GPU; CPU; Parallel / Bagging; Word2vec; Support Vector Machine; Software Requirement; Machine Learning

Full Text:

PDF

References


E. Dias Canedo and B. Cordeiro Mendes, “Software Requirements Classification Using Machine Learning Algorithms,” Entropy, vol. 22, no. 9, p. 1057, Sep. 2020, doi: 10.3390/e22091057.

A. Arfiani and Z. Rustam, “Ovarian cancer data classification using bagging and random forest,” presented at the PROCEEDINGS OF THE 4TH INTERNATIONAL SYMPOSIUM ON CUR-RENT PROGRESS IN MATHEMATICS AND SCIENCES (ISCPMS2018), Depok, Indonesia, 2019, p. 020046. doi: 10.1063/1.5132473.

Z. S. H. Abad, O. Karras, P. Ghazi, M. Glinz, G. Ruhe, and K. Schneider, “What Works Better? A Study of Classifying Requirements,” in 2017 IEEE 25th International Requirements Engineering Conference (RE), Lisbon, Portugal: IEEE, Sep. 2017, pp. 496–501. doi: 10.1109/RE.2017.36.

Z. Kurtanovic and W. Maalej, “Automatically Classifying Functional and Non-functional Require-ments Using Supervised Machine Learning,” in 2017 IEEE 25th International Requirements Engi-neering Conference (RE), Lisbon, Portugal: IEEE, Sep. 2017, pp. 490–495. doi: 10.1109/RE.2017.82.

S. Panichella, A. Di Sorbo, E. Guzman, C. A. Visaggio, G. Canfora, and H. C. Gall, “How can i improve my app? Classifying user reviews for software maintenance and evolution,” in 2015 IEEE International Conference on Software Maintenance and Evolution (ICSME), Bremen, Germany: IEEE, Sep. 2015, pp. 281–290. doi: 10.1109/ICSM.2015.7332474.

M. Lu and P. Liang, “Automatic Classification of Non-Functional Requirements from Augmented App User Reviews,” in Proceedings of the 21st International Conference on Evaluation and Assess-ment in Software Engineering, Karlskrona Sweden: ACM, Jun. 2017, pp. 344–353. doi: 10.1145/3084226.3084241.

I. Hidayah, E. P. Adhistya, and M. A. Kristy, “Application of J48 and bagging for classification of vertebral column pathologies,” in Proceedings of the 6th International Conference on Information Technology and Multimedia, Putrajaya: IEEE, Nov. 2014, pp. 314–317. doi: 10.1109/ICIMU.2014.7066651.

R. Navarro-Almanza, R. Juarez-Ramirez, and G. Licea, “Towards Supporting Software Engineering Using Deep Learning: A Case of Software Requirements Classification,” in 2017 5th International Conference in Software Engineering Research and Innovation (CONISOFT), Mérida: IEEE, Oct. 2017, pp. 116–120. doi: 10.1109/CONISOFT.2017.00021.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, “Efficient Estimation of Word Representations in Vector Space.” arXiv, Sep. 06, 2013. Accessed: May 09, 2023. [Online]. Available: http://arxiv.org/abs/1301.3781

U. Krzeszewska, A. Poniszewska-Marańda, and J. Ochelska-Mierzejewska, “Systematic Com-parison of Vectorization Methods in Classification Context,” Applied Sciences, vol. 12, no. 10, p. 5119, May 2022, doi: 10.3390/app12105119.

M. Ali Fauzi, “Word2Vec model for sentiment analysis of product reviews in Indonesian lan-guage”, in International Journal of Electrical and Computer Engineering (IJECE) Vol. 9, No. 1, February 2019, pp. 525~530. doi: 10.11591/ijece.v9i1.pp525-530




DOI: https://doi.org/10.24167/proxies.v8i1.12475

Copyright (c) 2024 Proxies : Jurnal Informatika



View My Stats