Abstract
Since requirement dependency extraction is a cognitively challenging and error-prone task, this paper proposes an automatic requirement dependency extraction method based on integrated active learning strategies. In this paper, the coefficient of variation method was used to determine the corresponding weight of the impact factors from three different angles: uncertainty probability, text similarity difference degree and active learning variant prediction divergence degree. By combining the three factors with the proposed calculation formula to measure the information value of dependency pairs, the top K dependency pairs with the highest comprehensive evaluation value are selected as the optimal samples. As the optimal samples are continuously added into the initial training set, the performance of the active learning model using different dependency features for requirement dependency extraction is rapidly improved. Therefore, compared with other active learning strategies, a higher evaluation measure of requirement dependency extraction can be achieved by using the same number of samples. Finally, the proposed method using the PV-DM dependency feature improves the weight-F1 by 2.71%, the weight-recall by 2.45%, and the weight-precision by 2.64% in comparison with other strategies, saving approximately 46% of the labelled data compared with the machine learning approach.
Similar content being viewed by others
References
G. Deshpande, Q. Motger, C. Palomares, I. Kamra, K. Biesialska, X. Franch, G. Ruhe, J. Ho. Requirements dependency extraction by integrating active learning with ontology-based retrieval. In Proceedings of the 28th IEEE International Requirements Engineering Conference, IEEE, Zurich, Switzerland, pp. 78–89, 2020. DOI: https://doi.org/10.1109/RE48521.2020.00020.
Y. Priyadi, A. Djunaidy, D. Siahaan. Requirements dependency graph modeling on software requirements specification using text analysis. In Proceedings of the 1st International Conference on Cybernetics and Intelligent System, IEEE, Denpasar, Indonesia, pp. 221–226, 2019. DOI: https://doi.org/10.1109/ICORIS.2019.8874920.
R. Borrull Baraut. Incorporation of Models in Automatic Requirement Dependencies Detection, Master dissertation, Universitat Politècnica de Catalunya, Spain, 2018.
M. Atas, R. Samer, A. Felfernig. Automated identification of type-specific dependencies between requirements. In Proceedings of IEEE/WIC/ACM International Conference on Web Intelligence, IEEE, Santiago, Chile, pp. 688–695, 2018. DOI: https://doi.org/10.1109/WI.2018.00-10.
R. Samer, M. Stettinger, M. Atas, A. Felfernig, G. Ruhe, G. Deshpande. New approaches to the identification of dependencies between requirements. In Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, IEEE, Portland, USA, pp.2665–2700, 2019. DOI: https://doi.org/10.1109/ICTAI.2019.00-91.
G. Deshpande, C. Arora, G. Ruhe. Data-driven elicitation and optimization of dependencies between requirements. In Proceedings of the 27th International Requirements Engineering Conference, IEEE, Jeju, Republic of Korea, pp. 416–421, 2019. DOI: https://doi.org/10.1109/RE.2019.00055.
G. Deshpande. SReYantra: Automated software requirement inter-dependencies elicitation, analysis and learning. In Proceedings of the 41st International Conference on Software Engineering: Companion Proceedings, IEEE, Montreal, Canada, pp. 186–187, 2019. DOI: https://doi.org/10.1109/ICSE-Companion.2019.00076.
X. Q. Zhang, Y. Hu, Z. J. Xiao, J. S. Fang, R. Higashita, J. Liu. Machine learning for cataract ciassification/grading on ophthalmic imaging modalities: A survey. Machine Intelligence Research, vol. 19, no. 3, pp. 184–208, 2022. DOI: https://doi.org/10.1007/s11633-022-1329-0.
A. Alqwadri, M. Azzeh, F. Almasalha. Application of machine learning for online reputation systems. International Journal of Automation and Computing, vol. 18, no. 3, pp. 492–502, 2021. DOI: https://doi.org/10.1007/s11633-020-1275-7.
B. Settles. Active Learning Literature Survey, Computer Sciences Technical Report 1648, University of Wisconsin-Madison, USA, 2009.
B. Settles. Active Learning, Cham, Germany: Springer, 2012.
B. Settles, M. Craven. An analysis of active learning strategies for sequence labeling tasks. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, ACL, Honolulu, Hawaii, pp. 1070–1079, 2008.
U. Naseem, M. Khushi, S. K. Khan, K. Shaukat, M. A. Moni. A comparative analysis of active learning for biomedical text mining. Applied System Innovation, vol. 4, no. 1, Article number 23, 2021. DOI: https://doi.org/10.3390/asi4010023.
B. S. Yang, J. T. Sun, T. J. Wang, Z. Chen. Effective multi-label active learning for text classification. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, Paris, France, pp. 917–926, 2009. DOI: https://doi.org/10.1145/1557019.1557119.
M. Goudjil, M. Koudil, M. Bedda, N. Ghoggali. A novel active learning method using SVM for text classification. International Journal of Automation and Computing, vol. 15, no. 3, pp. 290–298, 2018. DOI: https://doi.org/10.1007/s11633-015-0912-z.
V. T. Dhinakaran, R. Pulle, N. Ajmeri, P. K. Murukannaiah. App review analysis via active learning: Reducing supervision effort without compromising classification accuracy. In Proceedings of the 26th International Requirements Engineering Conference, IEEE, Banff, Canada, pp. 170–181, 2018. DOI: https://doi.org/10.1109/RE.2018.00026.
K. Liu, X. Qian, Z. Q. Wang. Survey on active learning algorithms. Computer Engineering and Applications. Computer Engineering and Applications, vol. 48, no. 34, pp. 1–4, 22, 2012. DOI: https://doi.org/10.3778/j.issn.1002-8331.1205-0149. (in Chinese)
L. Copa, D. Tuia, M. Volpi, M. Kanevski. Unbiased query-by-bagging active learning for VHR image classification. In Proceedings of SPIE 7830, Image and Signal Processing for Remote Sensing XVI, SPIE, Toulouse, France, Article No. 78300K, 2010. DOI: https://doi.org/10.1117/12.864861.
X. B. Dong, Z. W. Yu, W. M. Cao, Y. F. Shi, Q. L. Ma. A survey on ensemble learning. Frontiers of Computer Science, vol. 14, no. 2, pp. 241–258, 2020. DOI: https://doi.org/10.1007/s11704-019-8208-z.
F. Matloob, T. M. Ghazal, N. Taleb, S. Aftab, M. Ahmad, M. A. Khan, S. Abbas, T. R. Soomro. Software defect prediction using ensemble learning: A systematic literature review. IEEE Access, vol. 9, pp. 98754–98771, 2021. DOI: https://doi.org/10.1109/ACCESS.2021.3095559.
T. G. Dietterich. Ensemble methods in machine learning. In Proceedings of the 1st International Workshop on Multiple Classifier Systems, Springer, Berlin, Heidelberg, 2000. DOI: https://doi.org/10.1007/3-540-45014-9_1.
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, E. Duchesnay. Scikit-learn: Machine learning in Python. The Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.
O. Sagi, L. Rokach. Ensemble learning: A survey. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol.8, no.4, Article number e1249, 2018.
T. Mikolov, K. Chen, G. Corrado, J. Dean. Efficient estimation of word representations in vector space. In Proceedings of the 1st International Conference on Learning Representations, Scottsdale, USA, 2013.
J. Bhatta, D. Shrestha, S. Nepal, S. Pandey, S. Koirala. Efficient estimation of Nepali word representations in vector space. Journal of Innovations in Engineering Education, vol. 3, no. 1, pp. 71–77, 2020. DOI: https://doi.org/10.3126/jiee.v3i1.34327.
Q. V. Le, T. Mikolov. Distributed representations of sentences and documents. In Proceedings of the 31th International Conference on Machine Learning, Beijing, China, pp. 1188–1196, 2014.
Q. Motger, R. Borrull, C. Palomares, J. Marco. OpenReq-DD: A. requirements dependency detection tool. In Proceedings of Workshops, Doctoral Symposium, Live Studies Track, and Poster Track Co-Located with the 25th International Conference on Requirements Engineering: Foundation for Software Quality, CEUR-WS.org, Essen, Germany, pp. 1–5, 2019.
H. Guan, Y. Lyu, C. Z. Jia. Automatic acquisition of requirement dependency based on syntax and semantics. Computer Technology and Development, vol. 31, no. 2, pp. 20–26, 2021. DOI: https://doi.org/10.3969/j.issn.1673-629X.2021.02.004. (in Chinese)
H. Guan, G. R. Cai, C. Zhao. An automatic approach to extracting requirement dependencies based on semantic web. In Proceedings of the 8th International Conference on Dependable Systems and Their Applications, IEEE, Yinchuan, China, pp. 414–420, 2021. DOI: https://doi.org/10.1109/DSA52907.2021.00062.
T. Y. Li, L. Liu, D. W. Zhao, Y. Cao. Eliciting relations from requirements text based on dependency analysis. Chinese Journal of Computers, vol. 36, no. 1, pp. 54–62, 2013. DOI: https://doi.org/10.3724/SP.J.1016.2013.00054. (in Chinese)
S. T. Luo, C. H. Zhang, Y. Jin, Y. N. Liu. Determination of cross-cutting concerns by requirement dependency. Journal of Jilin University (Engineering and Technology Edition), vol. 41, no. 4, pp. 1065–1070, 2011. DOI: https://doi.org/10.13229/j.cnki.jdxbgxb2011.04.013. (in Chinese)
A. Goknil, I. Kurtev, K. van den Berg, W. Spijkerman. Change impact analysis for requirements: A metamodeling approach. Information and Software Technology, vol. 56, no. 8, pp. 950–972, 2014. DOI: https://doi.org/10.1016/j.infsof.2014.03.002.
Acknowledgements
This work was supported by the Scientific Research Funding Project of Education Department of Liaoning Province 2021, China (No. LJKZ0434).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
The authors declared that they have no conflicts of interest to this work.
Additional information
Colored figures are available in the online version at https://link.springer.com/journal/11633
Hui Guan received the B. Sc. and M. Sc. degrees in computer science from Shenyang Institute of Chemical Technology, China in 2000 and 2006, respectively, and the Ph. D. degree in software engineering from De Montfort University, UK in 2014. She is currently an associated professor at School of Computer Science and Technology, Shenyang University of Chemical Technology, China.
Her research interests include software evolution, semantic computing and software requirement engineering.
E-mail: h.guan@syuct.edu.cn (Corresponding author)
ORCID iD: 0000-0003-4456-6674
Guorong Cai received the B. Sc. degree in software engineering from Jiangxi Normal University, China in 2020. He is now a master student at Shenyang University of Chemical Technology, China.
His research interests include active learning, machine learning and data mining.
E-mail: 1456177729@qq.com
Hang Xu received the B. Sc. degree in applied physics from Suzhou University of Science and Technology, China in 2020. He is now a master student at Shenyang University of Chemical Technology, China.
His research interests include ensemble learning, machine learning, and data mining.
E-mail: 1974139682@qq.com
Rights and permissions
About this article
Cite this article
Guan, H., Cai, G. & Xu, H. Automatic Requirement Dependency Extraction Based on Integrated Active Learning Strategies. Mach. Intell. Res. (2024). https://doi.org/10.1007/s11633-023-1420-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11633-023-1420-1