Skip to main content
Log in

Automatic Requirement Dependency Extraction Based on Integrated Active Learning Strategies

  • Research Article
  • Published:
Machine Intelligence Research Aims and scope Submit manuscript

Abstract

Since requirement dependency extraction is a cognitively challenging and error-prone task, this paper proposes an automatic requirement dependency extraction method based on integrated active learning strategies. In this paper, the coefficient of variation method was used to determine the corresponding weight of the impact factors from three different angles: uncertainty probability, text similarity difference degree and active learning variant prediction divergence degree. By combining the three factors with the proposed calculation formula to measure the information value of dependency pairs, the top K dependency pairs with the highest comprehensive evaluation value are selected as the optimal samples. As the optimal samples are continuously added into the initial training set, the performance of the active learning model using different dependency features for requirement dependency extraction is rapidly improved. Therefore, compared with other active learning strategies, a higher evaluation measure of requirement dependency extraction can be achieved by using the same number of samples. Finally, the proposed method using the PV-DM dependency feature improves the weight-F1 by 2.71%, the weight-recall by 2.45%, and the weight-precision by 2.64% in comparison with other strategies, saving approximately 46% of the labelled data compared with the machine learning approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. G. Deshpande, Q. Motger, C. Palomares, I. Kamra, K. Biesialska, X. Franch, G. Ruhe, J. Ho. Requirements dependency extraction by integrating active learning with ontology-based retrieval. In Proceedings of the 28th IEEE International Requirements Engineering Conference, IEEE, Zurich, Switzerland, pp. 78–89, 2020. DOI: https://doi.org/10.1109/RE48521.2020.00020.

    Google Scholar 

  2. Y. Priyadi, A. Djunaidy, D. Siahaan. Requirements dependency graph modeling on software requirements specification using text analysis. In Proceedings of the 1st International Conference on Cybernetics and Intelligent System, IEEE, Denpasar, Indonesia, pp. 221–226, 2019. DOI: https://doi.org/10.1109/ICORIS.2019.8874920.

    Google Scholar 

  3. R. Borrull Baraut. Incorporation of Models in Automatic Requirement Dependencies Detection, Master dissertation, Universitat Politècnica de Catalunya, Spain, 2018.

    Google Scholar 

  4. M. Atas, R. Samer, A. Felfernig. Automated identification of type-specific dependencies between requirements. In Proceedings of IEEE/WIC/ACM International Conference on Web Intelligence, IEEE, Santiago, Chile, pp. 688–695, 2018. DOI: https://doi.org/10.1109/WI.2018.00-10.

    Google Scholar 

  5. R. Samer, M. Stettinger, M. Atas, A. Felfernig, G. Ruhe, G. Deshpande. New approaches to the identification of dependencies between requirements. In Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, IEEE, Portland, USA, pp.2665–2700, 2019. DOI: https://doi.org/10.1109/ICTAI.2019.00-91.

    Google Scholar 

  6. G. Deshpande, C. Arora, G. Ruhe. Data-driven elicitation and optimization of dependencies between requirements. In Proceedings of the 27th International Requirements Engineering Conference, IEEE, Jeju, Republic of Korea, pp. 416–421, 2019. DOI: https://doi.org/10.1109/RE.2019.00055.

    Google Scholar 

  7. G. Deshpande. SReYantra: Automated software requirement inter-dependencies elicitation, analysis and learning. In Proceedings of the 41st International Conference on Software Engineering: Companion Proceedings, IEEE, Montreal, Canada, pp. 186–187, 2019. DOI: https://doi.org/10.1109/ICSE-Companion.2019.00076.

    Google Scholar 

  8. X. Q. Zhang, Y. Hu, Z. J. Xiao, J. S. Fang, R. Higashita, J. Liu. Machine learning for cataract ciassification/grading on ophthalmic imaging modalities: A survey. Machine Intelligence Research, vol. 19, no. 3, pp. 184–208, 2022. DOI: https://doi.org/10.1007/s11633-022-1329-0.

    Article  Google Scholar 

  9. A. Alqwadri, M. Azzeh, F. Almasalha. Application of machine learning for online reputation systems. International Journal of Automation and Computing, vol. 18, no. 3, pp. 492–502, 2021. DOI: https://doi.org/10.1007/s11633-020-1275-7.

    Article  Google Scholar 

  10. B. Settles. Active Learning Literature Survey, Computer Sciences Technical Report 1648, University of Wisconsin-Madison, USA, 2009.

    Google Scholar 

  11. B. Settles. Active Learning, Cham, Germany: Springer, 2012.

    Book  Google Scholar 

  12. B. Settles, M. Craven. An analysis of active learning strategies for sequence labeling tasks. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, ACL, Honolulu, Hawaii, pp. 1070–1079, 2008.

    Google Scholar 

  13. U. Naseem, M. Khushi, S. K. Khan, K. Shaukat, M. A. Moni. A comparative analysis of active learning for biomedical text mining. Applied System Innovation, vol. 4, no. 1, Article number 23, 2021. DOI: https://doi.org/10.3390/asi4010023.

  14. B. S. Yang, J. T. Sun, T. J. Wang, Z. Chen. Effective multi-label active learning for text classification. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, Paris, France, pp. 917–926, 2009. DOI: https://doi.org/10.1145/1557019.1557119.

    Chapter  Google Scholar 

  15. M. Goudjil, M. Koudil, M. Bedda, N. Ghoggali. A novel active learning method using SVM for text classification. International Journal of Automation and Computing, vol. 15, no. 3, pp. 290–298, 2018. DOI: https://doi.org/10.1007/s11633-015-0912-z.

    Article  Google Scholar 

  16. V. T. Dhinakaran, R. Pulle, N. Ajmeri, P. K. Murukannaiah. App review analysis via active learning: Reducing supervision effort without compromising classification accuracy. In Proceedings of the 26th International Requirements Engineering Conference, IEEE, Banff, Canada, pp. 170–181, 2018. DOI: https://doi.org/10.1109/RE.2018.00026.

    Google Scholar 

  17. K. Liu, X. Qian, Z. Q. Wang. Survey on active learning algorithms. Computer Engineering and Applications. Computer Engineering and Applications, vol. 48, no. 34, pp. 1–4, 22, 2012. DOI: https://doi.org/10.3778/j.issn.1002-8331.1205-0149. (in Chinese)

    Google Scholar 

  18. L. Copa, D. Tuia, M. Volpi, M. Kanevski. Unbiased query-by-bagging active learning for VHR image classification. In Proceedings of SPIE 7830, Image and Signal Processing for Remote Sensing XVI, SPIE, Toulouse, France, Article No. 78300K, 2010. DOI: https://doi.org/10.1117/12.864861.

    Google Scholar 

  19. X. B. Dong, Z. W. Yu, W. M. Cao, Y. F. Shi, Q. L. Ma. A survey on ensemble learning. Frontiers of Computer Science, vol. 14, no. 2, pp. 241–258, 2020. DOI: https://doi.org/10.1007/s11704-019-8208-z.

    Article  Google Scholar 

  20. F. Matloob, T. M. Ghazal, N. Taleb, S. Aftab, M. Ahmad, M. A. Khan, S. Abbas, T. R. Soomro. Software defect prediction using ensemble learning: A systematic literature review. IEEE Access, vol. 9, pp. 98754–98771, 2021. DOI: https://doi.org/10.1109/ACCESS.2021.3095559.

    Article  Google Scholar 

  21. T. G. Dietterich. Ensemble methods in machine learning. In Proceedings of the 1st International Workshop on Multiple Classifier Systems, Springer, Berlin, Heidelberg, 2000. DOI: https://doi.org/10.1007/3-540-45014-9_1.

    Google Scholar 

  22. F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, E. Duchesnay. Scikit-learn: Machine learning in Python. The Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.

    MathSciNet  Google Scholar 

  23. O. Sagi, L. Rokach. Ensemble learning: A survey. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol.8, no.4, Article number e1249, 2018.

  24. T. Mikolov, K. Chen, G. Corrado, J. Dean. Efficient estimation of word representations in vector space. In Proceedings of the 1st International Conference on Learning Representations, Scottsdale, USA, 2013.

  25. J. Bhatta, D. Shrestha, S. Nepal, S. Pandey, S. Koirala. Efficient estimation of Nepali word representations in vector space. Journal of Innovations in Engineering Education, vol. 3, no. 1, pp. 71–77, 2020. DOI: https://doi.org/10.3126/jiee.v3i1.34327.

    Article  Google Scholar 

  26. Q. V. Le, T. Mikolov. Distributed representations of sentences and documents. In Proceedings of the 31th International Conference on Machine Learning, Beijing, China, pp. 1188–1196, 2014.

  27. Q. Motger, R. Borrull, C. Palomares, J. Marco. OpenReq-DD: A. requirements dependency detection tool. In Proceedings of Workshops, Doctoral Symposium, Live Studies Track, and Poster Track Co-Located with the 25th International Conference on Requirements Engineering: Foundation for Software Quality, CEUR-WS.org, Essen, Germany, pp. 1–5, 2019.

    Google Scholar 

  28. H. Guan, Y. Lyu, C. Z. Jia. Automatic acquisition of requirement dependency based on syntax and semantics. Computer Technology and Development, vol. 31, no. 2, pp. 20–26, 2021. DOI: https://doi.org/10.3969/j.issn.1673-629X.2021.02.004. (in Chinese)

    Google Scholar 

  29. H. Guan, G. R. Cai, C. Zhao. An automatic approach to extracting requirement dependencies based on semantic web. In Proceedings of the 8th International Conference on Dependable Systems and Their Applications, IEEE, Yinchuan, China, pp. 414–420, 2021. DOI: https://doi.org/10.1109/DSA52907.2021.00062.

    Google Scholar 

  30. T. Y. Li, L. Liu, D. W. Zhao, Y. Cao. Eliciting relations from requirements text based on dependency analysis. Chinese Journal of Computers, vol. 36, no. 1, pp. 54–62, 2013. DOI: https://doi.org/10.3724/SP.J.1016.2013.00054. (in Chinese)

    Article  Google Scholar 

  31. S. T. Luo, C. H. Zhang, Y. Jin, Y. N. Liu. Determination of cross-cutting concerns by requirement dependency. Journal of Jilin University (Engineering and Technology Edition), vol. 41, no. 4, pp. 1065–1070, 2011. DOI: https://doi.org/10.13229/j.cnki.jdxbgxb2011.04.013. (in Chinese)

    Google Scholar 

  32. A. Goknil, I. Kurtev, K. van den Berg, W. Spijkerman. Change impact analysis for requirements: A metamodeling approach. Information and Software Technology, vol. 56, no. 8, pp. 950–972, 2014. DOI: https://doi.org/10.1016/j.infsof.2014.03.002.

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by the Scientific Research Funding Project of Education Department of Liaoning Province 2021, China (No. LJKZ0434).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hui Guan.

Ethics declarations

The authors declared that they have no conflicts of interest to this work.

Additional information

Colored figures are available in the online version at https://link.springer.com/journal/11633

Hui Guan received the B. Sc. and M. Sc. degrees in computer science from Shenyang Institute of Chemical Technology, China in 2000 and 2006, respectively, and the Ph. D. degree in software engineering from De Montfort University, UK in 2014. She is currently an associated professor at School of Computer Science and Technology, Shenyang University of Chemical Technology, China.

Her research interests include software evolution, semantic computing and software requirement engineering.

E-mail: h.guan@syuct.edu.cn (Corresponding author)

ORCID iD: 0000-0003-4456-6674

Guorong Cai received the B. Sc. degree in software engineering from Jiangxi Normal University, China in 2020. He is now a master student at Shenyang University of Chemical Technology, China.

His research interests include active learning, machine learning and data mining.

E-mail: 1456177729@qq.com

Hang Xu received the B. Sc. degree in applied physics from Suzhou University of Science and Technology, China in 2020. He is now a master student at Shenyang University of Chemical Technology, China.

His research interests include ensemble learning, machine learning, and data mining.

E-mail: 1974139682@qq.com

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Guan, H., Cai, G. & Xu, H. Automatic Requirement Dependency Extraction Based on Integrated Active Learning Strategies. Mach. Intell. Res. (2024). https://doi.org/10.1007/s11633-023-1420-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11633-023-1420-1

Keywords

Navigation