skip to main content
research-article

BiLabel-Specific Features for Multi-Label Classification

Published:20 July 2021Publication History
Skip Abstract Section

Abstract

In multi-label classification, the task is to induce predictive models which can assign a set of relevant labels for the unseen instance. The strategy of label-specific features has been widely employed in learning from multi-label examples, where the classification model for predicting the relevancy of each class label is induced based on its tailored features rather than the original features. Existing approaches work by generating a group of tailored features for each class label independently, where label correlations are not fully considered in the label-specific features generation process. In this article, we extend existing strategy by proposing a simple yet effective approach based on BiLabel-specific features. Specifically, a group of tailored features is generated for a pair of class labels with heuristic prototype selection and embedding. Thereafter, predictions of classifiers induced by BiLabel-specific features are ensembled to determine the relevancy of each class label for unseen instance. To thoroughly evaluate the BiLabel-specific features strategy, extensive experiments are conducted over a total of 35 benchmark datasets. Comparative studies against state-of-the-art label-specific features techniques clearly validate the superiority of utilizing BiLabel-specific features to yield stronger generalization performance for multi-label classification.

References

  1. J. Amores. 2013. Multiple instance classification: Review, taxonomy and comparative study. Artificial Intelligence 201 (2013), 81–105. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. R. Babbar and B. Schölkopf. 2019. Data scarcity, robustness and extreme multi-label classification. Machine Learning 108, 8--9 (2019), 1329–1351.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. K. Bhatia, H. Jain, P. Kar, M. Varma, and P. Jain. 2018. Sparse local embeddings for extreme multi-label classification. In Proceedings of the 28th International Conference on Advances in Neural Information Processing Systems. C. Cortes, N. Lawrence, D. Lee, M. Sugiyama, and R. Garnett (Eds.). MIT Press, Cambridge, MA, 730–738. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Boutell, J. Luo, X. Shen, and C. M. Brown. 2004. Learning multi-label scene classification. Pattern Recognition 37, 9 (2004), 1757–1771.Google ScholarGoogle ScholarCross RefCross Ref
  5. C. Brinker, E. Loza Mencía, and J. Fürnkranz. 2014. Graded multilabel classification by pairwise comparisons. In Proceedings of the 14th IEEE International Conference on Data Mining. 731–736. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Canuto, M. A. Gonçalves, and F. Benevenuto. 2016. Exploiting new sentiment-based meta-level features for effective sentiment analysis. In Proceedings of the 9th ACM International Conference on Web Search and Data Mining. 53–62. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M.-A. Carbonneaua, V. Cheplyginabc, E. Granger, and G. Gagnon. 2018. Multiple instance learning: A survey of problem characteristics and applications. Pattern Recognition 77 (2018), 329–353. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C.-C. Chang and C.-J. Lin. 2011. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2, 3, Article 27 (2011). Software available at http://www.csie.ntu.edu.tw/∼cjlin/libsvm. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Z.-M. Chen, X.-S. Wei, P. Wang, and Y.-W. Guo. 2019. Multi-label image recognition with graph convolutional networks. In Proceedings of the 32nd IEEE Conference on Computer Vision and Pattern Recognition. 5177–5186.Google ScholarGoogle ScholarCross RefCross Ref
  10. Z.-S. Chen and M.-L. Zhang. 2019. Multi-label learning with regularization enriched label-specific features. In Proceedings of the 11th Asian Conference on Machine Learning. 411–424.Google ScholarGoogle Scholar
  11. J. Demšar. 2006. Statistical comparisons of classifiers over multiple data sets. Journal of Machine Learning Research 7, 1 (2006), 1–30. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Feng and Z.-H. Zhou. 2017. Deep MIML network. In Proceedings of the 31st AAAI Conference on Artificial Intelligence. 1884–1890. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Fürnkranz, E. Hüllermeier, E. Loza Mencía, and K. Brinker. 2008. Multilabel classification via calibrated label ranking. Machine Learning 73, 2 (2008), 133–153. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. E. Gibaja and S. Ventura. 2015. A tutorial on multilabel learning. ACM Computing Surveys 47, 3, Article 52 (2015). Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. H. Gouk, B. Pfahringer, and M. Cree. 2016. Learning distance metrics for multi-label classification. In Proceedings of the 8th Asian Conference on Machine Learning. 318–333.Google ScholarGoogle Scholar
  16. Y. Guo, F. Chung, G. Li, J. Wang, and J. C. Gee. 2019. Leveraging label-specific discriminant mapping features for multi-label learning. ACM Transactions on Knowledge Discovery from Data 13, 2, Article 24 (2019). Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Huang, G. Li, Q. Huang, and X. Wu. 2016. Learning label-specific features and class-dependent labels for multi-label classification. IEEE Transactions on Knowledge and Data Engineering 28, 12 (2016), 3309–3323. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. Huang, G. Li, Q. Huang, and X. Wu. 2018. Joint feature selection and classification for multilabel learning. IEEE Transactions on Cybernetics 48, 3 (2018), 876–889.Google ScholarGoogle ScholarCross RefCross Ref
  19. J. Huang, G. Li, S. Wang, Z. Xue, and Q. Huang. 2017. Multi-label classification by exploiting local positive and negative pairwise label correlation. Neurocomputing 257 (2017), 164–174.Google ScholarGoogle ScholarCross RefCross Ref
  20. M. Huang, F. Zhuang, X. Zhang, X. Ao, Z. Niu, M.-L. Zhang, and Q. He. 2019. Supervised representation learning for multi-label classification. Machine Learning 108, 5 (2019), 747–763. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. S.-J. Huang, G.-X. Li, W.-Y. Huang, and S.-Y. Li. 2020. Incremental multi-label learning with active queries. Journal of Computer Science and Technology 35, 2 (2020), 234–246.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. B.-B. Jia and M.-L. Zhang. 2020. Multi-dimensional classification via kNN feature augmentation. Pattern Recognition 106 (2020), 107423.Google ScholarGoogle ScholarCross RefCross Ref
  23. B.-B. Jia and M.-L. Zhang. 2020. Multi-dimensional classification via stacked dependency exploitation. Science China Information Sciences 63, 12 (2020), 222102.Google ScholarGoogle ScholarCross RefCross Ref
  24. X.-Y. Jia, S.-S. Zhu, and W.-W. Li. 2020. Joint label-specific features and correlation information for multi-label learning. Journal of Computer Science and Technology 35, 2 (2020), 247–258.Google ScholarGoogle ScholarCross RefCross Ref
  25. X.-C. Li, D.-C. Zhan, J.-Q. Yang, and Y. Shi. 2021. Deep multiple instance selection. Science China Information Sciences 64, 3, Article 130102 (2021).Google ScholarGoogle Scholar
  26. Y. Li, Y. Song, and J. Luo. 2017. Improving pairwise ranking for multi-label image classification. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. 3617–3625.Google ScholarGoogle Scholar
  27. W. Liu and I. W. Tsang. 2015. Large margin metric learning for multi-label prediction. In Proceedings of the 29th AAAI Conference on Artificial Intelligence. 2800–2806. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. J. Ma, H. Zhang, and T. W. S. Chow. 2021. Multilabel classification with label-specific features and classifiers: A coarse- and fine-tuned framework. IEEE Transactions on Cybernetics 15, 2 (2021), 1028–1042.Google ScholarGoogle ScholarCross RefCross Ref
  29. Y. Ma, C. Cui, X. Nie, G. Yang, K. Shaheed, and Y. Yin. 2019. Pre-course student performance prediction with multi-instance multi-label learning. Science China Information Sciences 62, 2, Article 029101 (2019).Google ScholarGoogle Scholar
  30. R. B. Pereira, A. Plastino, B. Zadrozny, and L. H. C. Merschmann. 2018. Categorizing feature selection methods for multi-label classification. Artificial Intelligence Review 49, 1 (2018), 57–78. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Y. Prabhu and M. Varma. 2014. FastXML: A fast, accurate and stable tree-classifier for extreme multi-label learning. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 263–272. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. G.-J. Qi, X.-S. Hua, Y. Rui, J. Tang, T. Mei, and H.-J. Zhang. 2007. Correlative multi-label video annotation. In Proceedings of the 15th ACM International Conference on Multimedia. 17–26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. J. Read, C. Bielza, and P. Larrañaga. 2014. Multi-dimensional classification with super-classes. IEEE Transactions on Knowledge and Data Engineering 26, 7 (2014), 1720–1733.Google ScholarGoogle ScholarCross RefCross Ref
  34. J. Read, B. Pfahringer, G. Holmes, and E. Frank. 2011. Classifier chains for multi-label classification. Machine Learning 85, 3 (2011), 333–359. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. W. Siblini, P. Kuntz, and F. Meyer. 2018. CRAFTML, an efficient clustering-based random forest for extreme multi-label learning. In Proceedings of the 35th International Conference on Machine Learning. 4664–4673.Google ScholarGoogle Scholar
  36. L. Sun, S. Ji, and J. Ye. 2013. Multi-label Dimensionality Reduction. Chapman and Hall/CRC, Boca Ration, FL.Google ScholarGoogle Scholar
  37. L. Sun, M. Kudo, and K. Kimura. 2016. Multi-label classification with meta-label-specific features. In Proceedings of the 23rd International Conference on Pattern Recognition. 1612–1617.Google ScholarGoogle Scholar
  38. Y.-P. Sun and M.-L. Zhang. 2021. Compositional metric learning for multi-label classification. Frontiers of Computer Science 15, 5, Article 15532 (2021).Google ScholarGoogle Scholar
  39. G. Tsoumakas, I. Katakis, and I. Vlahavas. 2011. Random k-labelsets for multi-label classification. IEEE Transactions on Knowledge and Data Engineering 23, 7 (2011), 1079–1089. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. L. Wang, Z. Ding, S. Han, J.-J. Han, C. Choi, and Y. Fu. 2019. Generative correlation discovery network for multi-label learning. In Proceedings of the 19th IEEE International Conference on Data Mining. 588–597.Google ScholarGoogle Scholar
  41. Y. Wang, W. Zheng, Y. Cheng, and D. Zhao. 2020. Joint label completion and label-specific features for multi-label learning algorithm. Soft Computing 24, 9 (2020), 6553–6569.Google ScholarGoogle ScholarCross RefCross Ref
  42. W. Weng, Y.-N. Chen, C.-L. Chen, S.-X. Wu, and J.-H. Liu. 2020. Non-sparse label specific features selection for multi-label classification. Neurocomputing 377 (2020), 85–94.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. W. Weng, Y. Lin, S. Wu, Y. Li, and Y. Kang. 2018. Multi-label learning based on label-specific features and local pairwise label correlation. Neurocomputing 273 (2018), 385–394. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. X. Wu, Q.-G. Chen, Y. Hu, D. Wang, X. Chang, X. Wang, and M.-L. Zhang. 2019. Multi-view multi-label learning with view-specific information extraction. In Proceedings of the 28th International Joint Conference on Artificial Intelligence. 3884–3890. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Y. Xing, G. Yu, D. Carlotta, J. Wang, and Z. Zhang. 2018. Multi-label co-training. In Proceedings of the 27th International Joint Conference on Artificial Intelligence. 2882–2888. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. J.-H. Xu, H.-D. Tian, Z.-Y. Wang, Y. Wang, F. Chen, and W.-X. Kang. 2020. Joint input and output space learning for multi-label image classification. IEEE Transactions on Multimedia (2020).Google ScholarGoogle ScholarCross RefCross Ref
  47. M. Xu and L.-Z. Guo. 2021. Learning from group supervision: The impact of supervision deficiency on multi-label learning. Science China Information Sciences 64, 3, Article 130101 (2021).Google ScholarGoogle Scholar
  48. S. Xu, X. Yang, H. Yu, D.-J. Yu, J. Yang, and E. C. C. Tsang. 2016. Multi-label learning with label-specific feature reduction. Knowledge-Based Systems 104 (2016), 52–61. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Y. Yang and S. Gopal. 2012. Multilabel classification with meta-level features in a learning-to-rank framework. Machine Learning 88, 1--2 (2012), 47–68. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. C. K. Yeh, W. C. Wu, W. J. Ko, and Y. C. F. Wang. 2017. Learning deep latent spaces for multi-label classification. In Proceedings of the 31st AAAI Conference on Artificial Intelligence. 2838–2844. Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. C. Zhang, Z. Yu, Q. Hu, P. Zhu, X. Liu, and X. Wang. 2018. Latent semantic aware multi-view multi-label classification. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 4414–4421.Google ScholarGoogle Scholar
  52. J. Zhang, C. Li, D. Cao, Y. Lin, S. Su, L. Dai, and S. Li. 2018. Multi-label learning with label-specific features by resolving label correlations. Knowledge-Based Systems 159 (2018), 148–157. Google ScholarGoogle ScholarCross RefCross Ref
  53. M.-L. Zhang, Y.-K. Li, Y.-Y. Liu, and X. Geng. 2018. Binary relevance for multi-label learning: An overview. Frontiers of Computer Science 12, 2 (2018), 191–202. Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. M.-L. Zhang and L. Wu. 2015. LIFT: Multi-label learning with label-specific features. IEEE Transactions on Pattern Analysis and Machine Intelligence 37, 1 (2015), 107–120.Google ScholarGoogle ScholarCross RefCross Ref
  55. M.-L. Zhang and Z.-H. Zhou. 2014. A review on multi-label learning algorithms. IEEE Transactions on Knowledge and Data Engineering 26, 8 (2014), 1819–1837.Google ScholarGoogle ScholarCross RefCross Ref
  56. Q.-W. Zhang and M.-L. Zhang. 2018. Feature-induced labeling information enrichment for multi-label learning. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 4446–4453.Google ScholarGoogle Scholar
  57. Z.-H. Zhou, M.-L. Zhang, S.-J. Huang, and Y.-F. Li. 2012. Multi-instance multi-label learning. Artificial Intelligence 176, 1 (2012), 2291–2320. Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. S. Zhu, X. Ji, W. Xu, and Y. Gong. 2005. Multi-labelled classification using maximum entropy method. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 274–281. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. BiLabel-Specific Features for Multi-Label Classification

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Knowledge Discovery from Data
        ACM Transactions on Knowledge Discovery from Data  Volume 16, Issue 1
        February 2022
        475 pages
        ISSN:1556-4681
        EISSN:1556-472X
        DOI:10.1145/3472794
        Issue’s Table of Contents

        Copyright © 2021 Association for Computing Machinery.

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 20 July 2021
        • Accepted: 1 March 2021
        • Revised: 1 February 2021
        • Received: 1 June 2020
        Published in tkdd Volume 16, Issue 1

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader