skip to main content
10.1145/3372278.3390728acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
short-paper

Confidence-based Weighted Loss for Multi-label Classification with Missing Labels

Authors Info & Claims
Published:08 June 2020Publication History

ABSTRACT

The problem of multi-label classification with missing labels (MLML) is a common challenge that is prevalent in several domains, e.g. image annotation and auto-tagging. In multi-label classification, each instance may belong to multiple class labels simultaneously. Due to the nature of the dataset collection and labelling procedure, it is common to have incomplete annotations in the dataset, i.e. not all samples are labelled with all the corresponding labels. However, the incomplete data labelling hinders the training of classification models. MLML has received much attention from the research community. However, in cases where a pre-trained model is fine-tuned on an MLML dataset, there has been no straightforward approach to tackle the missing labels, specifically when there is no information about which are the missing ones. In this paper, we propose a weighted loss function to account for the confidence in each label/sample pair that can easily be incorporated to fine-tune a pre-trained model on an incomplete dataset. Our experiment results show that using the proposed loss function improves the performance of the model as the ratio of missing labels increases.

References

  1. Thierry Bertin-Mahieux, Douglas Eck, and Michael Mandel. 2011. Automatic tagging of audio: The state-of-the-art. In Machine audition: Principles, algorithms and systems. IGI Global, 334--352.Google ScholarGoogle Scholar
  2. Wei Bi and James T Kwok. 2014. Multilabel classification with label correlations and missing labels. In Proceedings of 28th AAAI Conference on Artificial Intelligence.Google ScholarGoogle ScholarCross RefCross Ref
  3. Matthew R Boutell, Jiebo Luo, Xipeng Shen, and Christopher M Brown. 2004. Learning multi-label scene classification. Pattern recognition, Vol. 37, 9 (2004), 1757--1771.Google ScholarGoogle Scholar
  4. Serhat Selcuk Bucak, Rong Jin, and Anil K. Jain. 2011. Multi-label learning with incomplete class assignments. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE Computer Society.Google ScholarGoogle Scholar
  5. Ricardo S Cabral, Fernando Torre, Jo ao P Costeira, and Alexandre Bernardino. 2011. Matrix completion for multi-label image classification. In Proceedings of the Advances in neural information processing systems.Google ScholarGoogle Scholar
  6. Gang Chen, Yangqiu Song, Fei Wang, and Changshui Zhang. 2008. Semi-supervised multi-label learning by solving a sylvester equation. In Proceedings of the 2008 SIAM International Conference on Data Mining.Google ScholarGoogle ScholarCross RefCross Ref
  7. Minmin Chen, Alice Zheng, and Kilian Weinberger. 2013. Fast image tagging. In Proceedings of the International conference on machine learning.Google ScholarGoogle Scholar
  8. Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: a real-world web image database from National University of Singapore. In Proceedings of the ACM international conference on image and video retrieval.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. In Proceedings of the IEEE conference on computer vision and pattern recognition.Google ScholarGoogle ScholarCross RefCross Ref
  10. Jia Deng, Olga Russakovsky, Jonathan Krause, Michael S Bernstein, Alex Berg, and Li Fei-Fei. 2014. Scalable multi-label annotation. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Thibaut Durand, Nazanin Mehrasa, and Greg Mori. 2019. Learning a Deep ConvNet for Multi-label Classification with Partial Labels. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Google ScholarGoogle ScholarCross RefCross Ref
  12. Charles Elkan and Keith Noto. 2008. Learning Classifiers from Only Positive and Unlabeled Data.Google ScholarGoogle Scholar
  13. Mingchen Gao, Ulas Bagci, Le Lu, Aaron Wu, Mario Buty, Hoo-Chang Shin, Holger Roth, Georgios Z Papadakis, Adrien Depeursinge, Ronald M Summers, et al. 2018. Holistic classification of CT attenuation patterns for interstitial lung diseases via deep convolutional neural networks. Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization, Vol. 6, 1 (2018), 1--6.Google ScholarGoogle ScholarCross RefCross Ref
  14. Eva Gibaja and Sebastián Ventura. 2015. A tutorial on multilabel learning. ACM Computing Surveys (CSUR), Vol. 47, 3 (2015), 52.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Zhi-Fen He, Ming Yang, Yang Gao, Hui-Dong Liu, and Yilong Yin. 2019. Joint multi-label classification and label correlations with missing labels and feature selection. Knowledge-Based Systems, Vol. 163 (2019), 145--158.Google ScholarGoogle ScholarCross RefCross Ref
  16. Mengying Hu, Hu Han, Shiguang Shan, and Xilin Chen. 2018. Multi-label Learning from Noisy Labels with Non-linear Feature Transformation. In Proceedings of the Asian Conference on Computer Vision.Google ScholarGoogle Scholar
  17. Jun Huang, Feng Qin, Xiao Zheng, Zekai Cheng, Zhixiang Yuan, Weigang Zhang, and Qingming Huang. 2019. Improving multi-label classification with missing labels by learning label-specific features. Information Sciences, Vol. 492 (2019), 124--146.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Simon Kornblith, Jonathon Shlens, and Quoc V Le. 2019. Do better imagenet models transfer better?. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Google ScholarGoogle ScholarCross RefCross Ref
  19. Yuncheng Li, Yale Song, and Jiebo Luo. 2017. Improving pairwise ranking for multi-label image classification. In Proceedings of the IEEE conference on computer vision and pattern recognition.Google ScholarGoogle ScholarCross RefCross Ref
  20. Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision.Google ScholarGoogle ScholarCross RefCross Ref
  21. Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In Proceedings of the European conference on computer vision.Google ScholarGoogle ScholarCross RefCross Ref
  22. Jan Margeta, Antonio Criminisi, R Cabrera Lozoya, Daniel C Lee, and Nicholas Ayache. 2017. Fine-tuned convolutional neural nets for cardiac MRI acquisition plane recognition. Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization, Vol. 5, 5 (2017), 339--349.Google ScholarGoogle ScholarCross RefCross Ref
  23. Andrew McCallum. 1999. Multi-label text classification with a mixture model trained by EM. In Proceedings of the AAAI workshop on Text Learning.Google ScholarGoogle Scholar
  24. Olivier Petit, Nicolas Thome, Arnaud Charnoz, Alexandre Hostettler, and Luc Soler. [n.d.]. Handling Missing Annotations for Semantic Segmentation with Deep ConvNets. Technical Report.Google ScholarGoogle Scholar
  25. Guo-Jun Qi, Xian-Sheng Hua, Yong Rui, Jinhui Tang, Tao Mei, and Hong-Jiang Zhang. 2007. Correlative multi-label video annotation. In Proceedings of the 15th ACM international conference on Multimedia.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Marina Sokolova and Guy Lapalme. 2009. A systematic analysis of performance measures for classification tasks. Information processing & management, Vol. 45, 4 (2009), 427--437.Google ScholarGoogle Scholar
  27. Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, and Alexander A Alemi. 2017. Inception-v4, inception-resnet and the impact of residual connections on learning. In Proceedings of the 31st AAAI Conference on Artificial Intelligence.Google ScholarGoogle ScholarCross RefCross Ref
  28. Nima Tajbakhsh, Jae Y Shin, Suryakanth R Gurudu, R Todd Hurst, Christopher B Kendall, Michael B Gotway, and Jianming Liang. 2016. Convolutional neural networks for medical image analysis: Full training or fine tuning? IEEE transactions on medical imaging, Vol. 35, 5 (2016), 1299--1312.Google ScholarGoogle Scholar
  29. Grigorios Tsoumakas, Anastasios Dimou, Eleftherios Spyromitros, Vasileios Mezaris, Ioannis Kompatsiaris, and Ioannis Vlahavas. 2009a. Correlation-based pruning of stacked binary relevance models for multi-label learning. In Proceedings of the 1st international workshop on learning from multi-label data.Google ScholarGoogle Scholar
  30. Grigorios Tsoumakas, Ioannis Katakis, and Ioannis Vlahavas. 2009b. Mining multi-label data. In Data mining and knowledge discovery handbook. Springer, 667--685.Google ScholarGoogle Scholar
  31. Arash Vahdat. 2017. Toward robustness against label noise in training deep discriminative neural networks. In Proceedings of the Advances in Neural Information Processing Systems.Google ScholarGoogle Scholar
  32. Byron C Wallace, Kevin Small, Carla E Brodley, and Thomas A Trikalinos. 2011. Class imbalance, redux. In Proceedings of the IEEE 11th international conference on data mining.Google ScholarGoogle ScholarCross RefCross Ref
  33. Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, and Wei Xu. 2016. Cnn-rnn: A unified framework for multi-label image classification. In Proceedings of the IEEE conference on computer vision and pattern recognition.Google ScholarGoogle ScholarCross RefCross Ref
  34. Baoyuan Wu, Zhilei Liu, Shangfei Wang, Bao-Gang Hu, and Qiang Ji. 2014. Multi-label learning with missing labels. In Proceedings of 22nd International Conference on Pattern Recognition.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Baoyuan Wu, Siwei Lyu, and Bernard Ghanem. 2015. Ml-mg: Multi-label learning with missing labels using a mixed graph. In Proceedings of the IEEE international conference on computer vision.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Miao Xu, Rong Jin, and Zhi-Hua Zhou. 2013. Speedup matrix completion with side information: Application to multi-label learning. In Proceedings of the Advances in neural information processing systems.Google ScholarGoogle Scholar
  37. Miao Xu, Gang Niu, Bo Han, Ivor W Tsang, Zhi-Hua Zhou, and Masashi Sugiyama. 2018. Matrix Co-completion for Multi-label Classification with Missing Features and Labels. arXiv preprint arXiv:1805.09156 (2018).Google ScholarGoogle Scholar
  38. Hsiang-Fu Yu, Prateek Jain, Purushottam Kar, and Inderjit Dhillon. 2014a. Large-scale multi-label learning with missing labels. In International conference on machine learning.Google ScholarGoogle Scholar
  39. Hsiang-Fu Yu, Prateek Jain, Purushottam Kar, and Inderjit Dhillon. 2014b. Large-scale multi-label learning with missing labels. In Proceedings of the International conference on machine learning.Google ScholarGoogle Scholar
  40. Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, and Oriol Vinyals. 2016. Understanding deep learning requires rethinking generalization. arXiv preprint arXiv:1611.03530 (2016).Google ScholarGoogle Scholar
  41. Min-Ling Zhang and Zhi-Hua Zhou. 2013. A review on multi-label learning algorithms. IEEE transactions on knowledge and data engineering, Vol. 26, 8 (2013), 1819--1837.Google ScholarGoogle Scholar
  42. Zongwei Zhou, Jae Shin, Lei Zhang, Suryakanth Gurudu, Michael Gotway, and Jianming Liang. 2017. Fine-tuning convolutional neural networks for biomedical image analysis: actively and incrementally. In Proceedings of the IEEE conference on computer vision and pattern recognition.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Confidence-based Weighted Loss for Multi-label Classification with Missing Labels

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      ICMR '20: Proceedings of the 2020 International Conference on Multimedia Retrieval
      June 2020
      605 pages
      ISBN:9781450370875
      DOI:10.1145/3372278

      Copyright © 2020 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 8 June 2020

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      Overall Acceptance Rate254of830submissions,31%

      Upcoming Conference

      ICMR '24
      International Conference on Multimedia Retrieval
      June 10 - 14, 2024
      Phuket , Thailand

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader