demonstration

iART: A Search Engine for Art-Historical Images to Support Research in the Humanities

Authors:
Matthias Springstein

TIB - Leibniz Information Centre for Science and Technology, Hanover, Germany

TIB - Leibniz Information Centre for Science and Technology, Hanover, Germany
View Profile

,
Stefanie Schneider

Ludwig Maximilian University of Munich, Munich, Germany

Ludwig Maximilian University of Munich, Munich, Germany
View Profile

,
Javad Rahnama

University Paderborn, Paderborn, Germany

University Paderborn, Paderborn, Germany
View Profile

,
Eyke Hüllermeier

Ludwig Maximilian University of Munich, Munich, Germany

Ludwig Maximilian University of Munich, Munich, Germany
View Profile

,
Hubertus Kohle

Ludwig Maximilian University of Munich, Munich, Germany

Ludwig Maximilian University of Munich, Munich, Germany
View Profile

,
Ralph Ewerth

TIB - Leibniz Information Center for Science and Technology, Hanover, Germany

TIB - Leibniz Information Center for Science and Technology, Hanover, Germany
View Profile

MM '21: Proceedings of the 29th ACM International Conference on MultimediaOctober 2021Pages 2801–2803https://doi.org/10.1145/3474085.3478564

Published:17 October 2021Publication History

MM '21: Proceedings of the 29th ACM International Conference on Multimedia

Pages 2801–2803

ABSTRACT

In this paper, we introduce iART: an open Web platform for art-historical research that facilitates the process of comparative vision. The system integrates various machine learning techniques for keyword- and content-based image retrieval as well as category formation via clustering. An intuitive GUI supports users to define queries and explore results. By using a state-of-the-art cross-modal deep learning approach, it is possible to search for concepts that were not previously detected by trained classification models. Art-historical objects from large, openly licensed collections such as Amsterdam Rijksmuseum and Wikidata are made available to users.

Supplemental Material

de3242.mp4

mp4

24.8 MB

Download

de3242.mp4

mp4

24.8 MB

Download

References

Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Gregory S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian J. Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Józefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Mané, Rajat Monga, Sherry Moore, Derek Gordon Murray, Chris Olah, Mike Schuster, Jonathon Shlens, Benoit Steiner, Ilya Sutskever, Kunal Talwar, Paul A. Tucker, Vincent Vanhoucke, Vijay Vasudevan, Fernanda B. Viégas, Oriol Vinyals, Pete Warden, Martin Wattenberg, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. CoRR abs/1603.04467 (2016). arXiv:1603.04467 http://arxiv.org/abs/1603.04467Google Scholar
Matthias Becker, Martin Bogner, Fabian Bross, François Bry, Caterina Campanella, Laura Commare, Silvia Cramerotti, Katharina Jakob, Martin Josko, Fabian Kneißl, Hubertus Kohle, Thomas Krefeld, Elena Levushkina, Stephan Lücke, Alessandra Puglisi, Anke Regner, Christian Riepl, Clemens Schefels, Corina Schemainda, Eva Schmidt, Stefanie Schneider, Gerhard Schön, Klaus Schulz, Franz Siglmüller, Bartholomäus Steinmayr, Florian Störkle, Iris Teske, and Christoph Wieser. 2018. ARTigo -- Social Image Tagging [Dataset and Images]. https://doi.org/10.5282/ubm/data.136..Google Scholar
Reiner Diedrichs. 2021. Kenom Digitaler Münzkatalog. Retrieved June 15, 2021 from https://www.kenom.deGoogle Scholar
Jean-Bastien Grill, Florian Strub, Florent Altché, Corentin Tallec, Pierre H. Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Ávila Pires, Zhaohan Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Koray Kavukcuoglu, Rémi Munos, and Michal Valko. 2020. Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, and Hsuan-Tien Lin (Eds.). https://proceedings.neurips.cc/paper/2020/hash/f3ada80d5c4ee70142b17b8192b2958e-Abstract.htmlGoogle Scholar
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016. 770--778. https://doi.org/10.1109/CVPR.2016.90Google Scholar
Iconclass. 2021. Iconclass. Retrieved June 15, 2021 from http://www.iconclass.orgGoogle Scholar
Jeff Johnson, Matthijs Douze, and Hervé Jégou. 2017. Billion-Scale Similarity Search with GPUs. CoRR (2017). http://arxiv.org/abs/1702.08734Google Scholar
Sabine Lang and Björn Ommer. 2018. Attesting Similarity: Supporting the Organization and Study of Art Image Collections with Computer Vision. Digital Schol- arship in the Humanities 33, 4 (2018), 845--856. https://doi.org/10.1093/llc/fqy006Google Scholar
Leland McInnes and John Healy. 2018. UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. CoRR (2018). http://arxiv.org/abs/1802. 03426Google Scholar
Kiri Nichol. 2016. Painter by Numbers. Retrieved June 15, 2021 from https://www.kaggle.com/c/painter-by-numbersGoogle Scholar
The Metropolitan Museum of Art. 2020. iMet Collection 2020 - FGVC7. Retrieved June 15, 2021 from https://www.kaggle.com/c/imet-2020-fgvc7Google Scholar
Fabian Offert, Peter Bell, and Oleg Harlamov. 2020. imgs.ai. https://imgs.ai/. Accessed: 2021-06--15.Google Scholar
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Köpf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8-14, 2019, Vancouver, BC, Canada, Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d'Alché-Buc, Emily B. Fox, and Roman Garnett (Eds.). 8024--8035. https://proceedings.neurips.cc/paper/2019/hash/bdbca288fee7f92f2bfa9f7012727740-Abstract.html Google ScholarDigital Library
Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. 2021. Learning Transferable Visual Models From Natural Language Supervision. CoRR (2021). https://arxiv.org/abs/2103.00020Google Scholar
Rijksmuseum. 2021. Rijksmuseum Amsterdam, Home of the Dutch masters. Retrieved June 15, 2021 from https://www.rijksmuseum.nl/enGoogle Scholar
Luca Rossetto, Ivan Giangreco, Claudiu Tanase, and Heiko Schuldt. 2016. vitrivr: A Flexible Retrieval Stack Supporting Multiple Query Modes for Searching in Multimedia Collections. In 24th ACM International Conference on Multimedia. 1183--1186. https://dl.acm.org/doi/10.1145/2964284.2973797 Google ScholarDigital Library
Christoph Wieser, François Bry, Alexandre Bérard, and Richard Lagrange. 2013. ARTigo: Building an Artwork Search Engine with Games and Higher-Order Latent Semantic Analysis. In Disco 2013, Workshop on Human Computation and Machine Learning in Games at the International Conference on Human Computation (HComp). https://www.en.pms.ifi.lmu.de/publications/PMS-FB/PMS-FB-2013-3/PMS-FB-2013-3-paper.pdf[18] Wikimedia. 2019. Wikidata. Retrieved June 15, 2021 from https://www.wikidata.org/wiki/Wikidata:Main_PageGoogle Scholar
Heinrich Wölfflin. 1915. Kunstgeschichtliche Grundbegriffe. Bruckmann, Munich.Google Scholar

Index Terms

iART: A Search Engine for Art-Historical Images to Support Research in the Humanities
1. Human-centered computing
  1. Visualization
    1. Visualization application domains
      1. Information visualization
2. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Multimedia and multimodal retrieval
  2. World Wide Web
    1. Web applications

Recommendations

Relevance-based Margin for Contrastively-trained Video Retrieval Models
ICMR '22: Proceedings of the 2022 International Conference on Multimedia Retrieval

Video retrieval using natural language queries has attracted increasing interest due to its relevance in real-world applications, from intelligent access in private media galleries to web-scale video search. Learning the cross-similarity of video and ...
Read More
Fine-Grained Visual Textual Alignment for Cross-Modal Retrieval Using Transformer Encoders
Despite the evolution of deep-learning-based visual-textual processing systems, precise multi-modal matching remains a challenging task. In this work, we tackle the task of cross-modal retrieval through image-sentence matching based on word-region ...
Read More
IR Questioner: QA-based Interactive Retrieval System
ICMR '21: Proceedings of the 2021 International Conference on Multimedia Retrieval

Image retrieval from a given text query (text-to-image retrieval) is one of the most essential systems, and it is effectively utilized for databases (DBs) on the Web. To make them more versatile and familiar, a retrieval system that is adaptive even for ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MM '21: Proceedings of the 29th ACM International Conference on Multimedia
October 2021
5796 pages
ISBN:9781450386517
DOI:10.1145/3474085
General Chairs:
Heng Tao Shen
University of Electronic Science&Technology of China, China
,
Yueting Zhuang
Zhejiang University, China
,
John R. Smith
IBM, USA
,
Program Chairs:
Yang Yang
University of Electronic Science and Technology of China, China
,
Pablo Cesar
CWI&TU Delft, The Netherlands
,
Florian Metze
FACEBOOK, Inc., USA
,
Balakrishnan Prabhakaran
University of Texas at Dallas, USA
Copyright © 2021 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 17 October 2021
Check for updates
Author Tags
art retrieval
cross-modal retrieval
deep learning
web application
Qualifiers
- demonstration
Conference

Acceptance Rates
Overall Acceptance Rate995of4,171submissions,24%
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 137
  Total Downloads
- Downloads (Last 12 months)40
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

iART: A Search Engine for Art-Historical Images to Support Research in the Humanities

MM '21: Proceedings of the 29th ACM International Conference on Multimedia

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

Relevance-based Margin for Contrastively-trained Video Retrieval Models

Fine-Grained Visual Textual Alignment for Cross-Modal Retrieval Using Transformer Encoders

IR Questioner: QA-based Interactive Retrieval System