Graph-Based Image Retrieval: State of the Art

Belahyane, Imane; Mammass, Mouad; Abioui, Hasna; Idarrou, Ali

doi:10.1007/978-3-030-51935-3_32

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 12119)

Included in the following conference series:

International Conference on Image and Signal Processing

Abstract

This paper deals with the problem of semantic Image Retrieval. Images have recently gained popularity in several domains, such as medicine and marketing, and play a vital role in documentation. However, finding relevant visual information in an image is a major challenge for the Image Retrieval community and a widely discussed issue in digital image processing. An image must be extracted from a large collection of images in order to respond to the user’s need, and Image Retrieval processes based on classical techniques may not be sufficient for the user. For several years, great effort has been devoted to integrating the semantic aspect in order to enhance the relevance of results and to take the high-level content of the image into account. This paper presents a state of the art of Image Retrieval approaches using graph theory, motivated by the growing interest in graphs in terms of performance, representation, and their ability to integrate the semantic aspect. We review a number of recent graph-based approaches to Image Retrieval, aiming to determine the factors that add a semantic aspect to an Image Retrieval system.

1 Introduction

Since the advent of information technology, the number of multimedia documents has grown continuously. In fact, more than 80% of company and organization data is in the form of documents. Multimedia documents are characterized by rich content and complex structures, which complicates access to specific granules in such documents and therefore makes document retrieval a tedious task. The graph is among the most effective representation models, since it allows the representation of complex and connected data such as multimedia documents. Comparing two documents structurally means comparing the graphs that represent them. Graph theory can therefore be of great interest in evaluating structural similarity.

In general, comparing two graphs amounts to finding the best matching between them. The general approaches proposed by graph theory for matching are exact matching and approximate matching. In many fields of application, the goal is not to show that two graphs are structurally identical; it is more interesting to know how similar the graphs are. In such applications, graph similarity based on exact matching is not appropriate. For this purpose, approximate, error-tolerant graph matching based on finding the maximum common subgraph or on computing the graph edit distance has been proposed in [8, 9, 17]. In the context of images, finding a granule of visual information in an image requires special techniques in order to respond to the user’s need. Image Retrieval can be based either on text or on visual content. However, a key limitation of traditional image retrieval systems is that they ignore the semantic aspect. Several works have used graphs to represent images [2, 6, 7, 32]. As noted in [1], graphs are rich data structures with the ability to represent complex and structured objects.
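To make the idea of error-tolerant matching concrete, the following minimal sketch computes a unit-cost graph edit distance under a strong simplifying assumption that is not part of the works cited above: every node carries a unique label, so the node correspondence is fixed by the labels and no search over matchings is needed. Under that assumption, and with node relabeling disallowed, the distance reduces to counting the nodes and edges present in only one of the two graphs. The function names and the example labels are hypothetical.

```python
def as_undirected(edges):
    """Store each undirected edge as a frozenset so (u, v) equals (v, u)."""
    return {frozenset(edge) for edge in edges}


def edit_distance_unique_labels(nodes_a, edges_a, nodes_b, edges_b):
    """Unit-cost edit distance, ASSUMING unique node labels and no relabeling.

    Each node or edge found in only one graph costs one insertion or deletion.
    """
    node_edits = len(set(nodes_a) ^ set(nodes_b))
    edge_edits = len(as_undirected(edges_a) ^ as_undirected(edges_b))
    return node_edits + edge_edits


def similarity(nodes_a, edges_a, nodes_b, edges_b):
    """Map the distance to [0, 1]: 1.0 means identical graphs."""
    size = len(set(nodes_a) | set(nodes_b)) + len(
        as_undirected(edges_a) | as_undirected(edges_b)
    )
    if size == 0:
        return 1.0  # two empty graphs are identical
    distance = edit_distance_unique_labels(nodes_a, edges_a, nodes_b, edges_b)
    return 1.0 - distance / size


# Two hypothetical region graphs: nodes are region labels, edges are adjacency.
nodes_a = {"sky", "tree", "grass"}
edges_a = {("sky", "tree"), ("tree", "grass")}
nodes_b = {"sky", "tree", "house"}
edges_b = {("sky", "tree"), ("tree", "house")}

print(edit_distance_unique_labels(nodes_a, edges_a, nodes_b, edges_b))
print(similarity(nodes_a, edges_a, nodes_b, edges_b))
```

Without the unique-label assumption, the edit distance requires searching over node correspondences, which is why the approximate methods discussed in the literature exist.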

In this paper, we present the basic concepts of Image Retrieval and of image retrieval techniques. A comparative study of image retrieval techniques is also carried out to present their advantages and drawbacks. The central question of this work is the integration of the semantic aspect in graph-based Image Retrieval approaches.

The remainder of this paper is organized as follows. Section 2 describes the concept of Image Retrieval and its techniques, and presents the advantages and limits of each type of technique. Section 3 presents a comparative study of graph-based Image Retrieval and outlines the integration of the semantic aspect in graph-based approaches. Finally, Section 4 discusses our central question, graph-based semantic Image Retrieval.

2 Image Retrieval

Information Retrieval (IR) is concerned with the acquisition, structuring, storage, retrieval, and ranking of information [13]. It is based on the information needs of the user. This task can be applied to all types of data: text, images, video, and music tracks. An image is the visual representation of an object on a given medium or support.

2.1 Image Retrieval Techniques

Image Retrieval (ImR) is the area that studies how to find images in an image collection and how to rank them by similarity to the user’s request. ImR attracts the attention of many researchers in fields such as digital libraries, remote sensing, and astronomy. It has been a very active field since the 1970s. ImR systems can be classified into three techniques [20]:

Text-based Image Retrieval (TBIR) is a system for retrieving images by text queries. The extraction of an image similar to a user query is based on an indexing process, which attaches a set of descriptors to each image. This technique uses the textual indexing of an image, its metadata, or the textual elements attached to the image. Much research has been done on TBIR, but most of it is dated, owing to the greater importance since given to the other types of ImR. TBIR can be based on annotation [15]. The major drawback of this approach is that a descriptive annotation must be produced manually, hence the complexity of the task. One of the first examples of TBIR is [36], which presents a framework that annotates images with text and then uses text-based databases.
Content-based Image Retrieval (CBIR): over the last decades, interest in image collections has grown with the development of image acquisition devices, storage capacities, and the availability of high-quality digitization techniques. CBIR is a system based on colors, textures, shapes, and other characteristics (depending on the user’s needs). In other words, it consists of extracting visual descriptors and retrieving by visual similarity. This technique responds to many needs in the field of ImR and overcomes the limitations of TBIR. An image can be described by a weighting function that reflects the importance of the features and varies widely according to the system and its objectives. Work [34] presents an effective content-based visual ImR system that extracts a color histogram and spatial information. Work [14] presents a CBIR approach using a computational visual attention model, based on saliency regions, energy features of the gray-level co-occurrence, and a saliency structure histogram. Work [16] presents an approach using a local visual attention feature, based on a fast and efficient salient point detector and on salient point expansion.
Semantic-based Image Retrieval (SBIR) is the technique that describes an image using semantic terms in order to determine the meaning conveyed by the image. SBIR can be achieved by extracting visual descriptors from the image in order to identify significant and interesting regions, followed by a knowledge extraction process that yields a semantic description of the image. Several factors contribute to this technique; Section 3 focuses on them. Table 1 summarizes the advantages and limits of each technique.
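As an illustration of retrieval by visual similarity, the sketch below ranks a small collection by color alone. It is not the method of any cited work: the quantization into four bins per channel, the use of histogram intersection as the similarity measure, and all function and image names are assumptions chosen for brevity.

```python
from collections import Counter


def color_histogram(pixels, bins_per_channel=4):
    """Normalized histogram of quantized (r, g, b) pixels with values in 0..255."""
    step = 256 // bins_per_channel
    counts = Counter((r // step, g // step, b // step) for r, g, b in pixels)
    total = len(pixels)
    return {bin_id: count / total for bin_id, count in counts.items()}


def histogram_intersection(h1, h2):
    """Similarity in [0, 1]: the mass shared by two normalized histograms."""
    return sum(min(value, h2.get(bin_id, 0.0)) for bin_id, value in h1.items())


def rank_by_color(query_pixels, collection):
    """Return image names ordered from most to least similar to the query."""
    query_hist = color_histogram(query_pixels)
    scores = {
        name: histogram_intersection(query_hist, color_histogram(pixels))
        for name, pixels in collection.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical four-pixel images, kept tiny so the result is easy to follow.
query = [(255, 0, 0)] * 4
collection = {
    "blue": [(0, 0, 255)] * 4,
    "mixed": [(255, 0, 0), (255, 0, 0), (0, 0, 255), (0, 0, 255)],
    "red": [(250, 10, 5)] * 4,
}
print(rank_by_color(query, collection))
```

A purely color-based ranking of this kind has no notion of what the image depicts, which is the gap that SBIR aims to close.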

Table 1. Advantages and Limits of ImR techniques

From Table 1, we conclude that each technique has its advantages and limitations; the choice among TBIR, CBIR, and SBIR therefore depends on the objective of the task carried out by the user and on the context studied. Several works combine more than one category in order to increase the relevance of the result. Work [25] presents a decisive content-based ImR approach for feature fusion in visual and textual images. In [30], a system is proposed that combines textual and visual statistics in a single index vector for content-based retrieval. Work [29] presents a content-based system that develops its own ontology module, which significantly increases the relevance of retrieval results by enhancing the ranking of images.

2.2 Image Representation Models

For several years, great effort has been devoted to the study of image representation models. The most commonly used are vectors, strings, trees, and graphs. Vectors are often used in ImR; works [16, 29, 34] model the image as a vector.

A string is an ordered set of elements, used in ImR when the order of elements matters. The distance between two strings is often defined by the Levenshtein edit distance [33], as in [23]. However, vector-based and string-based approaches remain limited by their poor modeling power and may not be suitable in all situations for modeling complex objects. Likewise, the tree-based model allows the representation of hierarchical relationships but is not practical for modeling complex relations. To solve this issue, many researchers have proposed the use of graph theory in ImR.

The graph-based model makes it possible to represent all possible relations between components; the semantics associated with an arc is not limited to a typing or membership relation. Graphs offer a very rich modeling of documents and their structures. A graph-based image is represented as a set of components and a set of binary relations between these components. Graphs are widely used in many applications due to their very high expressiveness in terms of structure and semantics. Note that strings and trees are particular graphs.

The application of graph theory to IR has been studied in several works because of its advantages in improving the efficiency of the IR engine. Graph-based measures allow the graph to be used as a semantic representation model for queries and documents and to be exploited in a semantic document search model [10, 24, 28].
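For string-based representations, the Levenshtein distance counts the minimum number of single-element insertions, deletions, and substitutions needed to turn one sequence into another. The following is a standard dynamic-programming sketch; applying it to sequences of color labels is only an illustrative assumption and does not reproduce the encoding used in [23].

```python
def levenshtein(a, b):
    """Edit distance between two sequences (strings or lists of symbols)."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, item_a in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty sequence
        for j, item_b in enumerate(b, start=1):
            substitution = prev[j - 1] + (0 if item_a == item_b else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


# Works on plain strings...
print(levenshtein("kitten", "sitting"))
# ...and on sequences of hypothetical color labels describing two images.
print(levenshtein(["red", "green", "blue"], ["red", "blue"]))
```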

3 Graph-Based Semantic Image Retrieval

3.1 Graph-Based Image Retrieval

In graph data, nodes represent entities and edges represent relationships between nodes. A graph structure is able to represent both the meaning of entities and the relationships between them. This ability makes graphs increasingly popular in computer science. In general, the mathematical theory of graphs can be of great interest in measuring the similarity of objects. In [12], sub-graph isomorphism is used to show the inclusion or the equivalence of two graphs. Below, we give the mathematical definition of a graph:

Definition 1

A graph G can be defined by a pair (V, E), where V is the set of nodes of G and \(E \subseteq V \times V\) represents the set of edges of G (relations between nodes).

The following subsection reviews a number of works on graph-based ImR.
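Definition 1 translates directly into a small data structure. The sketch below stores V as a set of nodes and E as a set of ordered node pairs, and checks that every edge is drawn from V × V. The class name, its methods, and the region labels are hypothetical and serve only to illustrate the definition.

```python
from dataclasses import dataclass, field


@dataclass
class Graph:
    """A graph G = (V, E) with E a set of (source, target) node pairs."""

    nodes: set = field(default_factory=set)
    edges: set = field(default_factory=set)

    def add_edge(self, source, target):
        """Add an edge and make sure both endpoints belong to V."""
        self.nodes.update((source, target))
        self.edges.add((source, target))

    def is_well_formed(self):
        """Check that every edge connects two nodes of V."""
        return all(u in self.nodes and v in self.nodes for u, v in self.edges)

    def neighbors(self, node):
        """Nodes reachable from the given node by one outgoing edge."""
        return {v for u, v in self.edges if u == node}


# Hypothetical image regions linked by a spatial relation ("is above").
g = Graph()
g.add_edge("sky", "sea")
g.add_edge("sea", "boat")
print(sorted(g.nodes), g.is_well_formed(), g.neighbors("sea"))
```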

3.2 Factors Adding Semantic Aspect

Taking the semantic aspect into account in ImR means retrieving relevant results while considering the overall meaning of the image. SBIR aims to give an adequate interpretation of the image. The purpose of the following table is to determine the factors that add a semantic aspect to graph-based ImR.

Table 2. Approaches using graphs

Table 2 shows ImR approaches classified according to the graph model and the factors reflecting the semantic aspect in graph-based ImR. These factors contribute to enhancing the semantic aspect of the ImR system.

In the following, the factors are grouped in order to identify clearly those that implicitly or explicitly influence the semantics of ImR approaches.

To evaluate retrieval performance, work [5] designs an automatic scheme to simulate relevance feedback. The simulation automatically classifies a database image as relevant if it belongs to the same semantic category as the initial query. The authors report experimental results showing that the relevance feedback technique improves retrieval performance for semantic categories with clear region correspondence.

In [35], relevance feedback is defined as a powerful interactive technique used to improve the performance of ImR systems. With user-provided relevant/irrelevant information on the retrieved images, the system can capture the semantic concept of the query more accurately and gradually improve retrieval precision.
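The simulation rule described above (an image counts as relevant when it shares the semantic category of the query) can be sketched in a few lines. This is only an illustration of that rule, not the scheme of [5], which also involves region correspondence; the function names, image identifiers, and categories are hypothetical.

```python
def simulate_feedback(query_category, retrieved, categories):
    """Split retrieved images into relevant and irrelevant lists.

    An image is marked relevant when its ground-truth category equals the
    category of the query, which stands in for a human user's judgment.
    """
    relevant = [img for img in retrieved if categories[img] == query_category]
    irrelevant = [img for img in retrieved if categories[img] != query_category]
    return relevant, irrelevant


def precision(query_category, retrieved, categories):
    """Fraction of retrieved images judged relevant by the simulation."""
    if not retrieved:
        return 0.0
    relevant, _ = simulate_feedback(query_category, retrieved, categories)
    return len(relevant) / len(retrieved)


# Hypothetical ground truth and one ranked result list for a "beach" query.
categories = {"a": "beach", "b": "beach", "c": "city", "d": "forest"}
retrieved = ["a", "c", "b", "d"]
print(simulate_feedback("beach", retrieved, categories))
print(precision("beach", retrieved, categories))
```

In a feedback loop, the two returned lists would be used to refine the query before the next retrieval round, and the precision would be tracked across rounds.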

Work [18] presents an intelligent annotation-based ImR system that introduces concepts and instances, in which annotations are stored as RDF triples and can be queried to find images. Annotations at the concept level make it possible to create semantic links between concepts and thereby address many challenges.

Relevance feedback and query annotation are techniques that expand the query so that it better expresses the user’s need. Query annotation influences the graph before the retrieval process, whereas relevance feedback extends the graph in order to optimize the retrieval engine.

Regarding the underlying structure, most traditional methods focus on the data features but ignore the underlying structure information, which plays a major role in semantic discovery, especially when the label information is unknown. Many databases have an underlying cluster or manifold structure [3].

The context of a node in the graph model also influences the semantics, though to a lesser degree: it implies taking into account the ascendant and descendant nodes in order to make a general interpretation of the image, as in [26].
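To show how annotations stored as triples can be queried, the sketch below keeps (subject, predicate, object) tuples in a plain Python set and matches them against a pattern. It does not use a real RDF store, and the predicates `depicts` and `subClassOf`, the image identifiers, and the helper names are hypothetical; they are not taken from [18].

```python
def match(triples, subject=None, predicate=None, obj=None):
    """Return the triples that agree with every field that is not None."""
    return [
        (s, p, o)
        for s, p, o in triples
        if (subject is None or s == subject)
        and (predicate is None or p == predicate)
        and (obj is None or o == obj)
    ]


def images_of(triples, concept):
    """Images depicting the concept or one of its direct sub-concepts."""
    concepts = {concept} | {
        s for s, _, _ in match(triples, predicate="subClassOf", obj=concept)
    }
    return sorted(
        {s for s, _, o in match(triples, predicate="depicts") if o in concepts}
    )


# Hypothetical annotations: instance-level links and concept-level links.
triples = {
    ("img1", "depicts", "dog"),
    ("img2", "depicts", "cat"),
    ("img3", "depicts", "car"),
    ("dog", "subClassOf", "animal"),
    ("cat", "subClassOf", "animal"),
}
print(images_of(triples, "animal"))
```

The concept-level links let a query for "animal" reach images annotated only with "dog" or "cat". This sketch follows a single level of `subClassOf`; a deeper hierarchy would need a transitive traversal.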

Specific graphs, such as the semantic graph and the conceptual graph, are used for knowledge representation and reasoning. Work [22] presents CKSGIS, which automatically retrieves an interactive semantic graph of related terms that allows users to easily find related images, without being limited to a specific search term. In [4, 11, 27], conceptual graphs have been used in semantic representations for ImR; they are widely used in graph-based ImR. Scene graphs, which logically structure the spatial representation of a scene, are also used, as in [21, 31].

4 Discussion and Conclusion

In the last few years, working with graph-based models has brought several benefits, owing to the architecture of graphs, their ability to model complex objects (in our case, images), and their ability to model complex relationships between these objects (e.g., multiple relationships between the same nodes).

In the ImR domain, several factors add a semantic aspect in order to increase the performance of the retrieval system and to improve the relevance of the result. Based on the approaches presented in Table 2, the main question of this work is to what extent graphs integrate the semantic aspect in ImR, considering not only approaches that deal with semantics explicitly but also those that express semantics implicitly. In this work, we were interested in the semantic aspect of graph-based ImR approaches, which depends on the context of the study and the objectives pursued.

In this paper, we presented a state of the art of works related to graph-based ImR. The paper reviews the ImR techniques (TBIR, CBIR, and SBIR) and the image representation models (vectors, strings, trees, and graphs). From this we deduce that TBIR and CBIR techniques are not sufficient for relevant and effective ImR, and from the literature on image representation models we deduce that the graph model is of great importance in ImR.

Graph-based ImR makes it possible to represent all the possible relations between components; the semantics associated with an arc is not limited to a typing or membership relation. An image is represented as a set of components and a set of binary relations between these components.

The main purpose of the paper is to draw attention to graph-based approaches and their vital role in increasing the meaningfulness of the image and in serving the user’s interest as well as possible. We have given an overview of existing approaches to ImR. We have concluded that the semantic aspect can be addressed from several angles, according to the context of the study and the objectives pursued. Based on this state of the art, we conclude that graph-based approaches to ImR can open further leads for improving image-related information retrieval systems.

ImR is evolving into a promising field, but existing methods of semantic ImR must be adapted. Many challenges remain to be overcome in this domain. Our future work will focus on approaches using semantics in IR.

References

1. Idarrou, A., Mammass, D.: Structural clustering multimedia documents: an approach based on semantic sub-graph isomorphism. Int. J. Comput. Appl. 51(1), 14–21 (2012)
2. Kumar, A., Kim, J., Wen, L., Fulham, M., Feng, D.: A graph-based approach for the retrieval of multi-modality medical images. Med. Image Anal. 18(2), 330–342 (2014)
3. Bin, X., Jiajun, B., Chen, C., Wang, C., Cai, D., He, X.: EMR: a scalable graph-based ranking model for content-based image retrieval. IEEE Trans. Knowl. Data Eng. 27(1), 102–114 (2013)
4. Hernández-Gracidas, C., Enrique Sucar, L., Montes-y Gómez, M.: Modeling spatial relations for image retrieval by conceptual graphs. In: Proceedings of the First Chilean Workshop on Pattern Recognition (2009)
5. Li, C.-Y., Hsu, C.-T.: Image retrieval with relevance feedback based on graph-theoretic region correspondence estimation. IEEE Trans. Multimed. 10(3), 447–456 (2008)
6. Pedronette, D.C.G., Torres, R.D.S.: A correlation graph approach for unsupervised manifold learning in image retrieval tasks. Neurocomputing 208, 66–79 (2016)
7. Pedronette, D.C.G., Gonçalves, F.M.F., Guilherme, I.R.: Unsupervised manifold learning through reciprocal KNN graph and connected components for image retrieval tasks. Pattern Recognit. 75, 161–174 (2018)
8. Conte, D., Foggia, P., Sansone, C., Vento, M.: Thirty years of graph matching in pattern recognition. Int. J. Pattern Recognit. Artif. Intell. 18(03), 265–298 (2004)
9. Hidović, D., Pelillo, M.: Metrics for attributed graphs based on the maximal similarity common subgraph. Int. J. Pattern Recognit. Artif. Intell. 18(03), 299–313 (2004)
10. Boubekeur, F., Boughanem, M., Tamine-Lechani, L.: Semantic information retrieval based on CP-nets. In: 2007 IEEE International Fuzzy Systems Conference, pp. 1–7. IEEE (2007)
11. Gonçalves, F.M.F., Guilherme, I.R., Pedronette, D.C.G.: Semantic guided interactive image retrieval for plant identification. Expert Syst. Appl. 91, 12–26 (2018)
12. Salton, G., et al.: The smart system-experiments in automatic document processing (1971)
13. Salton, G.: Recent trends in automatic information retrieval. In: Proceedings of the 9th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1–10. ACM (1986)
14. Liu, G.-H., Yang, J.-Y., Li, Z.Y.: Content-based image retrieval using computational visual attention model. Pattern Recognit. 48(8), 2554–2566 (2015)
15. Abioui, H., Idarrou, A., Bouzit, A., Mammass, D.: Review: automatic image annotation for semantic image retrieval. In: Mansouri, A., El Moataz, A., Nouboud, F., Mammass, D. (eds.) ICISP 2018. LNCS, vol. 10884, pp. 129–137. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-94211-7_15
16. Yang, H.-Y., Li, Y.-W., Li, W.-Y., Wang, X.-Y., Yang, F.-Y.: Content-based image retrieval using local visual attention feature. J. Vis. Commun. Image Represent. 25(6), 1308–1323 (2014)
17. Bunke, H., Shearer, K.: A graph distance metric based on the maximal common subgraph. Pattern Recognit. Lett. 19(3–4), 255–259 (1998)
18. Chen, H., Trouve, A., Murakami, K.J., Fukuda, A.: An intelligent annotation-based image retrieval system based on RDF descriptions. Comput. Electr. Eng. 58, 537–550 (2017)
19. Urban, J., Jose, J.M.: Adaptive image retrieval using a graph model for semantic feature integration. In: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval, pp. 117–126. ACM (2006)
20. H’roura, J.: Contributions à l’extraction de descripteureurs sur des données non conventionnelles pour a reconnaissance d’objets 3D. Ph.D. thesis, Université Ibn Zohr (2019)
21. Johnson, J., et al.: Image retrieval using scene graphs. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3668–3678 (2015)
22. Shieh, J.-R., Yeh, Y.-T., Lin, C.-H., Lin, C.-Y., Wu, J.-L.: Collaborative knowledge semantic graph image search. In: Proceedings of the 17th International Conference on World Wide Web, pp. 1055–1056. ACM (2008)
23. Jenni, K., Mandala, S., Sunar, M.S.: Content based image retrieval using colour strings comparison. Procedia Comput. Sci. 50, 374–379 (2015)
24. Maisonnasse, L., Chevallet, J.P., Berrut, C.: Incomplete and fuzzy conceptual graphs to automatically index medical reports. In: Kedad, Z., Lammari, N., Métais, E., Meziane, F., Rezgui, Y. (eds.) NLDB 2007. LNCS, vol. 4592, pp. 240–251. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-73351-5_21
25. La Cascia, M., Sethi, S., Sclaroff, S.: Combining textual and visual cues for content-based image retrieval on the World Wide Web. In: Proceedings of IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No. 98EX173), pp. 24–28. IEEE (1998)
26. Torjmen-Khemakhem, M., Pinel-Sauvagnat, K., Boughanem, M.: Investigating the document structure as a source of evidence for multimedia fragment retrieval. Inf. Process. Manag. 49(6), 1281–1300 (2013)
27. Mechkour, M., Berrut, C., Chiaramella, Y.: Using conceptual graph frame work for image retrieval. In: International Conference on MultiMedia Modeling (MMM 1995), Singapore, pp. 127–142 (1995)
28. Baziz, M., Boughanem, M., Loiseau, Y., Prade, H.: Fuzzy logic and ontology-based information retrieval. In: Wang, P.P., Ruan, D., Kerre, E.E. (eds.) Fuzzy Logic, vol. 215, pp. 193–218. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-71258-9_10
29. Allani, O., Zghal, H.B., Mellouli, N., Akdag, H.: A knowledge-based image retrieval system integrating semantic and visual features. Procedia Comput. Sci. 96, 1428–1436 (2016)
30. Unar, S., Wang, X., Wang, C., Wang, Y.: A decisive content based image retrieval approach for feature fusion in visual and textual images. Knowl.-Based Syst. 179, 8–20 (2019)
31. Schuster, S., Krishna, R., Chang, A., Fei-Fei, L., Manning, C.D.: Generating semantically precise scene graphs from textual descriptions for improved image retrieval. In: Proceedings of the Fourth Workshop on Vision and Language, pp. 70–80 (2015)
32. Sorlin, S., Solnon, C.: Similarité de graphes: une mesure générique et un algorithme tabou réactif (2005)
33. Levenshtein, V.I.: Binary codes capable of correcting deletions, insertions, and reversals. In: Soviet Physics doklady, vol. 10, pp. 707–710 (1966)
34. Li, X., Chen, S.-C., Shyu, M.-L., Furht, B.: An effective content-based visual image retrieval system. In: Proceedings 26th Annual International Computer Software and Applications, pp. 914–919. IEEE (2002)
35. Rui, Y., Huang, T.S., Ortega, M., Mehrotra, S.: Relevance feedback: a power tool for interactive content-based image retrieval. IEEE Trans. Circuits Syst. Video Technol. 8(5), 644–655 (1998)
36. Rui, Y., Huang, T.S.: A novel relevance feedback technique in image retrieval (1999)

Author information

Authors and Affiliations

IRF-SIC Laboratory, Ibn Zohr University, Agadir, Morocco
Imane Belahyane, Mouad Mammass, Hasna Abioui & Ali Idarrou

Corresponding authors

Correspondence to Imane Belahyane, Mouad Mammass, Hasna Abioui or Ali Idarrou.

Editor information

Editors and Affiliations

GREYC, University of Caen Normandie, Caen, France
Abderrahim El Moataz
IRF-SIC, Faculty of Sciences, Ibn Zohr University, Agadir, Morocco
Driss Mammass
ImViA, University of Burgundy, Dijon, France
Alamin Mansouri
Math - Info, University of Quebec, Trois-Rivières, QC, Canada
Fathallah Nouboud

About this paper

Cite this paper

Belahyane, I., Mammass, M., Abioui, H., Idarrou, A. (2020). Graph-Based Image Retrieval: State of the Art. In: El Moataz, A., Mammass, D., Mansouri, A., Nouboud, F. (eds) Image and Signal Processing. ICISP 2020. Lecture Notes in Computer Science, vol 12119. Springer, Cham. https://doi.org/10.1007/978-3-030-51935-3_32

DOI: https://doi.org/10.1007/978-3-030-51935-3_32
Published: 08 July 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-51934-6
Online ISBN: 978-3-030-51935-3
eBook Packages: Computer Science, Computer Science (R0)
