LinkedMDR: A Collective Knowledge Representation of a Heterogeneous Document Corpus

  • Conference paper
Database and Expert Systems Applications (DEXA 2017)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 10438)

Abstract

The ever-increasing need to extract knowledge from heterogeneous data has become a major concern. This is particularly evident in application domains where several actors, with different expertise, exchange large amounts of information at every stage of a large-scale project. In this paper, we propose LinkedMDR: a novel ontology for Linked Multimedia Document Representation that describes the knowledge of a heterogeneous document corpus in a semantic data network. LinkedMDR combines existing standards and introduces new components that handle the connections between these standards and augment their capabilities. It is generic and offers a pluggable layer that makes it adaptable to different domain-specific knowledge. Experiments conducted on construction projects show that LinkedMDR is applicable in real-world scenarios.

Notes

  1. Inter-document links are relations between documents. Intra-document links are relations between elements of the same document.

  2. http://www.nobatek.com.

  3. For the sake of simplicity, we only present 6 documents. However, other documents, such as videos, audio recordings and 3D drawings, could also be involved.

  4. LinkedMDR is an OWL ontology created with Protégé. Details on the LinkedMDR ontology, its overall concepts and relations are available at http://spider.sigappfr.org/linkedmdr/.

  5. Available at http://ifcowl.openbimstandards.org/IFC4_ADD2.owl.

  6. HIT2GAP (Highly Innovative Building Control Tools) is a large-scale project that involves 21 partners and provides an energy management platform for managing building energy behavior. Further details are available at: http://www.hit2gap.eu/.

  7. http://spider.sigappfr.org/linkedmdr/lmdr-annotator.

  8. The number of XML tags in the XML annotation files that we generated based on the existing standards and the number of RDF triples that we generated in the LinkedMDR ontology.

  9. \(F_2\)-measure: \(F_2 = (5 \times P \times R) / (4 \times P + R)\), where \(P\) is the precision and \(R\) is the recall.

    Recall: number of covered relevant criteria / total number of expected criteria.

    Precision: number of covered relevant criteria / total number of annotated criteria.
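
The measures defined in the last note can be sketched in a few lines of Python. This is only an illustration, not the authors' evaluation code: the function names and the sample counts (8 covered relevant criteria, 10 expected, 16 annotated) are hypothetical. It uses the general \(F_\beta\) form, which reduces to the note's expression when \(\beta = 2\).

```python
def precision(covered_relevant: int, total_annotated: int) -> float:
    """Covered relevant criteria divided by all annotated criteria."""
    return covered_relevant / total_annotated if total_annotated else 0.0


def recall(covered_relevant: int, total_expected: int) -> float:
    """Covered relevant criteria divided by all expected criteria."""
    return covered_relevant / total_expected if total_expected else 0.0


def f_beta(p: float, r: float, beta: float = 2.0) -> float:
    """General F-beta score; beta = 2 gives (5 * P * R) / (4 * P + R)."""
    b2 = beta * beta
    denominator = b2 * p + r
    if denominator == 0:
        return 0.0  # both precision and recall are zero
    return (1 + b2) * p * r / denominator


# Hypothetical counts, chosen only to illustrate the arithmetic.
p = precision(covered_relevant=8, total_annotated=16)  # 0.5
r = recall(covered_relevant=8, total_expected=10)      # 0.8
f2 = f_beta(p, r)                                      # 2.0 / 2.8
print(f"P={p:.2f} R={r:.2f} F2={f2:.3f}")
```

With \(\beta = 2\), recall is weighted more heavily than precision, so a missed expected criterion lowers the score more than a superfluous annotation does.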

Author information

Corresponding author

Correspondence to Nathalie Charbel.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Charbel, N., Sallaberry, C., Laborie, S., Tekli, G., Chbeir, R. (2017). LinkedMDR: A Collective Knowledge Representation of a Heterogeneous Document Corpus. In: Benslimane, D., Damiani, E., Grosky, W., Hameurlain, A., Sheth, A., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2017. Lecture Notes in Computer Science, vol 10438. Springer, Cham. https://doi.org/10.1007/978-3-319-64468-4_28

  • DOI: https://doi.org/10.1007/978-3-319-64468-4_28

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-64467-7

  • Online ISBN: 978-3-319-64468-4

  • eBook Packages: Computer Science, Computer Science (R0)
