Abstract
Only humans can decide whether a column, or a set of columns, of a table should store minimally unique values. Omitting from a table any existing constraint (business rule), keys included, allows implausible instances to be stored in it. Unfortunately, discovering all the keys of a table with n columns requires considering 2^n possibilities in the worst case. This paper presents and contrasts three sound and complete algorithms, each as efficient as possible, that assist database designers in discovering all existing keys in the corresponding subuniverse of discourse.
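To make the 2^n search space concrete, the following is a minimal brute-force sketch. It is not one of the three algorithms of the paper, and the function name `candidate_keys` and the sample table are hypothetical. It enumerates column subsets by increasing size, skips supersets of combinations already found to be unique, and reports the minimally unique combinations of one table instance. Uniqueness in a given instance can only suggest candidates; whether each one is a genuine key remains a decision for the designer.

```python
from itertools import combinations


def candidate_keys(columns, rows):
    """Return the minimal column combinations whose values are unique in `rows`.

    Illustrative brute force over up to 2^n - 1 non-empty column subsets;
    the result only reflects the given instance, not the business rules.
    """
    found = []  # index tuples of minimally unique combinations found so far
    for size in range(1, len(columns) + 1):
        for combo in combinations(range(len(columns)), size):
            # A superset of a unique combination is unique but not minimal.
            if any(set(key) <= set(combo) for key in found):
                continue
            projected = [tuple(row[i] for i in combo) for row in rows]
            if len(set(projected)) == len(projected):
                found.append(combo)
    return [tuple(columns[i] for i in key) for key in found]


# Hypothetical timetable instance: no single column is unique.
columns = ["room", "slot", "course"]
rows = [
    ("A1", "Mon9", "DB"),
    ("A1", "Mon11", "OS"),
    ("B2", "Mon9", "OS"),
    ("B2", "Mon11", "DB"),
]
print(candidate_keys(columns, rows))
```

On this instance all three column pairs come out as minimally unique, although a designer would probably accept only (room, slot) as a real key. The other two are unique by accident of the sample data, which is why such output is best treated as a list of proposals to confirm or reject.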
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Mancas, C. (2016). Algorithms for Database Keys Discovery Assistance. In: Řepa, V., Bruckner, T. (eds) Perspectives in Business Informatics Research. BIR 2016. Lecture Notes in Business Information Processing, vol 261. Springer, Cham. https://doi.org/10.1007/978-3-319-45321-7_23
DOI: https://doi.org/10.1007/978-3-319-45321-7_23
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45320-0
Online ISBN: 978-3-319-45321-7
eBook Packages: Business and Management, Business and Management (R0)