Abstract
Special difficulties are encountered in devising reliable systems for searching and updating any large files of documents that must be identified primarily on the basis of names and other personal particulars. The underlying problem is that of making nearly maximum use of items of identifying information that are individually unreliable but that may collectively be of considerable discriminating power. Rules that can be applied generally to name retrieval systems have been developed in a methodological study of the linkage of vital and health records into family groupings for demographic research purposes. These rules are described, and the ways in which information utilization for matching may be optimized are discussed.
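The idea of combining individually unreliable identifiers into a collectively discriminating score can be illustrated with a minimal sketch. It is not taken from the paper: the field names, the agreement probabilities, and the helper names `field_weight` and `match_score` are all hypothetical, and the log-odds weighting shown is one common way to combine such evidence, assumed here for illustration only.

```python
import math

# Hypothetical per-field agreement probabilities (illustrative values only):
# (probability of agreement among true matches,
#  probability of agreement among unrelated record pairs)
FIELD_PROBS = {
    "surname_code": (0.95, 0.01),
    "first_initial": (0.90, 0.05),
    "birth_year": (0.85, 0.02),
}


def field_weight(field, agrees):
    """Log2 odds contributed by one field agreeing or disagreeing."""
    m, u = FIELD_PROBS[field]
    if agrees:
        return math.log2(m / u)
    return math.log2((1 - m) / (1 - u))


def match_score(rec_a, rec_b):
    """Sum the per-field weights; a higher total suggests the same person."""
    return sum(
        field_weight(field, rec_a.get(field) == rec_b.get(field))
        for field in FIELD_PROBS
    )


a = {"surname_code": "S530", "first_initial": "J", "birth_year": 1931}
b = {"surname_code": "S530", "first_initial": "T", "birth_year": 1931}
c = {"surname_code": "B620", "first_initial": "M", "birth_year": 1940}

print(match_score(a, a))  # all fields agree: strongly positive
print(match_score(a, b))  # one disagreement: still positive
print(match_score(a, c))  # all fields disagree: negative
```

In this sketch no single field decides the outcome: a disagreement on one unreliable item lowers the total but can be outweighed by agreement on the others.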