Article

Free Access

An algorithm for identifying cognates between related languages

Author:
Jacques B. M. Guy

Australian National University, Canberra, Australia

Australian National University, Canberra, Australia
View Profile

ACL '84/COLING '84: Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational LinguisticsJuly 1984Pages 448–451https://doi.org/10.3115/980491.980582

Published:02 July 1984Publication History

ACL '84/COLING '84: Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational Linguistics

Pages 448–451

ABSTRACT

The algorithm takes as only input a list of words, preferably but not necessarily in phonemic transcription, in any two putatively related languages, and sorts it into decreasing order of probable cognation. The processing of a 250-item bilingual list takes about five seconds of CPU time on a DEC KL1091, and requires 56 pages of core memory. The algorithm is given no information whatsoever about the phonemic transcription used, and even though cognate identification is carried out on the basis of a context-free one-for-one matching of individual characters, its cognation decisions are bettered by a trained linguist using more information only in cases of wordlists sharing less than 40% cognates and involving complex, multiple sound correspondences.

References

Abramowitz, Milton and Irene A. Stegun. Handbook of Mathematical Functions. National Bureau of Standards, 1970. Google ScholarDigital Library
Suhotin, B. V. Eksperimental'noe vydelenie klassov bukv s pomoshchju elektronnoj vychislitel'noj mashiny. Problemy strukturnoj lingvistiki. Moscow 1962.Google Scholar

An algorithm for identifying cognates between related languages
1. Hardware
  1. Power and energy
    1. Power estimation and optimization

Recommendations

Methods for extracting and classifying pairs of cognates and false friends

The identification of cognates has attracted the attention of researchers working in the area of Natural Language Processing, but the identification of false friends is still an under-researched area. This paper proposes novel methods for the automatic ...
Read More
Tagging Portuguese with a Spanish tagger using cognates
CrossLangInduction '06: Proceedings of the International Workshop on Cross-Language Knowledge Induction

We describe a knowledge and resource light system for an automatic morphological analysis and tagging of Brazilian Portuguese. We avoid the use of labor intensive resources; particularly, large annotated corpora and lexicons. Instead, we use (i) an ...
Read More
Identifying cognates by phonetic and semantic similarity
NAACL '01: Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies

I present a method of identifying cognates in the vocabularies of related languages. I show that a measure of phonetic similarity based on multivalued features performs better than "orthographic" measures, such as the Longest Common Subsequence Ratio (...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

ACL '84/COLING '84: Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational Linguistics
July 1984
577 pages
Sponsors
In-Cooperation
Publisher
Association for Computational Linguistics
United States
Publication History
- Published: 2 July 1984
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate85of443submissions,19%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 266
  Total Downloads
- Downloads (Last 12 months)18
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

An algorithm for identifying cognates between related languages

ACL '84/COLING '84: Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational Linguistics

ABSTRACT

References

Cited By

Recommendations

Methods for extracting and classifying pairs of cognates and false friends

Tagging Portuguese with a Spanish tagger using cognates

Identifying cognates by phonetic and semantic similarity

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

An algorithm for identifying cognates between related languages

ACL '84/COLING '84: Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational Linguistics

ABSTRACT

References

Cited By

Recommendations

Methods for extracting and classifying pairs of cognates and false friends

Tagging Portuguese with a Spanish tagger using cognates

Identifying cognates by phonetic and semantic similarity

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media