18.222.30.5
18.222.30.5
close menu
KCI 등재
한국어 구어 말뭉치를 대상으로 한 연어구성 추출 방법들의 비교: 재현가능 연구
A comparative evaluation of lexical association measures for collocation extraction from Korean spoken corpus: An example of reproducible research
이은하 ( Eun Ha Lee )
언어와 언어학 70권 497-539(43pages)
DOI 10.20865/20167019
UCI I410-ECN-0102-2016-700-000771471

The present study aims at an empirical evaluation of lexical association measures (AMs) for collocation extraction from Korean spoken corpus. Compared was the effectiveness of five widely used AMs: cooccurrence frequency, mutual information, chi-square test, t-test and log-likelihood ratio. The results revealed that for the entire reference data set, cooccurrence frequency outperformed the rest of the AMs. For candidate pairs of nouns and non-hada verbs, however, log-likelihood ratio turned out to be better than or at least comparable to the other four AMs. In comparison with that, cooccurrence frequency showed the worst performance among the five AMs. Finally, the present study has implications for development of a methodology for evaluation of collocation extraction methods in the field of Korean linguistics by making its procedures and findings reproducible.

1. 서론
2. 선행연구 검토
3. 연구방법
4. 결과
5. 논의
6. 결론
참고문헌
[자료제공 : 네이버학술정보]
×