Abstract
We present a text simplifier for English that has been built with open source software and has both lexical and syntactic simplification capabilities. The lexical simplifier uses a vector space model approach to obtain the most appropriate sense of a given word in a given context and word frequency simplicity measures to rank synonyms. The syntactic simplifier uses linguistically-motivated rule-based syntactic analysis and generation techniques that rely on part-of-speech tags and syntactic dependency information. Experimental results show good performance of the lexical simplification component when compared to a hard-to-beat baseline, good syntactic simplification accuracy, and according to human assessment, improvements over the best reported results in the literature for a system with same architecture as YATS.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
simplewiki-20140204 dump version.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
It uses the Mate Tools’ PoS tagger and lemmatizer before parsing.
- 13.
None of them developed the simplifier.
References
Biran, O., Brody, S., Elhadad, N.: Putting it simply: a context-aware approach to lexical simplification. In: Proceedings of the ACL 2011, pp. 496–501 (2011)
Bott, S., Rello, L., Drndarevic, B., Saggion, H.: Can Spanish be simpler? LexSiS: lexical simplification for Spanish. In: Proceedings of the COLING 2012, Mumbai, India, pp. 357–374 (2012)
Carroll, J., Minnen, G., Canning, Y., Devlin, S., Tait, J.: Practical simplification of English newspaper text to assist aphasic readers. In: Proceedings of the AAAI 1998 Workshop on Integrating AI and Assistive Technology, pp. 7–10 (1998)
Chandrasekar, R., Doran, C., Srinivas, B.: Motivations and methods for text simplification. In: Proceedings of the COLING 1996, pp. 1041–1044 (1996)
Coster, W., Kauchak, D.: Learning to simplify sentences using wikipedia. In: Proceedings of ACL 2011 Workshop on Monolingual Text-To-Text Generation, Portland, Oregon, USA, pp. 1–9 (2011)
Devlin, S., Tait, J.: The use of a psycholinguistic database in the simplification of text for aphasic readers. In: Linguistic Databases, pp. 161–173 (1998)
Horn, C., Manduca, C., Kauchak, D.: Learning a lexical simplifier using Wikipedia. In: Proceedings of ACL 2014, pp. 458–463 (2014)
Saggion, H., Bott, S., Rello, L.: Simplifying words in context. Experiments with two lexical resources in Spanish. Comput. Speech Lang. 35, 200–218 (2016)
Saggion, H., Stajner, S., Bott, S., Mille, S., Rello, L., Drndarevic, B.: Making it simplext: implementation and evaluation of a text simplification system for spanish. TACCESS 6(4), 14 (2015)
Shardlow, M.: Out in the open: finding and categorising errors in the lexical simplification pipeline. In: Proceedings of LREC 2014, Reykjavik, Iceland (2014)
Siddharthan, A.: Syntactic simplification and text cohesion. In: Proceedings of the LEC 2002, pp. 64–71 (2002)
Siddharthan, A.: Text simplification using typed dependencies: a comparision of the robustness of different generation strategies. In: Proceedings of the 13th European Workshop on Natural Language Generation, Nancy, France (2011)
Siddharthan, A., Angrosh, M.: Hybrid text simplification using synchronous dependency grammars with hand-written and automatically harvested rules. In: Proceedings of the EACL 2014, Gothenburg, Sweden (2014)
Turney, P.D., Pantel, P.: From frequency to meaning: vector space models of semantics. J. Artif. Int. Res. 37(1), 141–188 (2010)
Wubben, S., Bosch, A., Krahmer, E.: Sentence simplification by monolingual machine translation. In: Proceedings of ACL 2012, pp. 1015–1024 (2012)
Yatskar, M., Pang, B., Danescu-Niculescu-Mizil, C., Lee, L.: For the sake of simplicity: unsupervised extraction of lexical simplifications from Wikipedia. In: Proceedings of HLT-NAACL 2010 (2010)
Acknowledgments
We are grateful to three anonymous reviewers for their useful comments, to the participants in our human evaluation experiments, and to A. Siddharthan for sharing his dataset. This work was funded by the ABLE-TO-INCLUDE project (European Commission CIP Grant No. 621055). Horacio Saggion is (partly) supported by the Spanish MINECO Ministry (MDM-2015-0502).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Ferrés, D., Marimon, M., Saggion, H., AbuRa’ed, A. (2016). YATS: Yet Another Text Simplifier. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds) Natural Language Processing and Information Systems. NLDB 2016. Lecture Notes in Computer Science(), vol 9612. Springer, Cham. https://doi.org/10.1007/978-3-319-41754-7_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-41754-7_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41753-0
Online ISBN: 978-3-319-41754-7
eBook Packages: Computer ScienceComputer Science (R0)