Enriching Education through Data Mining

Agrawal, Rakesh

doi:10.1007/978-3-642-23780-5_1

Enriching Education through Data Mining

Rakesh Agrawal²³

Conference paper

2926 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6911))

Abstract

Education is acknowledged to be the primary vehicle for improving the economic well-being of people [1,6]. Textbooks have a direct bearing on the quality of education imparted to the students as they are the primary conduits for delivering content knowledge [9]. They are also indispensable for fostering teacher learning and constitute a key component of the ongoing professional development of the teachers [5,8]. Many textbooks, particularly from emerging countries, lack clear and adequate coverage of important concepts [7]. In this talk, we present our early explorations into developing a data mining based approach for enhancing the quality of textbooks. We discuss techniques for algorithmically augmenting different sections of a book with links to selective content mined from the Web. For finding authoritative articles, we first identify the set of key concept phrases contained in a section. Using these phrases, we find web (Wikipedia) articles that represent the central concepts presented in the section and augment the section with links to them [4]. We also describe a framework for finding images that are most relevant to a section of the textbook, while respectingglobal relevancy to the entire chapter to which the section belongs. We pose this problem of matching images to sections in a textbook chapter as an optimization problem and present an efficient algorithm for solving it [2].

We also present a diagnostic tool for identifying those sections of a book that are notwell-written and hence should be candidates for enrichment. We propose a probabilistic decision model for this purpose, which is based on syntactic complexity of the writing and the newly introduced notion of the dispersion of key concepts mentioned in the section. The model is learned using a tune set which is automatically generated in a novel way. This procedure maps sampled text book sections to the closest versions of Wikipedia articles having similar content and uses the maturity of those versions to assign need-for-enrichment labels. The maturity of a version is computed by considering the revision history of the corresponding Wikipedia article and convolving the changes in size with a smoothing filter [3].

We also provide the results of applying the proposed techniques to a corpus of widely-used, high school textbooks published by the National Council of Educational Research and Training (NCERT), India. We consider books from grades IX–XII, covering four broad subject areas, namely, Sciences, Social Sciences, Commerce, and Mathematics. The preliminary results are encouraging and indicate that developing technological approaches to enhancing the quality of textbooks could be a promising direction for research for our field.

Download to read the full chapter text

Chapter PDF

References

World Bank Knowledge for Development. World Development Report 1998/1999 (1998)
Google Scholar
Agrawal, R., Gollapudi, S., Kannan, A., Kenthapadi, K.: Enriching Textbooks with Web Images. Working paper (2011)
Google Scholar
Agrawal, R., Gollapudi, S., Kannan, A., Kenthapadi, K.: Identifying Enrichment Candidates in Textbooks. In: WWW (2011)
Google Scholar
Agrawal, R., Gollapudi, S., Kannan, A., Kenthapadi, K., Srivastava, N., Velu, R.: Enriching Textbooks Through Data Mining. In: First Annual ACM Symposium on Computing for Development, ACM DEV (2010)
Google Scholar
Gillies, J., Quijada, J.: Opportunity to Learn: A High Impact Strategy for Improving Educational Outcomes in Developing Countries. In: USAID Educational Quality Improvement Program, EQUIP2 (2008)
Google Scholar
Hanushek, E.A., Woessmann, L.: The Role of Education Quality for Economic Growth. Policy Research Department Working Paper 4122, World Bank (2007)
Google Scholar
Mohammad, R., Kumari, R.: Effective Use of Textbooks: A Neglected Aspect of Education in Pakistan. Journal of Education for International Development 3(1) (2007)
Google Scholar
Oakes, J., Saunders, M.: Education’s Most Basic Tools: Access to Textbooks and Instructional Materials in California’s Public Schools. Teachers College Record 106(10) (2004)
Google Scholar
Stein, M., Stuen, C., Carnine, D., Long, R.M.: Textbook Evaluation and Adoption. Reading & Writing Quarterly 17(1) (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Microsoft Search Labs, California, USA
Rakesh Agrawal

Authors

Rakesh Agrawal
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Informatics and Telecommunications, University of Athens, Panepistimioupolis, Ilisia, 15784, Athens, Greece
Dimitrios Gunopulos
Google Switzerland GmbH, Brandschenkestrasse 110, 8002, Zurich, Switzerland
Thomas Hofmann
Department of Computer Science, University of Bari “Aldo Moro”, via Orabona 4, 70125, Bari, Italy
Donato Malerba
Deptartment of Informatics, Athens University of Economics and Business, Patision 76, 10434, Athens, Greece
Michalis Vazirgiannis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Agrawal, R. (2011). Enriching Education through Data Mining. In: Gunopulos, D., Hofmann, T., Malerba, D., Vazirgiannis, M. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2011. Lecture Notes in Computer Science(), vol 6911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23780-5_1

Download citation

DOI: https://doi.org/10.1007/978-3-642-23780-5_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23779-9
Online ISBN: 978-3-642-23780-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics