Abstract
Large quantities of data, often referred to as big data, are now held by companies. This big data includes statements of customer opinion regarding product or service quality in an unstructured textual form. While many tools exist to extract meaningful information from big data, automation tools do not exist to monitor the ongoing conceptual content of that data. We use latent semantic analysis to extract concept factors related to service quality categories. Customer comments found in the data that express dissatisfaction are then considered as representing a non-conforming observation in a process. Once factors are extracted, proportions of nonconformities for service quality failure categories are plotted on a control chart. The results are easily interpreted and the approach allows for the quantitative evaluation of customer acceptance of system process improvement initiatives.
Similar content being viewed by others
References
Arora, S., Ge, R., Moitra, A.: Learning topic models: going beyond SVD. In: Proceedings of the IEEE 53rd Annual Symposium on Foundations of Computer Science, New Brunswick, pp. 1–10 (2012).
Ashton, T., Evangelopoulos, N., Prybutok, V.: Extending monitoring methods to textual data: a research agenda. Qual. Quant. (2013a). doi:10.1007/s11135-013-9891-8
Ashton, T., Evangelopoulos, N., Prybutok, V.: Exponentially weighted moving average control charts for monitoring customer service quality comments. Int. J. Serv. Stand. 8(3), 230–246 (2013). doi:10.1504/IJSS.2013.057237
Bollegala, D., Weir, D., Carroll, J.: Cross-domain sentiment classification using a sentiment sensitive thesaurus. IEEE Trans. Knowl. Data Eng. 25(8), 1719–1731 (2013)
Bradford, R.: An empirical study of required dimensionality for large-scale latent semantic indexing applications. CIKM ’08: Proceedings of the 17th ACM Conference on Information and Knowledge Management, ACM, pp. 153–162, New York (2008).
Cocco, M., Tuzzi, A.: New data collection modes for surveys: a comparative analysis of the influence of survey mode on question-wording effects. Qual. Quant. 47, 3135–3152 (2013)
Deerwester, S., Dumais, S., Furnas, G., Landauer, T., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391–407 (1990)
Duncan, A.: Quality Control and Industrial Statistics, 5th edn, pp. 436–451. Irwin, Homewood (1986)
Evangelopoulos, N.: Latent semantic analysis. Wiley Interdisc. Rev. Cog. Sci. 4(6), 683–692 (2013). doi:10.1002/wcs.1254
Evangelopoulos, N., Zhang, X., Prybutok, V.: Latent semantic analysis: five methodological recommendations. Eur. J. Inf. Syst. 21, 70–86 (2012). doi:10.1057/ejis.2010.61
Evans, J., Lindsay, W.: Managing for Quality and Performance Excellence, 7th edn. Thomson/South-Western, Mason (2008). pp. 462
Feldman, R.: Techniques and applications for sentiment analysis. Commun. ACM 56(4), 82–89 (2013)
Fielding, J., Fielding, N., Hughes, G.: Opening up open-ended survey data using qualitative software. Qual. Quant. 47, 3261–3276 (2013)
Heider, F.: The Psychology of Interpersonal Relations. Wiley, New York (1958)
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the Knowledge Discovery and Data Mining, pp. 168–177. Association for Computing Machinery, Seattle (2004).
Hu, X., Cai, Z., Wiemer-Hastings, P., Graesser, A.C., McNamara, D.S.: Strengths, limitations, and extensions of LSA. In: Landauer, T.K., McNamara, D.S., Dennis, S., Kintsch, W. (eds.) Handbook of Latent Semantic Analysis, pp. 401–425. Lawrence Erlbaum Associates, Mahwah (2007)
Landauer, T.: LSA as a theory of meaning. In: Landauer, T.K., McNamara, D.S., Dennis, S., Kintsch, W. (eds.) Handbook of Latent Semantic Analysis, pp. 1–32. Lawrence Erlbaum Associates, Mahwah (2007)
Liu, B.: Sentiment Analysis and Opinion Mining. Morgan & Claypool, San Rafael (2012)
Lo, S.: Web service quality control based on text mining using support vector machine. Expert Syst. Appl. 34, 603–610 (2008)
Lucas, J., Saccucci, M.: Exponentially weighted moving average control schemes: properties and enhancements. Technometrics 32(1), 1–12 (1990)
Manning, C., Raghavan, P., Schütze, H.: Matrix decompositions and latent semantic indexing. In: Introduction to Information Retrieval, pp. 403–420. Cambridge University Press, New York (2008).
Montgomery, D.: Introduction to Statistical Quality Control, 3rd edn, pp. 252–266. Wiley, New York (1996)
Russom, P.: Big data analytics. TDWI Best Practices Report. The Data Warehouse Institute, Reston, Va. [Online]. Available: http://tdwi.org/research/2011/09/best-practices-report-q4-big-data-analytics.aspx (2011, Fourth Quarter)
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)
Salton, G.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)
Scharkow, M.: Thematic content analysis using supervised machine learning: an empirical evaluation using German online news. Qual. Quant. 47, 761–773 (2013)
Schott, J.: Systems of linear equations. In: Matrix Analysis for Statistics (2nd ed.), pp. 221–254. Wiley, Hoboken (2005).
Shehata, S., Karray, F., Kamel, M.S.: An efficient concept-based mining model for enhancing text clustering. IEEE Trans Knowl. Data Eng. 22(10), 1360–1371 (2013)
Sidorova, A., Evangelopoulos, N., Valacich, J., Ramakrishnan, T.: Uncovering the intellectual core of the information systems discipline. MIS Q. 32(3), 467–482 (2008)
Stock, J., Lambert, D.: Strategic Logistics Management, 4th edn. McGraw-Hill/Irwin, Boston (2001). pp. 9–29
Zipf, G.K.: Human Behavior and the Principle of Least Effort. Addison-Wesley Press, Cambridge (1949)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ashton, T., Evangelopoulos, N. & Prybutok, V.R. Quantitative quality control from qualitative data: control charts with latent semantic analysis. Qual Quant 49, 1081–1099 (2015). https://doi.org/10.1007/s11135-014-0036-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11135-014-0036-5