Skip to main content
Log in

Quantitative quality control from qualitative data: control charts with latent semantic analysis

  • Published:
Quality & Quantity Aims and scope Submit manuscript

Abstract

Large quantities of data, often referred to as big data, are now held by companies. This big data includes statements of customer opinion regarding product or service quality in an unstructured textual form. While many tools exist to extract meaningful information from big data, automation tools do not exist to monitor the ongoing conceptual content of that data. We use latent semantic analysis to extract concept factors related to service quality categories. Customer comments found in the data that express dissatisfaction are then considered as representing a non-conforming observation in a process. Once factors are extracted, proportions of nonconformities for service quality failure categories are plotted on a control chart. The results are easily interpreted and the approach allows for the quantitative evaluation of customer acceptance of system process improvement initiatives.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Arora, S., Ge, R., Moitra, A.: Learning topic models: going beyond SVD. In: Proceedings of the IEEE 53rd Annual Symposium on Foundations of Computer Science, New Brunswick, pp. 1–10 (2012).

  • Ashton, T., Evangelopoulos, N., Prybutok, V.: Extending monitoring methods to textual data: a research agenda. Qual. Quant. (2013a). doi:10.1007/s11135-013-9891-8

  • Ashton, T., Evangelopoulos, N., Prybutok, V.: Exponentially weighted moving average control charts for monitoring customer service quality comments. Int. J. Serv. Stand. 8(3), 230–246 (2013). doi:10.1504/IJSS.2013.057237

    Article  Google Scholar 

  • Bollegala, D., Weir, D., Carroll, J.: Cross-domain sentiment classification using a sentiment sensitive thesaurus. IEEE Trans. Knowl. Data Eng. 25(8), 1719–1731 (2013)

    Article  Google Scholar 

  • Bradford, R.: An empirical study of required dimensionality for large-scale latent semantic indexing applications. CIKM ’08: Proceedings of the 17th ACM Conference on Information and Knowledge Management, ACM, pp. 153–162, New York (2008).

  • Cocco, M., Tuzzi, A.: New data collection modes for surveys: a comparative analysis of the influence of survey mode on question-wording effects. Qual. Quant. 47, 3135–3152 (2013)

    Article  Google Scholar 

  • Deerwester, S., Dumais, S., Furnas, G., Landauer, T., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391–407 (1990)

    Article  Google Scholar 

  • Duncan, A.: Quality Control and Industrial Statistics, 5th edn, pp. 436–451. Irwin, Homewood (1986)

    Google Scholar 

  • Evangelopoulos, N.: Latent semantic analysis. Wiley Interdisc. Rev. Cog. Sci. 4(6), 683–692 (2013). doi:10.1002/wcs.1254

    Google Scholar 

  • Evangelopoulos, N., Zhang, X., Prybutok, V.: Latent semantic analysis: five methodological recommendations. Eur. J. Inf. Syst. 21, 70–86 (2012). doi:10.1057/ejis.2010.61

    Article  Google Scholar 

  • Evans, J., Lindsay, W.: Managing for Quality and Performance Excellence, 7th edn. Thomson/South-Western, Mason (2008). pp. 462

    Google Scholar 

  • Feldman, R.: Techniques and applications for sentiment analysis. Commun. ACM 56(4), 82–89 (2013)

    Article  Google Scholar 

  • Fielding, J., Fielding, N., Hughes, G.: Opening up open-ended survey data using qualitative software. Qual. Quant. 47, 3261–3276 (2013)

    Article  Google Scholar 

  • Heider, F.: The Psychology of Interpersonal Relations. Wiley, New York (1958)

    Book  Google Scholar 

  • Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the Knowledge Discovery and Data Mining, pp. 168–177. Association for Computing Machinery, Seattle (2004).

  • Hu, X., Cai, Z., Wiemer-Hastings, P., Graesser, A.C., McNamara, D.S.: Strengths, limitations, and extensions of LSA. In: Landauer, T.K., McNamara, D.S., Dennis, S., Kintsch, W. (eds.) Handbook of Latent Semantic Analysis, pp. 401–425. Lawrence Erlbaum Associates, Mahwah (2007)

    Google Scholar 

  • Landauer, T.: LSA as a theory of meaning. In: Landauer, T.K., McNamara, D.S., Dennis, S., Kintsch, W. (eds.) Handbook of Latent Semantic Analysis, pp. 1–32. Lawrence Erlbaum Associates, Mahwah (2007)

    Google Scholar 

  • Liu, B.: Sentiment Analysis and Opinion Mining. Morgan & Claypool, San Rafael (2012)

    Google Scholar 

  • Lo, S.: Web service quality control based on text mining using support vector machine. Expert Syst. Appl. 34, 603–610 (2008)

    Article  Google Scholar 

  • Lucas, J., Saccucci, M.: Exponentially weighted moving average control schemes: properties and enhancements. Technometrics 32(1), 1–12 (1990)

    Article  Google Scholar 

  • Manning, C., Raghavan, P., Schütze, H.: Matrix decompositions and latent semantic indexing. In: Introduction to Information Retrieval, pp. 403–420. Cambridge University Press, New York (2008).

  • Montgomery, D.: Introduction to Statistical Quality Control, 3rd edn, pp. 252–266. Wiley, New York (1996)

    Google Scholar 

  • Russom, P.: Big data analytics. TDWI Best Practices Report. The Data Warehouse Institute, Reston, Va. [Online]. Available: http://tdwi.org/research/2011/09/best-practices-report-q4-big-data-analytics.aspx (2011, Fourth Quarter)

  • Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)

    Article  Google Scholar 

  • Salton, G.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)

    Article  Google Scholar 

  • Scharkow, M.: Thematic content analysis using supervised machine learning: an empirical evaluation using German online news. Qual. Quant. 47, 761–773 (2013)

    Article  Google Scholar 

  • Schott, J.: Systems of linear equations. In: Matrix Analysis for Statistics (2nd ed.), pp. 221–254. Wiley, Hoboken (2005).

  • Shehata, S., Karray, F., Kamel, M.S.: An efficient concept-based mining model for enhancing text clustering. IEEE Trans Knowl. Data Eng. 22(10), 1360–1371 (2013)

    Article  Google Scholar 

  • Sidorova, A., Evangelopoulos, N., Valacich, J., Ramakrishnan, T.: Uncovering the intellectual core of the information systems discipline. MIS Q. 32(3), 467–482 (2008)

    Google Scholar 

  • Stock, J., Lambert, D.: Strategic Logistics Management, 4th edn. McGraw-Hill/Irwin, Boston (2001). pp. 9–29

    Google Scholar 

  • Zipf, G.K.: Human Behavior and the Principle of Least Effort. Addison-Wesley Press, Cambridge (1949)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Triss Ashton.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ashton, T., Evangelopoulos, N. & Prybutok, V.R. Quantitative quality control from qualitative data: control charts with latent semantic analysis. Qual Quant 49, 1081–1099 (2015). https://doi.org/10.1007/s11135-014-0036-5

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11135-014-0036-5

Keywords

Navigation