Abstract
Web mining refers to the whole of data miningand related techniques that are used toautomatically discover and extract informationfrom web documents and services. When used in abusiness context and applied to some type ofpersonal data, it helps companies to builddetailed customer profiles, and gain marketingintelligence. Web mining does, however, pose athreat to some important ethical values likeprivacy and individuality. Web mining makes itdifficult for an individual to autonomouslycontrol the unveiling and dissemination of dataabout his/her private life. To study thesethreats, we distinguish between `content andstructure mining' and `usage mining.' Webcontent and structure mining is a cause forconcern when data published on the web in acertain context is mined and combined withother data for use in a totally differentcontext. Web usage mining raises privacyconcerns when web users are traced, and theiractions are analysed without their knowledge.Furthermore, both types of web mining are oftenused to create customer files with a strongtendency of judging and treating people on thebasis of group characteristics instead of ontheir own individual characteristics and merits(referred to as de-individualisation). Althoughthere are a variety of solutions toprivacy-problems, none of these solutionsoffers sufficient protection. Only a combinedsolution package consisting of solutions at anindividual as well as a collective level cancontribute to release some of the tensionbetween the advantages and the disadvantages ofweb mining. The values of privacy andindividuality should be respected and protectedto make sure that people are judged and treatedfairly. People should be aware of theseriousness of the dangers and continuouslydiscuss these ethical issues. This should be ajoint responsibility shared by web miners (bothadopters and developers), web users, andgovernments.
Similar content being viewed by others
References
M.J.A. Berry and G.S. Linoff. Mining the Web: Transforming Customer Data. John Wiley & Sons, New York, 2002.
R. Clarke. 'Profiling’ and Its Privacy Implications. Privacy Law & Policy Reporter, 1: 128, 1994.
R. Clarke. Platform for Privacy Preferences: A Critique. Privacy Law & Policy Reporter, 5 (3): 46–48, 1998.
B. Custers. Data Mining and Group Profiling on the Internet. In A. Vedder, editor, Ethics and the Internet, pages 87–104. Intersentia, Antwerpen Groningen Oxford, 2001.
O. Etzioni. The World Wide Web: Quagmire or Gold Mine? Communications of the ACM, 39 (11): 65–68, 1996.
D.R. Fordham, D.A. Riordan and M. Riordan. Business Intelligence. Management Accounting, 83 (11): 24–29, 2002.
J.S. Fulda. Data Mining and Privacy. In R. Spinello and H. Tavani, editors, Readings in CyberEthics, pages 413–417. Jones and Bartlett, Sudbury MA, 2001.
D.G. Johnson. Computer Ethics, 3rd. edition. Prentice-Hall, Upper Saddle River New Jersey, 2001.
J.F. Johnson. Immunity from the Illegitimate Focused Attention of Others: An Explanation of our Thinking and Talking about Privacy. In A. Vedder, editor, Ethics and the Internet, pages 49–70. Intersentia, Antwerpen Groningen Oxford, 2001.
R. Kosala, H. Blockeel and F. Neven. An Overview of Web Mining. In J. Meij, editor, Dealing with the Data Flood: Mining Data, Text and Multimedia, pages 480–497. STT, Rotterdam, 2002.
S.K. Madria, S.S. Bhowmick, W.-K. Ng and E.P. Lim. Research Issues in Web-data Mining. Lecture Notes in Computer Science, 1676: 303–312, 1999.
B. Mobasher, R. Cooley and J. Srivastava. Automatic Personalization Based on Web Usage Mining. Communications of the ACM, 43 (8): 142–151, 2000.
M.D. Mulvenna, S.S. Anand and A.G. Büchner. Personalization on the Net Using Web Mining. Communications of the ACM, 43 (8): 123–125, 2000.
H. Nissenbaum. Toward an Approach to Privacy in Public: Challenges of Information Technology. Ethics & Behavior, 7 (3): 207–220, 1997.
E. Philips, D. Vriens and G. Kienhorst, editors. Business Intelligence. Kluwer, Deventer, 1999.
R. Pierik. Group Profiles, Equality, and the Power of Numbers. In A. Vedder, editor, Ethics and the Internet, pages 105–123. Intersentia, Antwerpen Groningen Oxford, 2001.
K. Scribbins. Privacy@net, An International Comparative Study of Consumer Privacy on the Internet. Consumers International (http://www.consumersinternational.org/news/press releases/fprivreport.pdf), 2001.
J. Srivastava, R. Cooley, M. Deshpande and P.N. Tan. Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data. ACM SIGKDD, 1(2): 12–23, 2000.
H.T. Tavani. Informational Privacy, Data Mining, and the Internet.’ Ethics and Information Technology, 1: 137–145, 1999a.
H.T. Tavani. KDD, Data Mining, and the Challenge to Normative Privacy. Ethics and Information Technology, 1: 265–273, 1999a.
H.T. Tavani. Privacy Enhancing Technologies as a Panacea for Online Privacy Concerns: Some Ethical Considerations. Ethics and Information Technology, 9: 26–36, 2000.
H.T. Tavani and J. Moor. Privacy Protection, Control of Information, and Privacy-Enhancing Technologies. Computers and Society, 31: 6–11, 2001.
A. Vedder. KDD: The Challenge to Individualism. Ethics and Information Technology, 1: 275–281, 1999.
A. Vedder. Privacy and Confidentiality. Medical Data, New Information Technologies, and the Need for Normative Principles Other than Privacy Rules. Law and Medicine, 3: 441–459, 2000.
A. Vedder. KDD, Privacy, Individuality, and Fairness. In R. Spinello and H. Tavani, editors, Readings in CyberEthics, pages 404–412. Jones and Bartlett, Sudbury MA, 2001.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
van Wel, L., Royakkers, L. Ethical issues in web data mining. Ethics and Information Technology 6, 129–140 (2004). https://doi.org/10.1023/B:ETIN.0000047476.05912.3d
Issue Date:
DOI: https://doi.org/10.1023/B:ETIN.0000047476.05912.3d