Skip to main content

Extracting Hidden Information Based on Comparing Web with UGC

  • Conference paper
Web Information Systems Engineering – WISE 2010 Workshops (WISE 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6724))

Included in the following conference series:

  • 986 Accesses

Abstract

Nowadays, many users create information related to user-generated content (UGC) such as that for social network services (SNSs). They create communities based on their hobbies and interests. Then they readily exchange that information mutually in the UGC. They know the information of the community deeply and the information tends to become rare information. Therefore, much information that is not written in the general Web content is buried in the UGC. We designate that buried information as “hidden information.” Our proposed “hidden information” definition is “specific information for the community and important information for general users.” As described in this paper, we propose a means to extract “hidden information.”

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Turney, P.D., Littman, M.L.: Unsupervised Learning of Semantic Orientation from a Hundred-Billion-Word Corpus, National Research Council, Institute for Information Technology, Technical Report ERB-1094, NRC #44929 (2002)

    Google Scholar 

  2. Turney, P.D.: Thumbs up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews. In: Proc. ACL, pp. 417–424 (2002)

    Google Scholar 

  3. Popescu, A.-M., Etzioni, O.: Extracting Product Features and Opinions from Reviews. In: Human Language Technology Conference (HLT) / Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 339–346 (2005)

    Google Scholar 

  4. Hu, M., Liu, B.: Mining and Summarizing Customer Reviews. In: Proc. KDD, pp. 168–177 (2004)

    Google Scholar 

  5. Dave, K., Lawrence, S., Pennock, D.M.: Mining the Peanut Gallery: Opinion Extraction and Semantic Classification of Product Reviews. In: Proc. 12th International World Wide Web Conference (WWW), pp. 519–528 (2003)

    Google Scholar 

  6. Inui, K., Abe, S., Morita, H., Eguchi, M., Sumida, A., Sao, C., Hara, K., Murakami, K., Matsuyoshi, S.: Experience Mining: Building a Large-Scale Database of Personal Experiences and Opinions from Web Documents. In: Proc. 2008 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 314–321 (December 2008)

    Google Scholar 

  7. Hatzivassiloglou, V., McKeown, K.R.: Predicting the Semantic Orientation of Adjectives. In: Proc. ACL, pp. 174–181 (1997)

    Google Scholar 

  8. Hatzivassiloglou, V., Wiebe, J.M.: Effective of Adjective Orientation and Gradability on Sentence Subjectivity. In: Proc. 18th International Conference on Computational Linguistic. ACL (2000)

    Google Scholar 

  9. Nakatani, M., Jatowt, A., Tanaka, K.: Easiest-First Search: Towards Comprehension-based Web Search. In: Proc. 18th ACM Conference on Information and Knowledge Management (CIKM 2009), pp. 2057–2060. ACM Press, New York (2009), short paper

    Google Scholar 

  10. Pudota, N., Casoto, P., Dattolo, A., Omero, P., Tasso, C.: Towards Bridging the Gap between Personalization and Information Extraction. In: Agosti, M., Esposito, F., Thanos, C. (eds.) Proc. Fourth Italian Research Conference on Digital Library Systems (IRCDL), Padua, Italy, January 24-25, pp. 33–40 (2008)

    Google Scholar 

  11. Nakayama, K., Pei, M., Erdmann, M., Ito, M., Shirakawa, M., Hara, T., Nishio, S.: Wikipedia Mining - Wikipedia as a Corpus for Knowledge Extraction. In: Proc. Annual Wikipedia Conference, Wikimania (2008)

    Google Scholar 

  12. Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO: A Core of Semantic Knowledge Unifying WordNet and Wikipedia. In: Proc. 16th International World Wide Web Conference (WWW 2007), pp. 697–706 (May 2007)

    Google Scholar 

  13. Wu, F., Weld, D.S.: Automatically Refining the Wikipedia Infobox Ontology. In: Proc. 17th International World Wide Web Conference (WWW 2008), pp. 365–644 (April 2008)

    Google Scholar 

  14. Gabrilovich, E., Markovitch, S.: Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis. In: Proc. International Joint Conference on Artificial Intelligence 2007 (IJCAI 2007), pp. 1606–1611 (January 2007)

    Google Scholar 

  15. Bruce, R.F., Wiebe, J.M.: Recognizing subjectivity: A case study of manual tagging. Natural Language Engineering 5(2) (June 1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Uchimura, K., Nadamoto, A. (2011). Extracting Hidden Information Based on Comparing Web with UGC. In: Chiu, D.K.W., et al. Web Information Systems Engineering – WISE 2010 Workshops. WISE 2010. Lecture Notes in Computer Science, vol 6724. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24396-7_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-24396-7_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-24395-0

  • Online ISBN: 978-3-642-24396-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics