ABSTRACT
In this paper, we report on a large-scale study of structural differences among the national webs. The study is based on a web-scale crawl conducted in the summer 2008. More specifically, we study two graphs derived from this crawl, the nation graph, with nodes corresponding to nations and edges - to links among nations, and the host graph, with nodes corresponding to hosts and edges - to hyperlinks among pages on the hosts. Contrary to some of the previous work [2], our results show that webs of different nations are often very different from each other, both in terms of their internal structure, and in terms of their connectivity with other nations.
- Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., Wiener, J. Graph Structure in the Web. Computer Networks, 33(1--6), pp.309--320, 2000. Google ScholarDigital Library
- Donato, D., Leonardi, S., Millozzi, S., Tsaparas, P. Mining the Inner Structure of the Web Graph. 8th International Workshop on the Web and Databases (WebDB), June 16--17 2005, Baltimore, MD, USA.Google Scholar
- Zhu, J.H., Meng, T., Xie, Z., Li, G., Li, X. A Teapot Graph and its Hierarchical Structure of the Chinese Web. 17th International World Wide Web Conference, April 21--25 2008, Beijing, China. Google ScholarDigital Library
Comments