skip to main content
10.1145/1081870.1081893acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Graphs over time: densification laws, shrinking diameters and possible explanations

Published:21 August 2005Publication History

ABSTRACT

How do real graphs evolve over time? What are "normal" growth patterns in social, technological, and information networks? Many studies have discovered patterns in static graphs, identifying properties in a single snapshot of a large network, or in a very small number of snapshots; these include heavy tails for in- and out-degree distributions, communities, small-world phenomena, and others. However, given the lack of information about network evolution over long periods, it has been hard to convert these findings into statements about trends over time.Here we study a wide range of real graphs, and we observe some surprising phenomena. First, most of these graphs densify over time, with the number of edges growing super-linearly in the number of nodes. Second, the average distance between nodes often shrinks over time, in contrast to the conventional wisdom that such distance parameters should increase slowly as a function of the number of nodes (like O(log n) or O(log(log n)).Existing graph generation models do not exhibit these types of behavior, even at a qualitative level. We provide a new graph generator, based on a "forest fire" spreading process, that has a simple, intuitive justification, requires very few parameters (like the "flammability" of nodes), and produces graphs exhibiting the full range of properties observed both in prior work and in the present study.

References

  1. J. Abello, A. L. Buchsbaum, and J. Westbrook. A functional approach to external graph algorithms. In Proceedings of the 6th Annual European Symposium on Algorithms, pages 332--343. Springer-Verlag, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J. Abello, P. M. Pardalos, and M. G. C. Resende. Handbook of massive data sets. Kluwer, 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. Albert and A.-L. Barabasi. Emergence of scaling in random networks. Science, pages 509--512, 1999.]]Google ScholarGoogle Scholar
  4. R. Albert, H. Jeong, and A.-L. Barabasi. Diameter of the world-wide web. Nature, 401:130--131, September 1999.]]Google ScholarGoogle ScholarCross RefCross Ref
  5. Z. Bi, C. Faloutsos, and F. Korn. The dgx distribution for mining massive, skewed data. In KDD, pages 17--26, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. B. Bollobas and O. Riordan. The diameter of a scale-free random graph. Combinatorica, 24(1):5--34, 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins, and J. Wiener. Graph structure in the web: experiments and models. In Proceedings of World Wide Web Conference, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. Chakrabarti, Y. Zhan, and C. Faloutsos. R-mat: A recursive model for graph mining. In SDM, 2004.]]Google ScholarGoogle ScholarCross RefCross Ref
  9. F. Chung and L. Lu. The average distances in random graphs with given expected degrees. Proceedings of the National Academy of Sciences, 99(25):15879--15882, 2002.]]Google ScholarGoogle ScholarCross RefCross Ref
  10. C. Cooper and A. Frieze. A general model of web graphs. Random Struct. Algorithms, 22(3):311--335, 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. Faloutsos, P. Faloutsos, and C. Faloutsos. On power-law relationships of the internet topology. In SIGCOMM, pages 251--262, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Gehrke, P. Ginsparg, and J. M. Kleinberg. Overview of the 2003 kdd cup. SIGKDD Explorations, 5(2):149--151, 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. B. H. Hall, A. B. Jaffe, and M. Trajtenberg. The nber patent citation data file: Lessons, insights and methodological tools. NBER Working Papers 8498, National Bureau of Economic Research, Inc, Oct. 2001.]]Google ScholarGoogle Scholar
  14. B. A. Huberman and L. A. Adamic. Growth dynamics of the world-wide web. Nature, 399:131, 1999.]]Google ScholarGoogle ScholarCross RefCross Ref
  15. J. S. Katz. The self-similar science system. Research Policy, 28:501--517, 1999.]]Google ScholarGoogle ScholarCross RefCross Ref
  16. J. S. Katz. Scale independent bibliometric indicators. Measurement: Interdisciplinary Research and Perspectives, 3:24--28, 2005.]]Google ScholarGoogle ScholarCross RefCross Ref
  17. J. M. Kleinberg. Small-world phenomena and the dynamics of information. In Advances in Neural Information Processing Systems 14, 2002.]]Google ScholarGoogle Scholar
  18. J. M. Kleinberg, R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. The web as a graph: Measurements, models, and methods. In Proc. International Conference on Combinatorics and Computing, pages 1--17, 1999.]]Google ScholarGoogle ScholarCross RefCross Ref
  19. R. Kumar, P. Raghavan, S. Rajagopalan, D. Sivakumar, A. Tomkins, and E. Upfal. Stochastic models for the web graph. In Proc. 41st IEEE Symp. on Foundations of Computer Science, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. Trawling the web for emerging cyber-communities. In Proceedings of 8th International World Wide Web Conference, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. F. Menczer. Growing and navigating the small world web by local content. Proceedings of the National Academy of Sciences, 99(22):14014--14019, 2002.]]Google ScholarGoogle ScholarCross RefCross Ref
  22. S. Milgram. The small-world problem. Psychology Today, 2:60--67, 1967.]]Google ScholarGoogle Scholar
  23. M. Mitzenmacher. A brief history of generative models for power law and lognormal distributions, 2004.]]Google ScholarGoogle Scholar
  24. M. E. J. Newman. The structure and function of complex networks. SIAM Review, 45:167--256, 2003.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. A. Ntoulas, J. Cho, and C. Olston. What's new on the web? the evolution of the web from a search engine perspective. In WWW Conference, pages 1--12, New York, New York, May 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. U. of Oregon Route Views Project. Online data and reports. http://www.routeviews.org.]]Google ScholarGoogle Scholar
  27. C. R. Palmer, P. B. Gibbons, and C. Faloutsos. Anf: A fast and scalable tool for data mining in massive graphs. In SIGKDD, Edmonton, AB, Canada, 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. S. Redner. Citation statistics from more than a century of physical review. Technical Report physics/0407137, arXiv, 2004.]]Google ScholarGoogle Scholar
  29. M. Schroeder. Fractals, Chaos, Power Laws: Minutes from an Infinite Paradise. W.H. Freeman and Company, New York, 1991.]]Google ScholarGoogle Scholar
  30. D. J. Watts, P. S. Dodds, and M. E. J. Newman. Collective dynamics of 'small-world' networks. Nature, 393:440--442, 1998.]]Google ScholarGoogle ScholarCross RefCross Ref
  31. D. J. Watts, P. S. Dodds, and M. E. J. Newman. Identity and search in social networks. Science, 296:1302--1305, 2002.]]Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Graphs over time: densification laws, shrinking diameters and possible explanations

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '05: Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
      August 2005
      844 pages
      ISBN:159593135X
      DOI:10.1145/1081870

      Copyright © 2005 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 21 August 2005

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader