skip to main content
10.1145/1935826.1935914acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
poster

Supervised random walks: predicting and recommending links in social networks

Published:09 February 2011Publication History

ABSTRACT

Predicting the occurrence of links is a fundamental problem in networks. In the link prediction problem we are given a snapshot of a network and would like to infer which interactions among existing members are likely to occur in the near future or which existing interactions are we missing. Although this problem has been extensively studied, the challenge of how to effectively combine the information from the network structure with rich node and edge attribute data remains largely open.

We develop an algorithm based on Supervised Random Walks that naturally combines the information from the network structure with node and edge level attributes. We achieve this by using these attributes to guide a random walk on the graph. We formulate a supervised learning task where the goal is to learn a function that assigns strengths to edges in the network such that a random walker is more likely to visit the nodes to which new links will be created in the future. We develop an efficient training algorithm to directly learn the edge strength estimation function.

Our experiments on the Facebook social graph and large collaboration networks show that our approach outperforms state-of-the-art unsupervised approaches as well as approaches that are based on feature extraction.

References

  1. L. Adamic and E. Adar. Friends and neighbors on the web. Social Networks, 25(3):211--230, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  2. A. Agarwal and S. Chakrabarti. Learning random walks to rank nodes in graphs. In ICML '07, pages 9--16, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Agarwal, S. Chakrabarti, and S. Aggarwal. Learning to rank networked entities. In KDD '06, pages 14--23, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Andrew. Iterative computation of derivatives of eigenvalues and eigenvectors. IMA Journal of Applied Mathematics, 24(2):209--218, 1979.Google ScholarGoogle ScholarCross RefCross Ref
  5. A. L. Andrew. Convergence of an iterative method for derivatives of eigensystems. Journal of Computational Physics, 26:107--112, 1978.Google ScholarGoogle ScholarCross RefCross Ref
  6. L. Backstrom, D. P. Huttenlocher, J. M. Kleinberg, and X. Lan. Group formation in large social networks: membership, growth, and evolution. In KDD '06, pages 44--54, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A.-L. Barabási and R. Albert. Emergence of scaling in random networks. Science, 286:509--512, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  8. A. Blum, H. Chan, and M. Rwebangira. A random-surfer web-graph model. In ANALCO '06, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  9. A. Clauset, C. Moore, and M. E. J. Newman. Hierarchical structure and the prediction of missing links in networks. Nature, 453(7191):98--101, May 2008.Google ScholarGoogle ScholarCross RefCross Ref
  10. J. Coleman. Social Capital in the Creation of Human Capital. The American Journal of Sociology, 94:S95--S120, 1988.Google ScholarGoogle ScholarCross RefCross Ref
  11. M. Diligenti, M. Gori, and M. Maggini. Learning web page scores by error back--propagation. In IJCAI '05, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Gehrke, P. Ginsparg, and J. M. Kleinberg. Overview of the 2003 kdd cup. SIGKDD Explorations, 5(2):149--151, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M. Gomez-Rodriguez, J. Leskovec, and A. Krause. Inferring networks of diffusion and influence. In KDD '10, 2010.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. S. Granovetter. The strength of weak ties. American Journal of Sociology, 78:1360-1380, 1973.Google ScholarGoogle ScholarCross RefCross Ref
  15. T. H. Haveliwala. Topic-sensitive pagerank. In WWW '02, pages 517--526, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. K. Henderson and T. Eliassi-Rad. Applying latent dirichlet allocation to group discovery in large graphs. In SAC '09, pages 1456--1461. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. G. Jeh and J. Widom. Scaling personalized web search. In WWW '03, pages 271--279, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. R. Kumar, P. Raghavan, S. Rajagopalan, D. Sivakumar, A. Tomkins, and E. Upfal. Stochastic models for the web graph. In FOCS '00, page 57, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Leskovec, L. Backstrom, R. Kumar, and A. Tomkins. Microscopic evolution of social networks. In KDD '08, pages 462--470, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. Leskovec, J. M. Kleinberg, and C. Faloutsos. Graphs over time: densification laws, shrinking diameters and possible explanations. In KDD '05, pages 177--187, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. D. Liben-Nowell and J. Kleinberg. The link prediction problem for social networks. In CIKM '03, pages 556--559, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. D. Liu and J. Nocedal. On the limited memory bfgs method for large scale optimization. Mathematical Programming, 45:503--528, 1989. 10.1007/BF01589116. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. R. Minkov and W. W. Cohen. Learning to rank typed graph walks: Local and global approaches. In WebKDD/SNA-KDD '07, pages 1--8, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. S. Myers and J. Leskovec. On the convexity of latent social network inference. In NIPS '10, 2010.Google ScholarGoogle Scholar
  25. L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Dig. Lib. Tech. Proj., 1998.Google ScholarGoogle Scholar
  26. A. Popescul, R. Popescul, and L. H. Ungar. Statistical relational learning for link prediction, 2003.Google ScholarGoogle Scholar
  27. P. Sarkar and A. W. Moore. Fast dynamic reranking in large graphs. In WWW '09, pages 31--40, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. B. Taskar, M. F. Wong, P. Abbeel, and D. Koller. Link prediction in relational data. In NIPS '03, 2003.Google ScholarGoogle Scholar
  29. H. Tong and C. Faloutsos. Center-piece subgraphs: problem definition and fast solutions. In KDD '06, pages 404--413, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. H. Tong, C. Faloutsos, and Y. Koren. Fast direction-aware proximity for graph mining. In KDD '07, pages 747--756, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. T. Tong, C. Faloutsos, and J.-Y. Pan. Fast randomwalk with restart and its applications. In ICDM '06, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. L. Yan, R. Dodier, M. Mozer, and R. Wolniewicz. Optimizing classifier performance via an approximation to the wilcoxon-mann-whitney statistic. In ICML '03, pages 848--855, 2003.Google ScholarGoogle Scholar

Index Terms

  1. Supervised random walks: predicting and recommending links in social networks

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      WSDM '11: Proceedings of the fourth ACM international conference on Web search and data mining
      February 2011
      870 pages
      ISBN:9781450304931
      DOI:10.1145/1935826

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 9 February 2011

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • poster

      Acceptance Rates

      WSDM '11 Paper Acceptance Rate83of372submissions,22%Overall Acceptance Rate498of2,863submissions,17%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader