Google web graph

Creators:
Leskovec, Jure; Lang, Kevin J.; Dasgupta, Anirban; Mahoney, Michael W.
Publication Date:
2002
Data Category:
Dataset Description:

The Google Web Graph dataset offers a detailed representation of the web's hyperlink structure as captured in 2002. In this dataset, nodes correspond to individual web pages, and directed edges represent hyperlinks from one page to another. This structure is particularly valuable for studying web connectivity, page importance algorithms like PageRank, and the overall topology of the internet during that period. In total, the dataset has a size of 0,02 GB and comprises 875,713 nodes and 5,105,039 directed edges, reflecting the extensive interlinking characteristic of the early 2000s web. Structurally, the dataset is presented as a single directed graph where each node represents a web page, and each directed edge denotes a hyperlink from one page to another. This format facilitates analyses of web page connectivity, identification of influential pages, and exploration of community structures within the web.

Variables:
Details:

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.