FYI
-Giovanni
-------- Messaggio originale -------- Oggetto: [nan-l] Dataset of 13 billion clicks available Data: Sat, 19 Jan 2013 17:37:30 -0500 Mittente: Fil Menczer fil@indiana.edu Rispondi-a: Fil Menczer fil@indiana.edu A: NaN nan-l@indiana.edu, cns-nwb-l@indiana.edu, i-complex-l@indiana.edu
Feel free to share this announcement with any interested parties.
To foster the study of the structure and dynamics of Web traffic networks, we are making available to the research community a large Click Dataset of about 13 billion HTTP requests collected at Indiana University. During about seven months of collection in 2006-2007, our system generated data at a rate of about 60 million requests per day, or about 30 GB/day of raw data. We hope that this data will help develop a better understanding of user behavior online and create more realistic models of Web traffic. The potential applications of this data include improved designs for networks, sites, and server software; more accurate forecasting of traffic trends; classification of sites based on the patterns of activity they inspire; and improved ranking algorithms for search results.
The data was collected by Mark Meiss and is available here:
http://cnets.indiana.edu/groups/nan/webtraffic/click-dataset
-Fil
Filippo Menczer Professor of Informatics and Computer Science Director, Center for Complex Networks and Systems Research Indiana University, Bloomington http://cnets.indiana.edu/people/filippo-menczer