2016 files are uploaded to Internet Archive. Identifier "enwiki-pageviews2007-2016"

On Mon, Dec 12, 2016 at 1:00 PM, <wiki-research-l-request@lists.wikimedia.org> wrote:
Send Wiki-research-l mailing list submissions to
        wiki-research-l@lists.wikimedia.org

To subscribe or unsubscribe via the World Wide Web, visit
        https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
or, via email, send a message with subject or body 'help' to
        wiki-research-l-request@lists.wikimedia.org

You can reach the person managing the list at
        wiki-research-l-owner@lists.wikimedia.org

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Wiki-research-l digest..."


Today's Topics:

   1. Upcoming research newsletter (November 2016): new papers open
      for review (masssly@ymail.com)
   2. another pageview db to download (Alex Druk)
   3. Re: another pageview db to download (Federico Leva (Nemo))


----------------------------------------------------------------------

Message: 1
Date: Sun, 11 Dec 2016 22:57:34 +0000
From: <masssly@ymail.com>
To: Wikimedia Research Mailing List
        <wiki-research-l@lists.wikimedia.org>
Subject: [Wiki-research-l] Upcoming research newsletter (November
        2016): new papers open for review
Message-ID: <726158.24370.bm@smtp108.mail.ir2.yahoo.com>
Content-Type: text/plain; charset="utf-8"

Hi everybody,

We’re preparing for the November 2016 research newsletter and looking for contributors. Please take a look at: https://etherpad.wikimedia.org/p/WRN201611 and add your name next to any paper you are interested in covering. Reviews should be in before December 14. As usual, short notes and one-paragraph reviews are most welcome.

Highlights from this month:
 
• Black Lives Matter in Wikipedia: Collaboration and Collective Memory around Online Social Movements
• DePP: A System for Detecting Pages to Protect in Wikipedia
• Digital Heritage. Progress in Cultural Heritage: Documentation, Preservation, and Protection
• Docforia: A Multilayer Document Model
• Does astronomy research become too dated for the public? Wikipedia citations to astronomy and astrophysics journal articles 1996-2014
• Election Prediction Based on Wikipedia Pageviews
• Establishing and Evaluating Digital Ethos and Online Credibility
• Finding and Expanding Hypernymic Relations in the Music Domain
• Game with a Purpose for mappings verification
• Hierarchical Question Answering for Long Documents
• How Many People Constitute a Crowd and What Do They Do? Quantitative Analyses of Revisions in the English and German Wiktionary Editions
• Measuring Quality of Collaboratively Edited Documents: the case of Wikipedia
• On Emerging Entity Detection
• Predicting Importance of Historical Persons Using Wikipedia
• Relationship between personality and attitudes to Wikipedia
• Social patterns and dynamics of creativity in Wikipedia
• Travel Attractions Recommendation with Knowledge Graphs
• What Makes a Link Successful on Wikipedia?

If you have any question about the format or process feel free to get in touch off-list.

Masssly, Tilman Bayer and Dario Taraborelli

[1] http://meta.wikimedia.org/wiki/Research:Newsletter
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.wikimedia.org/pipermail/wiki-research-l/attachments/20161211/ccdfb7d1/attachment-0001.html>

------------------------------

Message: 2
Date: Mon, 12 Dec 2016 08:32:20 +0100
From: Alex Druk <alex.druk@gmail.com>
To: wiki-research-l@lists.wikimedia.org
Subject: [Wiki-research-l] another pageview db to download
Message-ID:
        <CAP=qzqZ6gv8h0asRrK5+XG396tu2prOHr4Rwsed6yuUoRpzQSw@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Dear Wikipedia and MediaWiki people,

For a few years I have maintained a web site wikipediatrends.com. For
variety of reasons I cannot do it any more and the site will be closed in
January.
However, our DB of English wikipedia pageviews from 2007 can be used for
other projects. Any person who wish to get it please see  info below.
--

A few words about DB. We keep data in separate files for each page. Each
file is csv with lines started with year and followed by pageviews for each
day. Page name is md5 encoded  and used as name of the file. Page names are
in separate Berkley DB file. The total size of DB is about 30GB. It is in 3
archived files ~ 10 GB.
You can download DB as 12/03/2016 from:
https://s3-us-west-2.amazonaws.com/adrouk/november2016/rdd112016_1.tar.gz
https://s3-us-west-2.amazonaws.com/adrouk/november2016/rdd112016_2.tar.gz
https://s3-us-west-2.amazonaws.com/adrouk/november2016/articles112016.db
As June 2015:
https://s3-us-west-2.amazonaws.com/adrouk/june2015/rdd62015_1.tar.gz
https://s3-us-west-2.amazonaws.com/adrouk/june2015/rdd62015_2.tar.gz
https://s3-us-west-2.amazonaws.com/adrouk/june2015/articles62015.db
Please do not hesitate to ask any question about DB. If by any chance you
are interested in the site also, please contact me of the list.
Enjoy!

---
Thank you.

Alex Druk, PhD
wikipediatrends.com
alex.druk@gmail.com
(775) 237-8550 Google voice
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.wikimedia.org/pipermail/wiki-research-l/attachments/20161212/f37aa88f/attachment-0001.html>

------------------------------

Message: 3
Date: Mon, 12 Dec 2016 08:53:05 +0100
From: "Federico Leva (Nemo)" <nemowiki@gmail.com>
To: Research into Wikimedia content and communities
        <wiki-research-l@lists.wikimedia.org>
Cc: wikiteam-discuss@googlegroups.com
Subject: Re: [Wiki-research-l] another pageview db to download
Message-ID: <03f8ddd5-df11-219b-6e3c-19b92de8b1ff@gmail.com>
Content-Type: text/plain; charset=utf-8; format=flowed

Alex Druk, 12/12/2016 08:32:
> For a few years I have maintained a web site wikipediatrends.com
> <http://wikipediatrends.com>. For variety of reasons I cannot do it any
> more and the site will be closed in January.
> However, our DB of English wikipedia pageviews from 2007 can be used for
> other projects. Any person who wish to get it please see  info below.

Thanks. Can you please upload those files to the Internet Archive? You
can use the
https://internetarchive.readthedocs.io/en/latest/cli.html#upload CLI
with mediatype "data", collection "opensource" and subject "Wikipedia;
enwiki".

Nemo

> A few words about DB. We keep data in separate files for each page. Each
> file is csv with lines started with year and followed by pageviews for
> each day. Page name is md5 encoded  and used as name of the file. Page
> names are in separate Berkley DB file. The total size of DB is about
> 30GB. It is in 3 archived files ~ 10 GB.
> You can download DB as 12/03/2016 from:
> https://s3-us-west-2.amazonaws.com/adrouk/november2016/rdd112016_1.tar.gz
> https://s3-us-west-2.amazonaws.com/adrouk/november2016/rdd112016_2.tar.gz
> https://s3-us-west-2.amazonaws.com/adrouk/november2016/articles112016.db
> As June 2015:
> https://s3-us-west-2.amazonaws.com/adrouk/june2015/rdd62015_1.tar.gz
> <https://s3-us-west-2.amazonaws.com/adrouk/june2015/rdd62015_1.tar.gz>
> https://s3-us-west-2.amazonaws.com/adrouk/june2015/rdd62015_2.tar.gz
> <https://s3-us-west-2.amazonaws.com/adrouk/june2015/rdd62015_2.tar.gz>
> https://s3-us-west-2.amazonaws.com/adrouk/june2015/articles62015.db
> <https://s3-us-west-2.amazonaws.com/adrouk/june2015/articles62015.db>
> Please do not hesitate to ask any question about DB. If by any chance
> you are interested in the site also, please contact me of the list.
> Enjoy!
>
> ---
> Thank you.
>
> Alex Druk, PhD
> wikipediatrends.com
> <http://wikipediatrends.com/>alex.druk@gmail.com
> <mailto:alex.druk@gmail.com>
> (775) 237-8550 <tel:(775)%20237-8550> Google voice
>
>
>
> _______________________________________________
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>



------------------------------

Subject: Digest Footer

_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


------------------------------

End of Wiki-research-l Digest, Vol 136, Issue 9
***********************************************



--
Thank you.

Alex Druk
alex.druk@gmail.com
(775) 237-8550 Google voice