If any of of you are at the wikimania currently, Netha Hussain would be
happy to run you guys through the graphs in person and take any questions
on them. She is free from 09:00 - 11:00 CDT (mexican local time) on Sunday.
On Wed, Jul 15, 2015 at 1:19 PM, jeph <jephpaul(a)gmail.com> wrote:
- I've added the definitions.
using the historical definition -
for an active editor.
- The longevity graph shows some interesting results when we compare
'en' with other languages like 'es', 'zh' etc. I'll upload
them and send
the link as soon as I can.
- Graphs 4&5 do show results that are different from 
- Graph 4 - Monthly Editor Activity Split By Cohort - Stacked Bars
Selecting 1-2 in the selector show the contribution of the cohort that
joined in that month in all the months.
- Graph 5 - Monthly Editor Activity % Split By Cohort - Stacked Bars
Selecting the same (1-2 in the selector) show the contribution of the
cohort that joined in that month as a % of the total activity in a given
month for all months.
- In the month of Jan 07 , The cohort Jan 07 contributed 'x'%.
- The activity in month Jan 07 = cohort Jan 07 + cohort Dec 06
..... Cohort Jan 01.
- The editor activity peaks in Jan 07 - March 07 as shown in Graph 4
many other graphs  etc.
- Graph 5 show that for the same period the contribution in % for
the cohorts joining in the months Jan 07 - March 07, aka the new comers
each month has remained the same and it is < 40 %. So the older editors
contributed 60+ % in those months. Which tell us that the contribution to
the fall in active editors lies both with the new editors in a month and
also the older editors. In fact the older editors contributed more to the
- I have not looked specifically at (No of edits in first session
- It was  that got me working on the graphs :-)
- Please send me a screenshot & I'll try to fix it for you.
- If you know the dates when they were introduced we could find out
what effect it had. Could you please add them to
or on the talk page.
There are five different graphs at
explanation for each of them can be found at the bottom of each graph. I've
generated the graphs for other wikis too 'es', 'de', 'ru' etc.
them up as soon as I can.
On Wed, Jul 15, 2015 at 4:27 AM, Aaron Halfaker <aaron.halfaker(a)gmail.com>
There are a lot of undefined metrics in your
methods. For example, what
do you mean by "canonical definition of edit sessions". Is it ? Also,
is there something that we learn from this longevity analysis that we
didn't learn from previous research? E.g.  and . One point that I
think would look into is the engagement measure used in  (# of edits in
first session after registration). In my work on , it looked like this
stat remained consistent since 2004 and therefor didn't seem to explain the
drop in newcomer retention.
On Tue, Jul 14, 2015 at 2:01 PM, jeph <jephpaul(a)gmail.com> wrote:
I been working on graphs to visualize the entire edit activity of in
wiki for some time now. I'm documenting all of it at
The graphs can be viewed at
only graphs for 'en' have been put up, I'll add the graphs for the wikis
- The editors are split into groups based on the month in which they
made their first edit.
- The active edit sessions (value or percentage etc) for the groups
are then plotted as stacked bars or as a matrix. I've used the canonical
definition of an active edit session. The value are + or - .1% of the
values on https://stats.wikimedia.org/
- There is a selector on each graph that lets you filter the data in
the graph. On moving the cursor to the left end of the selector you will
get a resize cursor. The selection can then are moved or redrawn.
- In graphs 1,2 the selector filters by percentage.
- In graphs 3,4,5 the selector filters by the age of the cohort.
- Longevity of editors fell drastically starting Jan 06 and has
since stabilized at levels from Jan 07.
Would you to hear what you guys think of the graphs & any ideas you
would have for me.
Wiki-research-l mailing list
Wiki-research-l mailing list