+analytics
On Tue, May 19, 2015 at 3:23 PM, Brian Gerstle bgerstle@wikimedia.org wrote:
+search
On Tue, May 19, 2015 at 3:14 PM, Brian Gerstle bgerstle@wikimedia.org wrote:
The subject hints at a question that's been nagging me for a while, and now that I'm going to be hacking on testing in Lyon I wanted to ask:
Do we have a list of articles we usually run tests against?
If not, do we have any processes for curating such a list? Would anyone be interested in a brainstorming session at Lyon to discuss this further?
Basically, as a developer, I would love to have more confidence that some code I wrote doesn't break on our most popular articles. Or, if we can get more sophisticated, that *certain properties of my code hold true for certain kinds of generated pages*.*
Please respond with your thoughts and whether you think I should create a phab task for the hackathon about this. In either case, ping me anytime or grab me at Lyon to discuss further!
Regards,
Brian
- Yes, I'm talking about using property-based testing generators to
create random, shrinkable MW pages that we can run tests on. Not sure if it's practical, but could be an interesting experiment.
-- EN Wikipedia user page: https://en.wikipedia.org/wiki/User:Brian.gerstle IRC: bgerstle
-- EN Wikipedia user page: https://en.wikipedia.org/wiki/User:Brian.gerstle IRC: bgerstle
The pageview API that we're focusing on this quarter and next will have top articles, and they'll be per project at least and per other dimensions most likely. Right now, there's nothing but ad-hoc queries I can help you run. stats.grok.se used to maintain such a "top articles" list per wiki but that hasn't been updated since March 2014.
On Tue, May 19, 2015 at 4:25 PM, Brian Gerstle bgerstle@wikimedia.org wrote:
+analytics
On Tue, May 19, 2015 at 3:23 PM, Brian Gerstle bgerstle@wikimedia.org wrote:
+search
On Tue, May 19, 2015 at 3:14 PM, Brian Gerstle bgerstle@wikimedia.org wrote:
The subject hints at a question that's been nagging me for a while, and now that I'm going to be hacking on testing in Lyon I wanted to ask:
Do we have a list of articles we usually run tests against?
If not, do we have any processes for curating such a list? Would anyone be interested in a brainstorming session at Lyon to discuss this further?
Basically, as a developer, I would love to have more confidence that some code I wrote doesn't break on our most popular articles. Or, if we can get more sophisticated, that *certain properties of my code hold true for certain kinds of generated pages*.*
Please respond with your thoughts and whether you think I should create a phab task for the hackathon about this. In either case, ping me anytime or grab me at Lyon to discuss further!
Regards,
Brian
- Yes, I'm talking about using property-based testing generators to
create random, shrinkable MW pages that we can run tests on. Not sure if it's practical, but could be an interesting experiment.
-- EN Wikipedia user page: https://en.wikipedia.org/wiki/User:Brian.gerstle IRC: bgerstle
-- EN Wikipedia user page: https://en.wikipedia.org/wiki/User:Brian.gerstle IRC: bgerstle
-- EN Wikipedia user page: https://en.wikipedia.org/wiki/User:Brian.gerstle IRC: bgerstle
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
I'm sure the Parsoid team has some favorite complex pages to contribute. For example, I know that Subbu likes https://it.wikipedia.org/wiki/Luna. Obama https://en.wikipedia.org/wiki/Barack_Obama is another classic, mostly for its copious use of citation templates.
Gabriel
On Tue, May 19, 2015 at 1:28 PM, Dan Andreescu dandreescu@wikimedia.org wrote:
The pageview API that we're focusing on this quarter and next will have top articles, and they'll be per project at least and per other dimensions most likely. Right now, there's nothing but ad-hoc queries I can help you run. stats.grok.se used to maintain such a "top articles" list per wiki but that hasn't been updated since March 2014.
On Tue, May 19, 2015 at 4:25 PM, Brian Gerstle bgerstle@wikimedia.org wrote:
+analytics
On Tue, May 19, 2015 at 3:23 PM, Brian Gerstle bgerstle@wikimedia.org wrote:
+search
On Tue, May 19, 2015 at 3:14 PM, Brian Gerstle bgerstle@wikimedia.org wrote:
The subject hints at a question that's been nagging me for a while, and now that I'm going to be hacking on testing in Lyon I wanted to ask:
Do we have a list of articles we usually run tests against?
If not, do we have any processes for curating such a list? Would anyone be interested in a brainstorming session at Lyon to discuss this further?
Basically, as a developer, I would love to have more confidence that some code I wrote doesn't break on our most popular articles. Or, if we can get more sophisticated, that *certain properties of my code hold true for certain kinds of generated pages*.*
Please respond with your thoughts and whether you think I should create a phab task for the hackathon about this. In either case, ping me anytime or grab me at Lyon to discuss further!
Regards,
Brian
- Yes, I'm talking about using property-based testing generators to
create random, shrinkable MW pages that we can run tests on. Not sure if it's practical, but could be an interesting experiment.
-- EN Wikipedia user page: https://en.wikipedia.org/wiki/User:Brian.gerstle IRC: bgerstle
-- EN Wikipedia user page: https://en.wikipedia.org/wiki/User:Brian.gerstle IRC: bgerstle
-- EN Wikipedia user page: https://en.wikipedia.org/wiki/User:Brian.gerstle IRC: bgerstle
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics