Each metric should return just values of one type (e.g. no mixing of booleans and integers, like threshold and time to threshold) and we should specify for each metric : (1) what the expected type of the output is and (2) what aggregators are appropriate for that type.
Practically, we can group metrics into categories depending on the attribute they compute:
• binary attributes (e.g. "got reverted", "got blocked", "is productive", "hit threshold")
• counts ("bytes added", "pages created", "time to threshold")
• rates ("revert rate")
Each of these attributes will have a canonical type:
• boolean for binary attributes
• integer for counts
• float for rates
We can then specify what aggregator is valid as a function of the metric category/type.
How does that sound?
Dario
_______________________________________________
Wikimetrics mailing list
Wikimetrics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikimetrics