Hi developers,
This is K. Kaushik Reddy. I have been assigned the ROC AUC measurement issue (https://github.com/wikimedia/revscoring/blob/master/revscoring/scoring/statistics/classification/scaled_threshold_statistics.py#L150) to patch, and I have been using the ROC article (https://en.wikipedia.org/wiki/Receiver_operating_characteristic) to get familiar with the concept. Since I'm still at a beginner level, I need some time to understand it. *The problem is to find out why the algorithm sometimes reports huge values.* I need help understanding where in these functions things go wrong and what could be done. I hope I made my question clear.
Kaushik
Hi Kaushik,
Per our recent conversation, I think that https://phabricator.wikimedia.org/T223788 is a better task to pick up right now.
BUT! I like that you're still curious about this task. So the real problem is that we get different ROC-AUC values for the "true" and "false" outcomes of a binary classifier. This should be impossible. You can get TPR and FPR from ORES directly. E.g., https://ores.wikimedia.org/v3/scores/enwiki/?model_info=statistics.threshold... will give you all of the statistics at each threshold. You can take the recall (which, incidentally, is just another name for TPR) and the FPR to generate ROC curves. If you run the same query for "false" (e.g. https://ores.wikimedia.org/v3/scores/enwiki/?model_info=statistics.threshold...) you can compare the curves for both classes and help us figure out where the disparity comes from.
See https://scikit-learn.org/stable/modules/generated/sklearn.metrics.auc.html for a nice utility for generating area-under-the-curve metrics.
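To make the comparison concrete, here's a minimal sketch (not the revscoring implementation) of computing ROC-AUC from per-threshold statistics. It assumes you've already pulled parallel lists of FPR and recall (TPR) values out of the two ORES queries above; the toy numbers below just stand in for those values.

from sklearn.metrics import auc

def roc_auc_from_thresholds(fprs, tprs):
    """Area under the ROC curve from per-threshold (FPR, TPR) pairs."""
    # sklearn.metrics.auc needs monotonic x values, so sort by FPR first.
    xs, ys = zip(*sorted(zip(fprs, tprs)))
    return auc(xs, ys)

# Toy values standing in for the statistics fetched from ORES:
fprs_true, tprs_true = [0.0, 0.1, 0.4, 1.0], [0.0, 0.6, 0.9, 1.0]
fprs_false, tprs_false = [0.0, 0.2, 0.5, 1.0], [0.0, 0.5, 0.8, 1.0]

print("true  ROC-AUC:", roc_auc_from_thresholds(fprs_true, tprs_true))
print("false ROC-AUC:", roc_auc_from_thresholds(fprs_false, tprs_false))

With the real ORES numbers, the two values should agree for a binary classifier, so a persistent gap between them would localize the bug to the AUC computation in scaled_threshold_statistics.py rather than to your bookkeeping.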
-Aaron