Methodology:
1. take the last 30 days of sampled logs
2. look for IE6
3. count proportionately for (all requests|all text/html requests)

The results: of /both HTTPS and http/ requests, ~0.20% come from IE6; of all text/html requests, 1.3%. I'd note that this is likely to be an /over/estimate; there are some browsers which spoof IE6 that'll be included in these numbers, but would be excluded from e.g. Christian's.[0] On the HTTPS front, Christian's numbers, I trust - checking for HTTPs didn't work for me, since the terminators aren't actually included in the sampled logs.[1] Full results and codebase at https://github.com/Ironholds/POODLE

[0] I knowingly used a crappy ua parsing strategy because it was also a really fast ua parsing strategy, and 30 days of logs, even sampled logs, ain't nothing.
[1] This doesn't apply to Christian's results; he took the terminator IPs and checked for requests from those ranges. Sensible!

On 15 October 2014 16:50, Oliver Keyes <okeyes@wikimedia.org> wrote:
Darnit. Ah well! Okay; finished building the code to retrieve this data. Takes ~400 seconds to handle a day of logs, so take into account parallelisation and I should (should!) have something to show in a couple of hours for the first 3 Qs. The fourth, it seems, is beyond our ken.

On 15 October 2014 15:54, Max Semenik <maxsem.wiki@gmail.com> wrote:
There's no data for IE6 in EventLogging because IE6 gets no JS these days. Maybe, if there's old enough data...

On Wed, Oct 15, 2014 at 12:46 PM, Oliver Keyes <okeyes@wikimedia.org> wrote:
Update: Yuvi's pointed me towards a login attempts schema. All 4 are doable. Data tomorrow morning EST at the latest.

On 15 October 2014 15:19, Oliver Keyes <okeyes@wikimedia.org> wrote:
(With "jiffy" read "a day"; even with sampling, big logs are big, and I imagine we probably want ~30 days of data.)

On 15 October 2014 15:18, Oliver Keyes <okeyes@wikimedia.org> wrote:
First three are pretty trivial; last one is a bit of a pain, but doable if someone wants to poke me on IRC (/query Ironholds) and chat about what an unambiguous successful login action would look like in terms of requests. But I can do the first three in a jiffy.

On 15 October 2014 13:32, Brandon Black <bblack@wikimedia.org> wrote:


On Wed, Oct 15, 2014 at 3:43 PM, Oliver Keyes <okeyes@wikimedia.org> wrote:
You invoked my name!

Emphasis is "logged-in". If you guys want more solid overall numbers, I can get those in short order; this seems like a pretty critical question to have data on, fast. Lemme know.

If you can source some good reliable numbers, probably what we care about (all of which have been estimated to some degree in this thread already, I think?) is:

% of all requests from IE6
% of all https requests from IE6
% of all text/html https requests from IE6 (not so important IMHO, if it's difficult)
% of all logged-in https requests (or alternatively, % of all successful https login attempts) from IE6.




--
Oliver Keyes
Research Analyst
Wikimedia Foundation



--
Oliver Keyes
Research Analyst
Wikimedia Foundation



--
Oliver Keyes
Research Analyst
Wikimedia Foundation

_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics




--
Best regards,
Max Semenik ([[User:MaxSem]])

_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics




--
Oliver Keyes
Research Analyst
Wikimedia Foundation



--
Oliver Keyes
Research Analyst
Wikimedia Foundation