Some thoughts below (we will be creating tickets for some of these so we do not forget).

>connection speed field...somewhere ;)
I think many teams would benefit from this one, but as far as I can see there is currently no way to get this information reliably. The Performance team recently opened a ticket about this and we found it was not possible. See: https://phabricator.wikimedia.org/T119801 and http://www.w3.org/TR/netinfo-api/


>'is_spider' column in event logs (Based on regex of user agent- based on conversation with Madhu and Kevin, I know it's not going to get most of the bots)
For EventLogging data? On the contrary, it will catch most of the bot traffic. That is not true of our pageview data, and I can explain why in more detail on IRC. Please let me know if I misunderstood and this is not about EL.
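
For reference, a minimal sketch in Python of what a regex-based 'is_spider' flag could look like. The pattern and the function name are illustrative assumptions, not the regex we would actually deploy, and a pattern this simple will also produce some false positives (any user agent that happens to contain "bot"):

```python
import re

# Hypothetical pattern for illustration only; a production regex
# would need to be tuned against real user agent data.
SPIDER_PATTERN = re.compile(r"bot|crawler|spider", re.IGNORECASE)


def is_spider(user_agent):
    """Return True when the user agent string looks like an automated client."""
    if not user_agent:
        # Assumption: a missing or empty user agent is left unflagged.
        return False
    return bool(SPIDER_PATTERN.search(user_agent))
```

The result would populate the proposed column for each event, for example is_spider(event_user_agent), where event_user_agent stands for whatever field holds the raw user agent string.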

>App session tables split by OS (per ticket: T117615)
Agreed. This should be part of the background work we do that is not related to goals. 

>Piwik up and running for iOS (simple machine spin-up, if not finished in Q2)
This would be very useful for things besides iOS, but it is harder than it looks: holding production data requires proper infrastructure (so that we can abide by our privacy policy) and, to be honest, I do not think we can get there next quarter. I think Dan can add more detail here, as we have tried to use Piwik outside labs before without success.


>Input pageview data into tableau or some other non-query table maker
Need to think more about this use case. 

On Wed, Dec 2, 2015 at 9:16 PM, Jon Katz <jkatz@wikimedia.org> wrote:
Hi Nuria,
Thank you for publishing your draft so early and requesting feedback.  I apologize for being so late in responding--the short holiday week caused a serious backlog on my end.  

For now, I only have a preliminary prioritized list of reading requests for analytics in Q3 (and beyond).  I can confirm by Friday. The following are in order of priority:
  • Piwik up and running for iOS (simple machine spin-up, if not finished in Q2)
  • Input pageview data into tableau or some other non-query table maker
  • 'is_spider' column in event logs (Based on regex of user agent- based on conversation with Madhu and Kevin, I know it's not going to get most of the bots)
  • App session tables split by OS (per ticket: T117615)
  • Browser reports you mention in your current priorities (this is low for us if it is in hive already, though you probably have other stakeholders)
  • connection speed field...somewhere ;)
I'd be happy to discuss more either via chat or email.  
Best,

J


On Tue, Nov 24, 2015 at 8:49 AM, Nuria Ruiz <nuria@wikimedia.org> wrote:
Hello, 

Please see preliminary goals for the analytics infrastructure team next quarter. Replies to this thread with suggestions and feedback are welcome. 



Thanks, 

Nuria

_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics