Any R nerds with Stat1 access: Stat1 now has the RMySQL package installed. What this means is the following; if you have something from the db you want to analyse, there's no need to faff around on the command line or inside MySQL to manually export it as a file, import it into R, analyse it, and then leave a big juicy TSV around for people to poke at when they're bored. You can run SQL queries against the slaves from inside R, and have these queries and associated data fall off and die as soon as you close the session or run rm(). It's also good for replicability, since people will now (largely) be able to retrieve the same data and run the same analysis using a single script. Thanks to Jeff Green and Andrew Otto for getting it up and running :).
Awessoooome!
On Feb 11, 2013, at 12:21 PM, Oliver Keyes okeyes@wikimedia.org wrote:
Any R nerds with Stat1 access: Stat1 now has the RMySQL package installed. What this means is the following; if you have something from the db you want to analyse, there's no need to faff around on the command line or inside MySQL to manually export it as a file, import it into R, analyse it, and then leave a big juicy TSV around for people to poke at when they're bored. You can run SQL queries against the slaves from inside R, and have these queries and associated data fall off and die as soon as you close the session or run rm(). It's also good for replicability, since people will now (largely) be able to retrieve the same data and run the same analysis using a single script. Thanks to Jeff Green and Andrew Otto for getting it up and running :).
-- Oliver Keyes Community Liaison, Product Development Wikimedia Foundation _______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
that's great to hear, thanks Oliver for taking the lead on this and Jeff/Andrew for your support
On Feb 11, 2013, at 9:41 AM, Andrew Otto otto@wikimedia.org wrote:
Awessoooome!
On Feb 11, 2013, at 12:21 PM, Oliver Keyes okeyes@wikimedia.org wrote:
Any R nerds with Stat1 access: Stat1 now has the RMySQL package installed. What this means is the following; if you have something from the db you want to analyse, there's no need to faff around on the command line or inside MySQL to manually export it as a file, import it into R, analyse it, and then leave a big juicy TSV around for people to poke at when they're bored. You can run SQL queries against the slaves from inside R, and have these queries and associated data fall off and die as soon as you close the session or run rm(). It's also good for replicability, since people will now (largely) be able to retrieve the same data and run the same analysis using a single script. Thanks to Jeff Green and Andrew Otto for getting it up and running :).
-- Oliver Keyes Community Liaison, Product Development Wikimedia Foundation _______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Whaaaaaat. That's so cool! Thank you so much for looking out for us R nerds :)
On Mon, Feb 11, 2013 at 10:49 AM, Dario Taraborelli < dtaraborelli@wikimedia.org> wrote:
that's great to hear, thanks Oliver for taking the lead on this and Jeff/Andrew for your support
On Feb 11, 2013, at 9:41 AM, Andrew Otto otto@wikimedia.org wrote:
Awessoooome!
On Feb 11, 2013, at 12:21 PM, Oliver Keyes okeyes@wikimedia.org wrote:
Any R nerds with Stat1 access: Stat1 now has the RMySQL package
installed. What this means is the following; if you have something from the db you want to analyse, there's no need to faff around on the command line or inside MySQL to manually export it as a file, import it into R, analyse it, and then leave a big juicy TSV around for people to poke at when they're bored. You can run SQL queries against the slaves from inside R, and have these queries and associated data fall off and die as soon as you close the session or run rm(). It's also good for replicability, since people will now (largely) be able to retrieve the same data and run the same analysis using a single script. Thanks to Jeff Green and Andrew Otto for getting it up and running :).
-- Oliver Keyes Community Liaison, Product Development Wikimedia Foundation _______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
wmfresearch mailing list wmfresearch@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wmfresearch
Hardly selflessness ;p
On 11 February 2013 18:57, Maryana Pinchuk mpinchuk@wikimedia.org wrote:
Whaaaaaat. That's so cool! Thank you so much for looking out for us R nerds :)
On Mon, Feb 11, 2013 at 10:49 AM, Dario Taraborelli < dtaraborelli@wikimedia.org> wrote:
that's great to hear, thanks Oliver for taking the lead on this and Jeff/Andrew for your support
On Feb 11, 2013, at 9:41 AM, Andrew Otto otto@wikimedia.org wrote:
Awessoooome!
On Feb 11, 2013, at 12:21 PM, Oliver Keyes okeyes@wikimedia.org
wrote:
Any R nerds with Stat1 access: Stat1 now has the RMySQL package
installed. What this means is the following; if you have something from the db you want to analyse, there's no need to faff around on the command line or inside MySQL to manually export it as a file, import it into R, analyse it, and then leave a big juicy TSV around for people to poke at when they're bored. You can run SQL queries against the slaves from inside R, and have these queries and associated data fall off and die as soon as you close the session or run rm(). It's also good for replicability, since people will now (largely) be able to retrieve the same data and run the same analysis using a single script. Thanks to Jeff Green and Andrew Otto for getting it up and running :).
-- Oliver Keyes Community Liaison, Product Development Wikimedia Foundation _______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
wmfresearch mailing list wmfresearch@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wmfresearch
-- Maryana Pinchuk Associate Product Manager, Wikimedia Foundation wikimediafoundation.org
wmfresearch mailing list wmfresearch@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wmfresearch
(http://cran.r-project.org/web/packages/RMySQL/RMySQL.pdf is the relevant documentation, btw - you want to read up on dbConnect and dbSendQuery)
On 11 February 2013 18:58, Oliver Keyes okeyes@wikimedia.org wrote:
Hardly selflessness ;p
On 11 February 2013 18:57, Maryana Pinchuk mpinchuk@wikimedia.org wrote:
Whaaaaaat. That's so cool! Thank you so much for looking out for us R nerds :)
On Mon, Feb 11, 2013 at 10:49 AM, Dario Taraborelli < dtaraborelli@wikimedia.org> wrote:
that's great to hear, thanks Oliver for taking the lead on this and Jeff/Andrew for your support
On Feb 11, 2013, at 9:41 AM, Andrew Otto otto@wikimedia.org wrote:
Awessoooome!
On Feb 11, 2013, at 12:21 PM, Oliver Keyes okeyes@wikimedia.org
wrote:
Any R nerds with Stat1 access: Stat1 now has the RMySQL package
installed. What this means is the following; if you have something from the db you want to analyse, there's no need to faff around on the command line or inside MySQL to manually export it as a file, import it into R, analyse it, and then leave a big juicy TSV around for people to poke at when they're bored. You can run SQL queries against the slaves from inside R, and have these queries and associated data fall off and die as soon as you close the session or run rm(). It's also good for replicability, since people will now (largely) be able to retrieve the same data and run the same analysis using a single script. Thanks to Jeff Green and Andrew Otto for getting it up and running :).
-- Oliver Keyes Community Liaison, Product Development Wikimedia Foundation _______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
wmfresearch mailing list wmfresearch@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wmfresearch
-- Maryana Pinchuk Associate Product Manager, Wikimedia Foundation wikimediafoundation.org
wmfresearch mailing list wmfresearch@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wmfresearch
-- Oliver Keyes Community Liaison, Product Development Wikimedia Foundation
On Mon, Feb 11, 2013 at 6:49 PM, Dario Taraborelli dtaraborelli@wikimedia.org wrote:
that's great to hear, thanks Oliver for taking the lead on this and Jeff/Andrew for your support
On Feb 11, 2013, at 9:41 AM, Andrew Otto otto@wikimedia.org wrote:
Awessoooome!
On Feb 11, 2013, at 12:21 PM, Oliver Keyes okeyes@wikimedia.org wrote:
Any R nerds with Stat1 access: Stat1 now has the RMySQL package installed. What this means is the following; if you have something from the db you want to analyse, there's no need to faff around on the command line or inside MySQL to manually export it as a file, import it into R, analyse it, and then leave a big juicy TSV around for people to poke at when they're bored. You can run SQL queries against the slaves from inside R, and have these queries and associated data fall off and die as soon as you close the session or run rm(). It's also good for replicability, since people will now (largely) be able to retrieve the same data and run the same analysis using a single script. Thanks to Jeff Green and Andrew Otto for getting it up and running :).
I didn't read this blog yet but it looks relevant to you guys. See below.
-Jeremy
---------- Forwarded message ---------- From: Dan Nguyen dn@skift.com Date: Tue, Feb 12, 2013 at 1:50 PM Subject: [NICAR-L] 10 R packages I wish I knew about earlier
http://blog.yhathq.com/posts/10-R-packages-I-wish-I-knew-about-earlier.html
The subject is actually just the title of the post, I've barely ever used R though this really well-illustrated list has convinced me to put more effort into checking it out. I especially like the libraries that can directly access your SQL databases and let you write SQL inside an R script.
-- Dan Nguyen Head of Data at Skift.com Twitter: @dancow dn@skift.com
Good resources all! Four (or five?) of those packages are all written by *the same guy*, Hadley Wickham, who is, imo, pretty much single-handedly responsible for making R something people willingly use.
(if any of you have not used plyr or ggplot2 yet, you now have no excuse)
On 12 February 2013 13:57, Jeremy Baron jeremy@tuxmachine.com wrote:
On Mon, Feb 11, 2013 at 6:49 PM, Dario Taraborelli dtaraborelli@wikimedia.org wrote:
that's great to hear, thanks Oliver for taking the lead on this and
Jeff/Andrew for your support
On Feb 11, 2013, at 9:41 AM, Andrew Otto otto@wikimedia.org wrote:
Awessoooome!
On Feb 11, 2013, at 12:21 PM, Oliver Keyes okeyes@wikimedia.org
wrote:
Any R nerds with Stat1 access: Stat1 now has the RMySQL package
installed. What this means is the following; if you have something from the db you want to analyse, there's no need to faff around on the command line or inside MySQL to manually export it as a file, import it into R, analyse it, and then leave a big juicy TSV around for people to poke at when they're bored. You can run SQL queries against the slaves from inside R, and have these queries and associated data fall off and die as soon as you close the session or run rm(). It's also good for replicability, since people will now (largely) be able to retrieve the same data and run the same analysis using a single script. Thanks to Jeff Green and Andrew Otto for getting it up and running :).
I didn't read this blog yet but it looks relevant to you guys. See below.
-Jeremy
---------- Forwarded message ---------- From: Dan Nguyen dn@skift.com Date: Tue, Feb 12, 2013 at 1:50 PM Subject: [NICAR-L] 10 R packages I wish I knew about earlier
http://blog.yhathq.com/posts/10-R-packages-I-wish-I-knew-about-earlier.html
The subject is actually just the title of the post, I've barely ever used R though this really well-illustrated list has convinced me to put more effort into checking it out. I especially like the libraries that can directly access your SQL databases and let you write SQL inside an R script.
-- Dan Nguyen Head of Data at Skift.com Twitter: @dancow dn@skift.com
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics