Yeah, I see now. Previously, I'd just run "select * from revision limit 1" to see the lay of the land. I got some columns with rev_page=0, which confused me. Now I realized that I was looking at the ancient part of the table, which followed the old schema. I got it to work now.
Thanks for the help, everyone!
-ManishEarth


On Sat, Apr 23, 2011 at 12:09 PM, MZMcBride <z@mzmcbride.com> wrote:
Manish Goregaokar wrote:
>> An important key in every allwiki-iteration is imho the re-use of
>> connections.
> Yeah, I already do that. I've ORDER BY server'd , and I only close and reopen
> when the server changes. It's quite fast.
>
> What's the quick way to get user/page contribution data? I tried using the
> page table through putty, but it hanged. The revision table works, but I can't
> get the page title from it. For some reason, the column rev_page is populated
> with zeroes. I am able to get the username, revid, timestamp, and summary, but
> I can't fetch the page name/pageid. ANy help?

The quick way to get the user's edit count is to use the stored value from
user.user_editcount.

If you're trying to use the page table to get edit contribution data, you're
going to have trouble, because all edits (contributions) are in the revision
table.

If you want to use the revision table and fetch page titles, do a join on
rev_page = page_id. Then you can select page_namespace and page_title (or
the namespace name instead of an integer from the toolserver.namespacename
table).

More info about accessing and using the replicated databases is available
here: <https://wiki.toolserver.org/view/Database_access>.

MZMcBride

P.S. Plaintext e-mails, please. :-)



_______________________________________________
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/toolserver-l
Posting guidelines for this list: https://wiki.toolserver.org/view/Mailing_list_etiquette