Thanks MZMcBride. That makes sense. I will try to.
--Takashi
On Sun, Feb 5, 2017 at 11:39 MZMcBride z@mzmcbride.com wrote:
Takashi OTA wrote:
After importing categorylinks.sql and page.sql, downloaded from https://dumps.wikimedia.org/jawiki/latest/jawiki-latest-category.sql.gz https://dumps.wikimedia.org/jawiki/latest/jawiki-latest-page.sql.gz
on to local MySQL database "jawiki_p", with the instructions shown at: https://github.com/eggpi/citationhunt/blob/master/scripts/README.md .
(I have done it like; $ mysql -u root mysql> create database jawiki_p; mysql> use jawiki_p; mysql> source jawiki-latest-category.sql; mysql> source jawiki-latest-page.sql; )
When you run scripts/print_unsourced_pageids_from_wikipedia.py after setting CH_LANG, it dumped an error shown below:
(ch-venv) Mac-mini:scripts takot$ export CH_LANG=en (ch-venv) Mac-mini:scripts takot$ echo $CH_LANG ja (ch-venv) Mac-mini:scripts takot$ ./print_unsourced_pageids_from_wikipedia.py > unsourced Traceback (most recent call last): File "./print_unsourced_pageids_from_wikipedia.py", line 40, in <module> print_unsourced_ids_from_wikipedia() File "./print_unsourced_pageids_from_wikipedia.py", line 21, in print_unsourced_ids_from_wikipedia ' OR '.join(['cl_to = %s'] * len(categories)) + ')', categories) File "/Users/takot/ch-venv/lib/python2.7/site-packages/MySQLdb/cursors.py", line 205, in execute self.errorhandler(self, exc, value) File "/Users/takot/ch-venv/lib/python2.7/site-packages/MySQLdb/connections.py", line 36, in defaulterrorhandler raise errorclass, errorvalue _mysql_exceptions.ProgrammingError: (1146, "Table 'jawiki_p.categorylinks' doesn't exist")
I think you're confusing these two database tables:
- https://www.mediawiki.org/wiki/Manual:Category_table
- https://www.mediawiki.org/wiki/Manual:Categorylinks_table
It looks like you loaded category, but the script is complaining about categorylinks.
MZMcBride
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l