After importing categorylinks.sql and page.sql,
downloaded from
https://dumps.wikimedia.org/jawiki/latest/jawiki-latest-category.sql.gz
https://dumps.wikimedia.org/jawiki/latest/jawiki-latest-page.sql.gz
on to local MySQL database "jawiki_p", with the instructions shown at:
https://github.com/eggpi/citationhunt/blob/master/scripts/README.md .
(I have done it like;
$ mysql -u root
mysql> create database jawiki_p;
mysql> use jawiki_p;
mysql> source jawiki-latest-category.sql;
mysql> source jawiki-latest-page.sql; )
When you run scripts/print_unsourced_pageids_from_wikipedia.py
after setting CH_LANG, it dumped an error shown below:
(ch-venv) Mac-mini:scripts takot$ export CH_LANG=en
(ch-venv) Mac-mini:scripts takot$ echo $CH_LANG
ja
(ch-venv) Mac-mini:scripts takot$
./print_unsourced_pageids_from_wikipedia.py > unsourced
Traceback (most recent call last):
File "./print_unsourced_pageids_from_wikipedia.py", line 40, in
<module>
print_unsourced_ids_from_wikipedia()
File "./print_unsourced_pageids_from_wikipedia.py", line 21, in
print_unsourced_ids_from_wikipedia
' OR '.join(['cl_to = %s'] * len(categories)) + ')',
categories)
File
"/Users/takot/ch-venv/lib/python2.7/site-packages/MySQLdb/cursors.py",
line
205, in execute
self.errorhandler(self, exc, value)
File
"/Users/takot/ch-venv/lib/python2.7/site-packages/MySQLdb/connections.py",
line 36, in defaulterrorhandler
raise errorclass, errorvalue
_mysql_exceptions.ProgrammingError: (1146, "Table 'jawiki_p.categorylinks'
doesn't exist")
It looks like you loaded category, but the script is complaining about
categorylinks.
MZMcBride