Jsalsman added a comment.
John, DELPH-IN is completely language-independent and can even support Chinese variants, but I can't tell whether anyone has done anything with it for Indonesian yet. Even if they have, it might be much easier on the student applicants to suggest it only as a possible option, as an alternative to using regular expression lists to match, e.g., numbers occurring in the same sentences as a list of case-insensitive keywords. Give me a day or two to make sure that the morphology doesn't impede that and to see what's out there for Indonesian.
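For concreteness, the regular-expression alternative might look something like the rough sketch below. The keyword list, the sentence splitter, and the function name are all hypothetical, chosen only for illustration. It matches exact keyword forms only, so affixed word forms would be missed, which is the morphology concern raised above.

```python
import re

# Hypothetical keyword list; the real one would depend on the facts wanted.
KEYWORDS = ["population", "area", "elevation"]

# Case-insensitive match on any keyword as a whole word.
KEYWORD_RE = re.compile(
    r"\b(?:" + "|".join(map(re.escape, KEYWORDS)) + r")\b",
    re.IGNORECASE,
)
# Digits with optional thousands or decimal separators, e.g. 1,234,567 or 661.5.
NUMBER_RE = re.compile(r"\d+(?:[.,]\d+)*")
# Naive sentence splitter: break after ., ! or ? followed by whitespace.
SENTENCE_SPLIT_RE = re.compile(r"(?<=[.!?])\s+")


def numbers_near_keywords(text):
    """Return (keywords, numbers) pairs for sentences containing both."""
    results = []
    for sentence in SENTENCE_SPLIT_RE.split(text):
        keywords = [m.group(0).lower() for m in KEYWORD_RE.finditer(sentence)]
        numbers = NUMBER_RE.findall(sentence)
        if keywords and numbers:
            results.append((keywords, numbers))
    return results
```

Calling numbers_near_keywords() on a paragraph of article text returns one entry per sentence that contains at least one keyword and at least one number; sentences with only one of the two are skipped.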
TASK DETAIL
https://phabricator.wikimedia.org/T89416
REPLY HANDLER ACTIONS
Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign <username>.
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Jsalsman
Cc: jayvdb, Qgil, Risker, Bawolff, Aklapper, Jsalsman, Svick, jeremyb, pywikipedia-bugs
jayvdb added a comment.
An important prerequisite to choosing a language is determining how the tool would detect and extract facts from prose.
http://erg.delph-in.net/logon says it is using the LinGO English Resource Grammar (ERG), so I assume it is English-only. Do you know of a comparable tool for Indonesian, or for other languages that might be suitable?
TASK DETAIL
https://phabricator.wikimedia.org/T89416
To: jayvdb
Cc: jayvdb, Qgil, Risker, Bawolff, Aklapper, Jsalsman, Svick, jeremyb, pywikipedia-bugs
jayvdb created this task.
jayvdb added subscribers: jayvdb, valhallasw, XZise, Xqt.
jayvdb added a project: pywikibot-core.
Restricted Application added subscribers: Aklapper, pywikipedia-bugs.
TASK DESCRIPTION
There is a new pep8 rule, added in July 2014, that will be released soon. If it is deployed on the build machines, it will prevent any other changes from being merged.
The rule is E402 (module level import not at top of file), and a violation looks like:
pywikibot/data/api.py:10:1: E402 module level import not at top of file
https://github.com/jcrocholl/pep8/issues/264
https://github.com/jcrocholl/pep8/commit/1ee296bca0fa611d3dbe87c5c5c8009e44…
pywikibot has several imports not at the top of the file, due to cyclic dependencies, but that number is small compared to the number of these errors caused by __version__ appearing before the imports.
Note that pep8 also needed to change their code to relocate __version__ to appear below imports.
https://github.com/jcrocholl/pep8/commit/373e0ac1138f0e24422b5e2e78f02ed055…
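To illustrate the pattern, here is a minimal sketch of a module with __version__ placed before the imports, the same module with it moved below them, and a simplified checker. The checker is only a rough approximation written for this example, not pep8's actual implementation, and the module contents and the late_imports name are hypothetical.

```python
import ast


def late_imports(source):
    """Return line numbers of module-level imports that follow other code.

    Simplified illustration: only a leading docstring and earlier imports
    may precede an import; anything else counts as code.
    """
    flagged = []
    seen_code = False
    for index, node in enumerate(ast.parse(source).body):
        if isinstance(node, (ast.Import, ast.ImportFrom)):
            if seen_code:
                flagged.append(node.lineno)
        elif (
            index == 0
            and isinstance(node, ast.Expr)
            and isinstance(node.value, ast.Constant)
            and isinstance(node.value.value, str)
        ):
            continue  # module docstring is allowed before imports
        else:
            seen_code = True  # e.g. a __version__ assignment
    return flagged


# Hypothetical module with __version__ ahead of the imports.
BEFORE = '''"""Example module."""
__version__ = "$Id$"

import os
import sys
'''

# Same module with __version__ relocated below the imports.
AFTER = '''"""Example module."""
import os
import sys

__version__ = "$Id$"
'''
```

In this sketch, late_imports(BEFORE) flags both import lines, while late_imports(AFTER) flags none, which mirrors the relocation described above.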
There is a broader issue of the usefulness of these __version__ variables for every file, and there is some $Id$ voodoo in bot.py:740-745 which we should revisit.
TASK DETAIL
https://phabricator.wikimedia.org/T87409
To: jayvdb
Cc: Xqt, XZise, valhallasw, jayvdb, Aklapper, pywikipedia-bugs