Re: [Pywikipedia-l] Template parsing code

24 Jan 2012


      -----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Hello Hannes
Just wondering; is your text parser able to correctly find all headings
(e.g. '== bla ==' as well as '<h2>bla</h2>') and distinguish headings
from other similar text but within a paragraph? And finally return the
byte offset of those headings?
I am using such a piece of code written with help of difflib and it is
may be useful here also? (even though I had not that much time to write
a unittest with full coverage... but a simple one is there ;)
Greetings
DrTrigon
On 23.01.2012 23:34, Hannes Röst wrote:
...
Hello all
...
From one of my assignments as a bot operator I have some code
which
does template parsing and general text parsing (e.g. Image/File
tags). It is not using regex and thus able to correctly parse
nested templates and other such nasty things. I have written those
as library classes and written tests for them which cover almost
all of the code. I would now really like to contribute that code
back to the community.
Would you be interested in adding this code to the pywikibot 
framework? If yes, can I send the code to someone for code review
or how do you usually operate?
Greetings
Hannes
PS: wiki userpage is
http://en.wikipedia.org/wiki/User:Hannes_R%C3%B6st
_______________________________________________ Pywikipedia-l
mailing list Pywikipedia-l@lists.wikimedia.org 
https://lists.wikimedia.org/mailman/listinfo/pywikipedia-l
...PGP SIGNATURE...
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.11 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iEYEARECAAYFAk8eceUACgkQAXWvBxzBrDBmJQCePmfUbs4Y8HNN18UT6vMFYo5r
N1AAoLuN1VLpZQOrwegmkKWl08Te0Rxp
=HXai
-----END PGP SIGNATURE-----

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

Re: [Pywikipedia-l] Template parsing code