On 12/01/12 14:37, Sebastian Hellmann wrote:
Hello all, is there a query language for wiki syntax? (NOTE: I really do not mean the Wikipedia API here.)
I am looking for an easy way to scrape data from Wiki pages. In this way, we could apply a crowd-sourcing approach to knowledge extraction from Wikis.
There must be thousands of data scraping approaches. But is there one amongst them that has developed a "wiki scraper language"? Maybe with some sort of fuzziness involved, if the pages are too messy.
I don't think so. The closest thing I know of is applying regexes to the page source, which you may find too powerful or too low-level.
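
To give an idea of what the regex approach looks like in practice, here is a minimal Python sketch. The wikitext sample, the template and field names, and the helper functions extract_fields and extract_links are all made up for illustration. It only handles simple one-line "| key = value" template parameters and plain [[links]], so nested templates or multi-line values would need more work.

```python
import re

# Hypothetical wikitext sample; the template and its fields are invented for illustration.
WIKITEXT = """{{Infobox person
| name = Ada Lovelace
| birth_date = 10 December 1815
| known_for = [[Analytical Engine]] notes
}}
'''Ada Lovelace''' was an English [[mathematician]] and [[writer|author]].
"""

# Matches one-line template parameters of the form "| key = value".
FIELD_RE = re.compile(r"^\|[ \t]*(\w+)[ \t]*=[ \t]*(.*?)[ \t]*$", re.MULTILINE)

# Matches [[target]] and [[target|label]]; group 1 is the link target.
LINK_RE = re.compile(r"\[\[([^\]|]+)(?:\|([^\]]*))?\]\]")


def extract_fields(text):
    """Return a dict of simple one-line template parameters."""
    return dict(FIELD_RE.findall(text))


def extract_links(text):
    """Return the targets of all internal links, in order of appearance."""
    return [m.group(1).strip() for m in LINK_RE.finditer(text)]


if __name__ == "__main__":
    print(extract_fields(WIKITEXT))
    print(extract_links(WIKITEXT))
```

Even this small case shows why it feels low-level: every page layout needs its own patterns, and nothing here copes with messy or inconsistent markup.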