Re: [Wikitech-l] Datamining infoboxes

23 Oct 2009


      2009/10/23 William Pietri william@scissor.com:
...
George Herbert wrote:
...
...
This discussion brings to mind several historical threads.
I wonder if a project to simply mine the whole article contents and
provide a DB of some sort with the articles and infobox contents would
be worthwhile.  Develop a specific parser and generate and publish the
complete set of article-infobox-(key-value) sets...
...
I don't know anybody on the data side at Metaweb anymore, but I know
that they did something like that to import a lot of structured
Wikipedia data into their Freebase project. They publish some sort of
data dump here:
http://download.freebase.com/wex/
Perhaps they'd be willing to open-source their parser.
They're right into open source, I suspect they would.
- d.

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Re: [Wikitech-l] Datamining infoboxes