On Sat, 02-02-2008 at 14:31 -0700, Mark Williamson wrote:
Fran, very intriguing! I'd actually been thinking of something along these lines for the past couple of weeks.
Would it be possible to do a statistical analysis of articles, as well? I would imagine that in longer articles, there would be certain words that would have similar frequencies, whether or not they are direct translations.
In short, check out this link:
http://citeseer.ist.psu.edu/509449.html
which gives a nice overview of various techniques. If you'd like further details, please feel free to contact me off-list so we don't clog up the works.
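For anyone following along, here is a very rough Python sketch of the frequency idea Mark raises: compute each word's share of the tokens in two articles and list the frequent words whose shares are close. It assumes plain-text input and naive word-character tokenization. The function names, the `top` cutoff and the `tolerance` threshold are made up for illustration and are not taken from the paper linked above; the techniques described there are considerably more careful.

```python
import re
from collections import Counter


def relative_frequencies(text):
    """Map each lowercased word in `text` to its share of all word tokens."""
    words = re.findall(r"\w+", text.lower())
    counts = Counter(words)
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()} if total else {}


def candidate_pairs(text_a, text_b, top=20, tolerance=0.25):
    """Pair frequent words from two articles whose relative frequencies are close.

    `top` and `tolerance` are arbitrary illustrative values, not tuned settings.
    """
    freq_a = relative_frequencies(text_a)
    freq_b = relative_frequencies(text_b)
    # Keep only the most frequent words on each side.
    common_a = sorted(freq_a, key=freq_a.get, reverse=True)[:top]
    common_b = sorted(freq_b, key=freq_b.get, reverse=True)[:top]
    pairs = []
    for wa in common_a:
        for wb in common_b:
            fa, fb = freq_a[wa], freq_b[wb]
            # Keep the pair if the relative difference is within tolerance.
            if abs(fa - fb) / max(fa, fb) <= tolerance:
                pairs.append((wa, wb, fa, fb))
    return pairs
```

Calling `candidate_pairs(spanish_text, english_text)` returns tuples of (word from the first article, word from the second article, and their two frequencies). On short texts this will produce plenty of spurious matches, so treat the output as candidates to inspect rather than as translations.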
Fran