I have seen many messy text-image mixes on Google books, especially older texts from manual typesetting days. That's why I was wondering if it would be possible to have a tool that stores pages as you go, so you can step in and adjust it on a per page basis. I am not familiar with abbyy.xml files, but this may be the way to go