Jan Hidders wrote: [snip]
That hard part is of course to come up with the contex-free grammar (it should probably be LALR(1) at that). Since I used to teach compiler theory I might be of some help there.
Yes, and that's the *only* part that will help. Having two or three or five intermediate formats doesn't do anything to help the problem -- making the actual parser actually work token by token will.
IMHO putting a lot of emphasis on output formats is a mistake, since it ignores the actual problem.
-- brion vibber (brion @ pobox.com)