In today's Language Search sprint planning meeting, we came up with the following short-term plan (spanning roughly the next few days to a week):
The developers are making good progress implementing Treys "Minimum Viable Product" Relevance Lab[1], and if things go well, it should be functional soon. It will allow us to feed sets of queries in, have them run through two different search rules, and compare the results.
For now, those result comparisons will be able to objectively note Zero Results Rates, and the rest of the comparison will be subjective human "Were results A better than results B?" Such are the limitations of an MVP.