If the data is actually copyrightable, then yes. Facts
as such are not
copyrightable. But if there was a bot transferring stuff from infoboxes, it
should at least check for any actual text (e.g. long values with spaces), and
not transfer it, because of license reasons.
I agree. Just to clarify what "actual text" should mean: Although a
short sentence with several words may occasionally be a copyrightable
text (e.g. a poem), it is very rarely so. On Wikipedia infoboxes, due
to scope, purpose and style, this can almost be excluded.
It is not desirable to exclude brief scope notes or source notes,
which occasionally occur in Wikipedia infoboxes, just because they
contain several words. I personally would recommend an extraction
dryrun and manually check for parameters that have more than perhaps
12-15 words, whether they are creative (= copyrightable) or plain
expressions of fact or sources (= not copyrightable).
Gregor