Probably a silly question but ... did you all consider creating a datatype
for molecue representation ? This seem to be a very similar usecase than
mathematica formula. Essentially we're not dealing with a raw string but a
representation of molecule formulas, with its own encoding ...
Changing the limit seem to be a poor workaround to a dedicated datatype -
nobody seems to have found a relevant usecase and it seem to me that we're
essentially abusing strings for storing blobs ...
2016-10-08 11:33 GMT+02:00 Egon Willighagen <egon.willighagen(a)gmail.com>om>:
On Sat, Oct 8, 2016 at 11:28 AM, Lydia Pintscher <
On Sat, Oct 8, 2016 at 11:23 AM, Egon
External identifier then. Cool. And for string like in
Sebastian's initial email
says 1500 to 2000. Is this still a good number after this discussion?
Yes, that would cover more than 99.9% of all InChIs in PubChem. (See
Sebastian's reply earlier in this thread.)
Department of Bioinformatics - BiGCaT
Maastricht University (http://www.bigcat.unimaas.nl/
Wikidata mailing list