The on-wiki version of this newsletter can be found here: https://meta.wikimedia.org/wiki/Abstract_Wikipedia/Updates/2023-11-16 -- What types next?
We currently support two types: strings and Booleans. Our next big step for expanding the capabilities of Wikifunctions is by introducing more types. We are working hard to get all the pieces in place: we have previously talked about serialization https://meta.wikimedia.org/wiki/Abstract_Wikipedia/Updates/2023-09-27 to make types work seamlessly with programming languages, and renderers and parsers https://meta.wikimedia.org/wiki/Abstract_Wikipedia/Updates/2023-09-20 to improve the experience when using them. https://meta.wikimedia.org/wiki/File:Data_types_-_en.svgData can take many different types for many different domains
The first new type we expect to open for the community is a type for lists. This will not only allow for many new functions that are not possible yet (such as breaking a sentence into words), but also to simplify existing implementations such as this test for mutation in Breton https://www.wikifunctions.org/view/en/Z11601 or this (overly simple) check if a letter is a vowel https://www.wikifunctions.org/view/en/Z11894, implementing a function we created collaboratively in this week’s Volunteer Corner.
But, as asked by Cool314 on the Project Chat https://www.wikifunctions.org/wiki/Wikifunctions:Project_chat#Once_we_get_new_types,_which_ones_will_be_coming_out_first?, what should be the next types that the community would like to see? They also suggest starting with the list type, which, as said, is on the way. They say to follow with integers, and if that means non-negative integers or counting numbers, I would be totally on-board. But what would be next?
The list of suggested functions requiring new types https://www.wikifunctions.org/wiki/Wikifunctions:Suggest_a_function#Proposed_functions_requiring_future_types also strongly suggests counting numbers. After that we find suggestions talking about more complicated numbers, such as negative numbers, floating-point numbers, fractions; then also bytes and specific-length vectors of bytes; colors; years, months, and dates; and others.
One thing we need to take into account is to go for simpler types first: for example, a calendar day could be built from a counting number and a month, or it could be built from two counting numbers – but one way or the other, we would need counting numbers first.
Another question is whether we prefer to have our types to be built from simpler elements, or whether we prefer more complex types. To give a very simple example, there are many different ways to represent integers, two of which would be:
1. An integer could be represented by an object with a single key, a single string that starts with an optional “-” and ends with a whole number, i.e. a list of digits with no leading zeros 2. An integer could be represented by an object with two keys, one being the sign (which is one of negative, positive, or none), and the other a counting number, which we would have previously defined as a type
Then there is also the question of which string would represent which value, a question that needs to be answered for each individual type. For numbers one straightforward solution is to take the string representation of the numbers in Hindu-Arabic numerals, but one could also consider binary, hexadecimal, or even base64 encodings, potentially reducing storage space. I think that the potential to more easily understand a JSON representation would beat the small storage gain.
Finally, what limits, if any, should we put on these values? Frequently, programming languages have a numerical limit for their 'number' type of the range from 0 to 4.2 billion (2^^32) or 0 to 18 qunitillion https://en.wiktionary.org/wiki/quintillion (2^^64). Should we add a limit like this as well, or should we expect function writers to work with any possible input?
Note that this does not mean that we only will be able to deal with Hindu-Arabic numerals: with parsers and renderers https://meta.wikimedia.org/wiki/Abstract_Wikipedia/Updates/2023-09-20 we will be able to display numbers appropriately for a given language, remaining fully translated for whichever language and locale users are using. We might not necessarily have that feature available immediately when introducing new types, but we will be working on enabling them soon.
Let us hear and discuss what you think (or even come up with a process for us to follow, based on the types you are suggesting). What are the types you are looking forward to? Recent changes to Wikifunctions software
From now on, we will try to give a quick summary with each update of what work you can see as it rolls out to Wikifunctions.
In terms of big items, following the completion of "General Availability" last week https://meta.wikimedia.org/wiki/Abstract_Wikipedia/Updates/2023-11-07, we've been primarily working on the software to better support types (as discussed above; T343469 https://phabricator.wikimedia.org/T343469). On the back-end this has meant infrastructure support for applying custom code to convert from Wikifunction types to native types and back (T297509 https://phabricator.wikimedia.org/T297509), and on the front-end we've been working on using lists as inputs and outputs of functions, coming soon (T326301 https://phabricator.wikimedia.org/T326301). We've also been working with several volunteers to understand how we can improve the on-boarding experience (T285509 https://phabricator.wikimedia.org/T285509); thank you to everyone who has taken part.
Our general improvements have included progress on being more consistent in using proper, specific, translatable errors consistently throughout the system (T321113 https://phabricator.wikimedia.org/T321113), and researching if it's possible to make any further simple improvements to picking a type or object (T345547 https://phabricator.wikimedia.org/T345547).
As minor fixes, we now use a different title on the creation page depending on whether you're making a function, implementation, test case, or type, rather than just say 'object' (T350673 https://phabricator.wikimedia.org/T350673 and T341847 https://phabricator.wikimedia.org/T341847). We also fixed the view of aliases to be one bulleted list of several items, not several lists of just one (T345404 https://phabricator.wikimedia.org/T345404).
You can browse the full list of deployed changes https://www.mediawiki.org/wiki/MediaWiki_1.42/wmf.5#WikiLambda for the MediaWiki front-end for Wikifunctions. We didn't deploy any back-end service changes this week. No newsletter next week
Due to holidays, we will be skipping next week’s newsletter. Expect to hear from us again after Thanksgiving https://en.wikipedia.org/wiki/Thanksgiving!
abstract-wikipedia@lists.wikimedia.org