Hi all,
A number of people
<https://en.wikipedia.org/wiki/Wikipedia:Help_desk#inaccurate_covid_19_cases…>
have noticed over the past ~day that the data for cases reported in
Wisconsin by county is showing up incorrectly in the Google stats card.
I've verified that this is still an issue. It looks like it might be due to
the stats card pulling in data from the wrong row of the table
<https://en.wikipedia.org/wiki/2020_coronavirus_pandemic_in_Wisconsin#Statis…>
(negative
or per-capita tests rather than confirmed positive), but I'm not 100% sure.
Quickly scanning the history of the page, looks like there were some recent
formatting tweaks – possible that they caused an issue in scraping/parsing?
*Google team: *let me know if there's any more information I can provide to
assist with triage!
*COVID case count task force folks:* looks like the person who added the
by-county table to that page was looking for help with table formatting
<https://en.wikipedia.org/wiki/Talk:2020_coronavirus_pandemic_in_Wisconsin>
(maybe
related to why this issue occurred?), just FYI in case you want to help
them out :)
Best,
Maryana
--
*Maryana Pinchuk* (she/her)
Senior Partnerships Manager
Wikimedia Foundation <https://wikimediafoundation.org/>
==What is this list?==
This readout highlights the partnership between Google
<https://www.blog.google/products/search/connecting-people-covid-19-informat…>
and Wikipedia <https://en.wikipedia.org/wiki/Wikipedia:WikiProject_COVID-19>
on surfacing COVID-19 data. This digest aims to shed light on user trends
and data sources so that both Google and Wikipedia can more effectively
surface relevant COVID-19 information to users across the globe.
The information shared in this digest will be:
-
Updated on a weekly cadence
-
Populated/updated by relevant stakeholders by 6pm US PST every Thursday and
sent out Friday morning US PST.
Shared transparently with Google, Wikimedia Foundation, and Wikipedia’s
language communities.
==Google Feedback==
No new updates this week!
==WMF Feedback==
The Product Analytics team has been providing frequently updated metrics
around Wikipedia usage during the COVID-19 pandemic. This data is now
available via dashboards that you can check at any time.
Pageviews information (updated daily) is available via Superset[1]:
https://superset.wikimedia.org/superset/dashboard/108/
Edits and editors information (updated weekly):
https://analytics.wikimedia.org/published/notebooks/weekly_edits/weekly_edi…
[1] To access Superset, you need a Wikimedia developer account (create one
here: https://www.mediawiki.org/wiki/Developer_account) and have your
account added to the WMF or NDA LDAP group (submit a request here:
https://phabricator.wikimedia.org/project/profile/1564/). You will need
your UNIX shell username and password when logging into Superset.
==Community Feedback==
No new discussions that I'm aware of, but please feel free to chime in if
there are any new proposed changes!
--
*Maryana Pinchuk* (she/her)
Senior Partnerships Manager
Wikimedia Foundation <https://wikimediafoundation.org/>
Hello,
Both Worldometer and the Johns Hopkins University have persistently
maintained incorrect counts for some countries. This leads a lot of users
to either update our tables to the wrong counts (ignoring our cited
sources) or repeatedly ask for updates in our talk pages.
I created a summary of common errors here:
https://en.wikipedia.org/wiki/Wikipedia:WikiProject_COVID-19/Case_Count_Tas…
France, New Zealand and Canada are countries were these errors are
persistent and never corrected. I would like to expand on the case for
France, which presents the worst error (quantitatively):
* Some overseas territories are double-counted in France total. Overseas
departments, as well as the collectivities of Saint Barthélemy and Saint
Martin are included in France official totals. Only the collectivities of
French Polynesia, New Caledonia, Saint Pierre and Miquelon and Wallis and
Futuna are not included. JHU double counts the former.
* Official France already counts include cases at nursing homes (EHPAD),
but Worldometer and JHU adds them to the total, effectively double-counting
them. The World Health Organization Situation Reports match this official
count, not JHU's or Worldometer's.
I have contacted JHU with no response so far. Do you think it would be
possible to approach them and ask about this? Unless me and other editors
got something completely wrong, JHU and Worldometer figures for France are
incorrectly inflated by ~30k cases.
Best,
Mario Gómez
==What is this list?==
This readout highlights the partnership between Google
<https://www.blog.google/products/search/connecting-people-covid-19-informat…>
and Wikipedia <https://en.wikipedia.org/wiki/Wikipedia:WikiProject_COVID-19>
on surfacing COVID-19 data. This digest aims to shed light on user trends
and data sources so that both Google and Wikipedia can more effectively
surface relevant COVID-19 information to users across the globe.
The information shared in this digest will be:
-
Updated on a weekly cadence
-
Populated/updated by relevant stakeholders by 6pm US PST every Thursday and
sent out Friday morning US PST.
Shared transparently with Google, Wikimedia Foundation, and Wikipedia’s
language communities.
==Google roadmap==
A few updates/callouts to Google's roadmap table here
<https://en.wikipedia.org/wiki/Wikipedia_talk:WikiProject_COVID-19/Case_Coun…>
:
1. For US totals, the Google team has found county-level data onwiki and is
displaying it for most US states, with the exception of *Nevada* and
*Kentucky*. If anyone can find a reliable source for county-level data for
these states and create tables for them onwiki, it would be much
appreciated!
2. Missing data: missing data noted in other rows of the roadmap table
linked above are still applicable this week:
- *top priority:* global distribution of cases by age, gender, and severity
- recoveries data
- daily data by country
- tests over time
==Google Feedback==
1. *Top Priority:* Finding global COVID data by age, gender, and severity
is Google’s top data priority this week. Does Wikipedia have this data that
includes global totals?
2. *[Fixed]* Incorrect data: Some of the map/stats flags were temporarily
wrong. Especially all the French territories (e.g. Guadeloupe) should show
the French flag. Affected territories include:
-The Islands of Guadeloupe, Martinique, Saint-Martin, Saint-Barthélemy,
Saint Pierre and Miquelon (Atlantic Ocean).
-Reunion island, Mayotte, the French Southern and Antarctic Lands (Indian
Ocean)
French Polynesia, New Caledonia, Wallis and Futuna (Pacific Ocean)
==Community Feedback==
1. *Request to update Google stats card global data attribution link. *The
Wikipedia attribution line of the stats card for global data currently
links to https://en.wikipedia.org/wiki/2019%E2%80%9320_coronavirus_pandemic,
but the data is actually housed in
https://en.wikipedia.org/wiki/Template:2019%E2%80%9320_coronavirus_pandemic….
The latter page is the one watched by the Wikipedians who monitor/update
the stats. The community has noticed an uptick of stats-related questions
and requests on the discussion page of
https://en.wikipedia.org/wiki/2019%E2%80%9320_coronavirus_pandemic,
probably as a result of Google’s attribution link – they’d like *Google to
update the global data attribution link to point to
https://en.wikipedia.org/wiki/Template:2019%E2%80%9320_coronavirus_pandemic…
<https://en.wikipedia.org/wiki/Template:2019%E2%80%9320_coronavirus_pandemic…>*,
so they can respond to questions/suggestions on the associated discussion
page.
2. *Request for comment on proposals for changing French overseas regions
reporting*. The COVID task force continues to discuss what to do about
these and other special territories that tend not to have reliable sources
for reporting case statistics. There is a proposal to either a) merge some
of the French overseas territories back with France and report a mix of
aggregate statistics for mainland + overseas France and separate stats on
overseas territories that do have a reliable source (following the
convention of ECDC, The New York Times, Bloomberg and the Berliner
Morgenpost), or b) keep the table as-is and note that there will be a
24-hour delay in reporting stats for these areas (because the only reliable
source is the daily WHO situation report). The full details of the
proposals are here
<https://en.wikipedia.org/wiki/Template_talk:2019%E2%80%9320_coronavirus_pan…>.
*Google team*, please let Mario (who started this discussion and offered
these two proposals) know if you have any thoughts on these potential
changes!
--
*Maryana Pinchuk* (she/her)
Senior Partnerships Manager
Wikimedia Foundation <https://wikimediafoundation.org/>
Hello,
For the worldwide table, it may be a good idea to link directly to the main
page for the table:
https://en.wikipedia.org/wiki/Template:2019%E2%80%9320_coronavirus_pandemic…
This page is transcluded to the "2019–20 coronavirus pandemic by country
and territory" article. We are receiving many user requests in the talk
page for the later article, but that is not watched by as many people and
it is not the best place to report issues.
Best,
Mario Gómez
Hello,
Listing criteria for countries and territories in the cases table could
change soon. The RfC did not start yet and proposals and feedback are
welcome:
https://en.wikipedia.org/wiki/Template_talk:2019%E2%80%9320_coronavirus_pan…
The number of affected territories should be less compared to the previous
RfC (late March).
Best,
Mario Gómez
==What is this list?==
This readout highlights the partnership between Google
<https://www.blog.google/products/search/connecting-people-covid-19-informat…>
and Wikipedia <https://en.wikipedia.org/wiki/Wikipedia:WikiProject_COVID-19>
on surfacing COVID-19 data. This digest aims to shed light on user trends
and data sources so that both Google and Wikipedia can more effectively
surface relevant COVID-19 information to users across the globe.
The information shared in this digest will be:
-
Updated on a weekly cadence
-
Populated/updated by relevant stakeholders by 6pm US PST every Thursday and
sent out Friday morning US PST.
Shared transparently with Google, Wikimedia Foundation, and Wikipedia’s
language communities.
==Google insights==
Top COVID-19 Segments (ranked by Google query volume):
1. Global
2. Top 20 Countries
3. US > State/Territory
4. US > City/Regions
5. Top 30 Countries > Region
6. US > Distribution By Age
7. US > State/Territory > Distribution By Age
8. Global > Distribution By Age
9. US > Distribution By Case Severity
10. US > State/Territory > Distribution By Case Severity
11. Global > Distribution By Cases Severity
12. Global > COVID-19 Policy Changes By Country
==Google roadmap==
Current and planned Wikipedia pages Google is using in the stats card
feature:
Statistic Segment
Description
Status
Source URL(s)
Notes / Feedback
Global (Total)
Total Cases, Total Recoveries, Total Deaths
Live
https://en.wikipedia.org/wiki/2019%E2%80%9320_coronavirus_pandemic_by_count…
Country (Total)
Total Cases, Total Recoveries, Total Deaths
Live
https://en.wikipedia.org/wiki/2019%E2%80%9320_coronavirus_pandemic_by_count…
US > State/ Territory (Total)
Total Cases, Total Recoveries, Total Deaths
Live
https://en.wikipedia.org/wiki/2020_coronavirus_pandemic_in_the_United_States
Global (Daily)
Daily Cases, Total Recoveries, Total Deaths
Live
https://en.wikipedia.org/wiki/2019%E2%80%9320_coronavirus_pandemic_cases/WH…https://en.wikipedia.org/wiki/2019%E2%80%9320_coronavirus_pandemic_deaths/W…
Missing daily recoveries numbers
Country (Daily)
Daily Cases, Total Recoveries, Total Deaths
In Progress (Apr-05)
https://en.wikipedia.org/wiki/2019%E2%80%9320_coronavirus_pandemic_by_count…
Pages linked from the main page table
Coverage is not perfect (not available in all countries). Google is hoping
to get info on at least the top 50 countries.
US > State/ Territory (Daily)
Daily Cases, Total Recoveries, Total Deaths
In Progress (Apr-05)
https://en.wikipedia.org/wiki/Template:2019%E2%80%9320_coronavirus_pandemic…
Pages linked from the main page table
Country > Testing
Total Tests, Positive Tests, Tests/Million People, Positive/Thousand Tests
In Progress (Apr-05)
https://en.wikipedia.org/wiki/COVID-19_testing
Information is currently only a snapshot view. Google is still looking for
tests over time.
Global > Age, Gender, Severity
Distribution of cases by age, gender and severity
Planned (Apr-12)
Missing data
Information is limited to a small number of countries. Google is hoping to
get this information for all countries.
Country > Age, Gender, Severity
Distribution of cases by age, gender and severity
Planned (Apr-12)
ttps://en.wikipedia.org/wiki/2019%E2%80%9320_coronavirus_pandemic_by_countr…
<https://en.wikipedia.org/wiki/2019%E2%80%9320_coronavirus_pandemic_by_count…>
Pages linked from the main page table
Information is limited to a small number of countries. Google is hoping to
get this information for all countries.
US > State/ Territory > Age, Gender, Severity
Distribution of cases by age, gender and severity
Planned (Apr-12)
https://en.wikipedia.org/wiki/Template:2019%E2%80%9320_coronavirus_pandemic…
Pages linked from the main page table
==Wikimedia Foundation updates==
- WMF Tech and Research teams are tracking all Mediawiki-related
COVID-related bug reports and feature requests in Phabricator (with the
covid-19 tag): https://phabricator.wikimedia.org/project/view/4648/ –
anyone can submit issues and requests to this tag (or ask a WMF staffer for
help if you're unfamiliar with Phabricator).
==Feedback==
Any additional feedback (ie: data latency issues or inaccuracies on
Google’s reporting of COVID-19 data).
- On March 31, Wikipedians noticed that Greenland was not highlighted on
the Google map visualization as having any cases, likely because Greenland
and Denmark were not separated in the 2019–20_coronavirus_pandemic_data
template – this appears to have been fixed.
--
*Maryana Pinchuk* (she/her)
Senior Partnerships Manager
Wikimedia Foundation <https://wikimediafoundation.org/>