[Wiki-research-l] Re: Wikipedia search sltr model

19 May 2022

Hi!

Oh, that was very kind of you, thanks a lot. The format is indeed self-explanatory, so it
should not be a problem. However, I was looking at the feature set and would just like to
confirm: is each of these fields computed at query time, with the provided query_string,
for each of the top 448 results after the first rescore? (In any case, this is what the
docs of the LTR plugin for Elasticsearch suggest.)
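
For concreteness, the rescore pattern described in the LTR plugin docs can be sketched as
below. This is only an illustration under assumptions, not the actual CirrusSearch request:
the base query on "title", the model name "hypothetical_model" and the helper name
build_sltr_rescore are made up here, and the parameter name query_string is assumed from
the wording above.

```python
import json


def build_sltr_rescore(query_string, model_name, window_size=448):
    """Build a search body that rescores the top hits with a stored LTR model.

    Hypothetical helper: the base query and the model name are placeholders.
    """
    return {
        # First-pass retrieval (assumption: a plain match query on "title").
        "query": {"match": {"title": query_string}},
        "rescore": {
            # Only the top `window_size` hits are passed to the rescore query.
            "window_size": window_size,
            "query": {
                "rescore_query": {
                    # The sltr query evaluates the model's features for each
                    # hit in the window, using the params supplied here.
                    "sltr": {
                        "params": {"query_string": query_string},
                        "model": model_name,
                    }
                }
            },
        },
    }


body = build_sltr_rescore("albert einstein", "hypothetical_model")
print(json.dumps(body, indent=2))
```

Note that in Elasticsearch the rescore window_size applies per shard, so the number of
documents actually scored by the model can exceed 448 on a multi-shard index.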

If so, it would already be useful for me to have the actual mapping of a Wikipedia page in
Elasticsearch (the definitions of fields such as “title” or “opening_text” are more or less
evident, but not so much those of “all_near_match” or “file_text.plain”). Is that available
anywhere?

Thank you very much again and have a nice day!

Aitor

  On 18 May 2022, at 23:17, ebernhardson(a)wikimedia.org wrote:

 Hi!

 These models have never been published, but not for any particular reason. I suppose
no-one had ever asked about them. I copied the current models out of elasticsearch into
https://people.wikimedia.org/~ebernhardson/cirrus_models.20220518/ if looking them over
might help you. They are in the format the sltr plugin stores them, which seems useful as
it includes both the feature definitions and the xgboost model in JSON.

 Erik B.
 _______________________________________________
 Wiki-research-l mailing list -- wiki-research-l(a)lists.wikimedia.org
 To unsubscribe send an email to wiki-research-l-leave(a)lists.wikimedia.org 
