+discovery list.
On Tue, Dec 6, 2016 at 12:53 PM, Sumit Asthana <asthana.sumit23(a)gmail.com>
wrote:
> Hi,
>
> I was extracting the Wikipedia cirrus dump of articles using
> ?action=cirrusDump for feature extraction from articles and noticed two
> keys "score" and "popularity_score". Can anyone tell me what exactly these
> keys denote and how they're calculated?
>
> I'm curious to know the possible use cases of these scores in Machine
> Learning as I'm currently processing articles.
>
> --
> -Thanks,
> Sumit <http://mediawiki.org/wiki/User:Sumit.iitp>
>
> _______________________________________________
> AI mailing list
> AI(a)lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/ai
>
>
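For anyone following along, here is a minimal sketch of how the two keys mentioned above could be pulled out of a cirrus dump. It is only an illustration under assumptions: the en.wikipedia.org base URL, the User-Agent string, and the helper names cirrus_dump_url, find_keys and fetch_scores are made up for this example, and the search is recursive because the exact nesting of the keys in the dump is not stated in this thread.

```python
import json
from urllib.parse import quote
from urllib.request import Request, urlopen


def cirrus_dump_url(title, base="https://en.wikipedia.org/wiki/"):
    """Build a page URL with ?action=cirrusDump appended.

    The default base URL is an assumption; point it at the wiki you use.
    """
    return base + quote(title.replace(" ", "_")) + "?action=cirrusDump"


def find_keys(node, wanted):
    """Walk parsed JSON and return the first value found for each wanted key."""
    found = {}
    if isinstance(node, dict):
        for key, value in node.items():
            if key in wanted and key not in found:
                found[key] = value
            # Look deeper as well, without overwriting what we already have.
            for k, v in find_keys(value, wanted).items():
                found.setdefault(k, v)
    elif isinstance(node, list):
        for item in node:
            for k, v in find_keys(item, wanted).items():
                found.setdefault(k, v)
    return found


def fetch_scores(title):
    """Download the dump for one article and pick out the two keys."""
    request = Request(
        cirrus_dump_url(title),
        headers={"User-Agent": "cirrus-dump-example/0.1"},  # hypothetical UA
    )
    with urlopen(request) as response:
        dump = json.load(response)
    return find_keys(dump, {"score", "popularity_score"})
```

Calling fetch_scores("Albert Einstein") would need network access; whichever of the two keys is missing from the dump is simply left out of the returned dict.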
Thank you!!!
On Sun, Dec 4, 2016 at 3:59 AM, 陈新雄 <amiucxx(a)gmail.com> wrote:
> Hi all:
>
> Please find attached the THULAC documentation.
>
> If you have any questions, contact me ASAP.
>
> Regards,
> Xinxiong
>
> 2016-12-02 9:50 GMT+08:00 陈新雄 <amiucxx(a)gmail.com>:
>
>> Hi all,
>>
>> I'm Xinxiong Chen, a PhD student at Tsinghua University. I graduated
>> from the THU NLP&CSS lab (Tsinghua University Natural Language Processing and
>> Computational Social Science Lab) this July. I'm the main developer of
>> THULAC and I will translate THULAC's documentation into English and
>> offer technical support.
>>
>> I know from Chelsy that most of you are using Python, so I will
>> translate the documentation for the Python version.
>>
>> Chelsy and I were classmates in high school and have been friends ever
>> since, so don't hesitate to contact me if you have any questions.
>>
>> Regards,
>> Xinxiong
>>
>>
>>
>> 2016-12-02 9:16 GMT+08:00 Chelsy Xie <cxie(a)wikimedia.org>:
>>
>>> Hello everyone,
>>>
>>> I'm very happy to introduce you to Xinxiong Chen <amiucxx(a)gmail.com>,
>>> the main developer of THULAC <https://github.com/thunlp/THULAC-Python>
>>> and a CS PhD student at Tsinghua University. :)
>>>
>>> The Discovery team is looking for a new Chinese tokenizer and THULAC
>>> <https://github.com/thunlp/THULAC-Python> may be helpful. Here
>>> <https://github.com/thunlp/THULAC-Python#代表分词软件的性能对比> is a comparison
>>> between THULAC and other Chinese tokenizers (jieba, LTP-3.2.0 and ICTCLAS).
>>> It's very kind of Xinxiong to help us translate THULAC's documentation
>>> into English and offer technical support.
>>>
>>> Thank you very much Xinxiong!
>>>
>>> Cheers,
>>> Chelsy
>>>
>>>
>>
>>
>> --
>> 陈 新雄(Chen Xinxiong)
>>
>> Department of Computer Science and Technology
>> Tsinghua University
>>
>> Beijing 100084, China
>>
>
Hello everyone,
I'm very happy to introduce you to Xinxiong Chen <amiucxx(a)gmail.com>,
the main developer of THULAC <https://github.com/thunlp/THULAC-Python> and
a CS PhD student at Tsinghua University. :)
The Discovery team is looking for a new Chinese tokenizer and THULAC
<https://github.com/thunlp/THULAC-Python> may be helpful. Here
<https://github.com/thunlp/THULAC-Python#代表分词软件的性能对比> is a comparison
between THULAC and other Chinese tokenizers (jieba, LTP-3.2.0 and ICTCLAS).
It's very kind of Xinxiong to help us translate THULAC's documentation
into English and offer technical support.
Thank you very much Xinxiong!
Cheers,
Chelsy