Roundup Tracker - Issues

Message5464

Author ollydbg
Recipients ThomasAH, ber, jerome, ollydbg, pefu, yanqian
Date 2016-02-24.06:36:54
Message-id <1456295815.83.0.289995605814.issue2550788@psf.upfronthosting.co.za>
In-reply-to
Hi, guys, I'm not sure what the exact "non-ascii" issue you have. For
me, the need is to just handle Chinese chars(also the English words is
still needed). I use the mmseg 1.3.0 : Python Package Index -
https://pypi.python.org/pypi/mmseg/1.3.0 to parse all the text, and
added the terms to the xapian's database. (Note that not only mmseg, but
other fxsjy/jieba: 结巴中文分词 - https://github.com/fxsjy/jieba should
works OK, because they just cut a long Chinese sentence to several
Chinese words)

Now, I can search the Chinese words correctly.

If you are interested to use mmseg, I can upload the patch against
roundup's source code.
History
Date User Action Args
2016-02-24 06:36:55ollydbgsetmessageid: <1456295815.83.0.289995605814.issue2550788@psf.upfronthosting.co.za>
2016-02-24 06:36:55ollydbgsetrecipients: + ollydbg, ber, pefu, ThomasAH, jerome, yanqian
2016-02-24 06:36:55ollydbglinkissue2550788 messages
2016-02-24 06:36:54ollydbgcreate