Roundup Tracker - Issues

Message4036

Author jvstein
Recipients ThomasAH, ber, jvstein, olly
Date 2010-03-08.04:33:32
Message-id <1268022813.99.0.447206473855.issue2550583@psf.upfronthosting.co.za>
In-reply-to
I noticed that Xapian has some problems stemming uppercase strings.

>>> indexer = xapian.TermGenerator()
>>> stemmer = xapian.Stem("english")
>>> stemmer("SILENTLY")
'SILENTLi'
>>> stemmer("silently")
'silent'
>>> stemmer("organization")
'organ'
>>> stemmer("ORGANIZATION")
'ORGANIZATION'

This is probably contributing to the low search results. Patch is attached to switch the index 
to lowercase.
History
Date User Action Args
2010-03-08 04:33:33jvsteinsetmessageid: <1268022813.99.0.447206473855.issue2550583@psf.upfronthosting.co.za>
2010-03-08 04:33:33jvsteinsetrecipients: + jvstein, ber, ThomasAH, olly
2010-03-08 04:33:33jvsteinlinkissue2550583 messages
2010-03-08 04:33:33jvsteincreate