Message4036
I noticed that Xapian's English stemmer does not handle uppercase strings correctly:
>>> import xapian
>>> indexer = xapian.TermGenerator()
>>> stemmer = xapian.Stem("english")
>>> stemmer("SILENTLY")
'SILENTLi'
>>> stemmer("silently")
'silent'
>>> stemmer("organization")
'organ'
>>> stemmer("ORGANIZATION")
'ORGANIZATION'
Since uppercase words are stemmed to different terms than their lowercase forms, this is
probably contributing to the low number of search results. A patch is attached that switches
the index to lowercase.
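
As a rough illustration of the idea (not the attached patch itself), the sketch below lowercases each word before handing it to a stemmer. The names `toy_stem` and `index_terms` are hypothetical; `toy_stem` is only a stand-in that mimics the case sensitivity shown above, so the example runs without Xapian installed. With the real bindings, one would presumably pass a `xapian.Stem("english")` object instead.

```python
def toy_stem(word):
    """Hypothetical stand-in for a case-sensitive stemmer.

    Like the stemmer in the session above, it only recognises
    lowercase suffixes, so uppercase input passes through unstemmed.
    """
    for suffix in ("ly",):
        if word.endswith(suffix):
            return word[:-len(suffix)]
    return word


def index_terms(text, stem):
    """Lowercase each whitespace-separated word, then stem it."""
    return [stem(word.lower()) for word in text.split()]


# Without lowercasing, the uppercase word is left unstemmed.
print(toy_stem("SILENTLY"))

# With lowercasing, both spellings map to the same term.
print(index_terms("SILENTLY silently", toy_stem))
```

Queries would need the same lowercasing applied, otherwise an uppercase query term would still fail to match the lowercased index.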
Date | User | Action | Args
2010-03-08 04:33:33 | jvstein | set | messageid: <1268022813.99.0.447206473855.issue2550583@psf.upfronthosting.co.za>
2010-03-08 04:33:33 | jvstein | set | recipients: + jvstein, ber, ThomasAH, olly
2010-03-08 04:33:33 | jvstein | link | issue2550583 messages
2010-03-08 04:33:33 | jvstein | create |