I've got some more proposals on StackOverflow:

1. Decode into Unicode at ORM (HyperDB) level
2. Use https://pypi.python.org/pypi/unicode-nazi

As for demo data, I think it is possible to ask bugs.python.org
guys for a copy of database for testing.