Author rouilj
Recipients ber, marlowa, rouilj
Date 2017-10-14.14:12:01
Committed a first pass at this in rev e20f472fde7d.

Commit hg5306:91354bf0b683 fixed a bug found after looking at code
coverage and testing some missed code paths. In addition, hg5307:5b4931cfc182
added a test for the entity conversion code path in the dehtml routine.
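
For anyone unfamiliar with what an entity conversion test exercises, here is
a rough sketch. It is not the code from the commits above: the function name
dehtml_sketch and the regex-based tag stripping are assumptions made up for
illustration, using only the Python 3 standard library.

```python
import re
from html import unescape


def dehtml_sketch(html_text):
    """Hypothetical stand-in for a dehtml routine (not the tracker's code)."""
    # Drop script and style blocks entirely, contents included.
    text = re.sub(r"(?is)<(script|style)\b.*?</\1\s*>", "", html_text)
    # Replace the remaining tags with a space so adjacent words stay separate.
    text = re.sub(r"(?s)<[^>]+>", " ", text)
    # Convert entities only after the tags are gone, so that text such as
    # "&lt;today&gt;" is not mistaken for a tag and stripped.
    text = unescape(text)
    # Collapse runs of whitespace left behind by the removed markup.
    return " ".join(text.split())


# The kind of check an entity conversion test might make.
assert dehtml_sketch("<p>Fish &amp; chips &lt;today&gt;</p>") == "Fish & chips <today>"
```

The order of the steps is the point here: converting entities before
stripping tags would turn escaped angle brackets into something that looks
like markup.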

Using Beautiful Soup 4 is enabled, but I couldn't develop tests for it,
so your mileage may vary.
