Some full-width Unicode characters should be retained

Some full-width Unicode characters should be retained in the HTML output. A typical case is the ideographic space (U+3000): CJK language users use it instead of the ASCII space as the initial indentation character for a paragraph.
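
To illustrate the requested behaviour, here is a minimal Python sketch. It assumes the characters are lost during a whitespace-normalization step, which may not be where the actual tool drops them. The function name `collapse_ascii_whitespace` is hypothetical and not part of any existing API. The sketch collapses only the ASCII whitespace characters that HTML itself treats as whitespace, so U+3000 passes through unchanged.

```python
import re

# ASCII whitespace as HTML defines it: space, tab, LF, FF, CR.
# U+3000 is deliberately NOT in this set.
_ASCII_WS = re.compile(r"[ \t\n\r\f]+")
_ASCII_WS_CHARS = " \t\n\r\f"

IDEOGRAPHIC_SPACE = "\u3000"


def collapse_ascii_whitespace(text: str) -> str:
    """Collapse runs of ASCII whitespace to a single space and trim the ends,
    leaving full-width characters such as U+3000 untouched.

    Hypothetical helper for illustration only.
    """
    collapsed = _ASCII_WS.sub(" ", text)
    # str.strip() with no argument would also remove U+3000,
    # so the characters to strip are listed explicitly.
    return collapsed.strip(_ASCII_WS_CHARS)


# A paragraph indented with two ideographic spaces, followed by stray ASCII whitespace.
paragraph = IDEOGRAPHIC_SPACE * 2 + "這是一個段落。  \n"

cleaned = collapse_ascii_whitespace(paragraph)

# For contrast: Unicode-aware splitting treats U+3000 as whitespace and drops the indent.
naive = " ".join(paragraph.split())
```

In this sketch `cleaned` still begins with the two ideographic spaces, while `naive` has lost them. The same caveat applies to `\s` in Python's `re` module, which also matches U+3000 for `str` patterns.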