I’m building a new Drupal site, and the content authors and editors ared dyed-in-the-wool Microsoft Word users. I’ve got WYSIWYG set up to force all pasting to be treated as "Paste from Word", so much of Word’s garbage seems to be handled well. The one remaining problem is characters like smart quotes, curly apostrophes, em-dashes, and the like. Although I’m shooting for end-to-end Unicode, and theoretically that should prevent those characters from showing up as weird garbage, I consider it safer to use HTML entities, in case at some future point someone else changes something that interferes with the end-to-end Unicode architecture.
Is there a way to convert non-ASCII characters into HTML entities, either when they are pasted or before they are displayed?
Converting to straight quotes (and using Typogrify to render them as smart quotes) would be fine too, if you can tell me how to convert to straight quotes. Preferably, other characters should be as well handled.