summaryrefslogtreecommitdiff
path: root/doc/bugs/UTF-8_BOM_showing_up_inside_a_page__63__.mdwn
blob: 6f1dc45033ed9cad2b1838303e9bbf1f3a97f338 (plain)

I have a git-backed ikiwiki install, and when I commit and push a file from an x86 host (LANG=en_US.UTF-8) to the Ikiwiki box, which is Debian GNU/Linux on Sparc, I sometimes get unusual characters (ef bb ff) before the first character of the wiki text. It seems that this is a UTF-8 "byte order mark" that is getting inserted automatically into the .wiki file by my editor: http://vim.wikia.com/wiki/VimTip246#Tip:.23246-_Working_with_Unicode

Example:

    http://monkey.linuxworld.com/lwce-2007/

Is there any way for ikiwiki to spot when .wiki files have this BOM and deal with it, or should I make sure to strip it out before committing?

It would be easy to make ikiwiki strip out the BOM. For example, a simple plugin could be written to s/// them out as a filter.

I'm unsure if ikiwiki should do this by default. --[[Joey]]

Looked at this some more. It seems this would be a browser bug, after all, it's not displaying the BOM properly (as a zero-width character). To test, I've added a BOM to this file. --[[Joey]]