summaryrefslogtreecommitdiff
path: root/doc/plugins/htmlscrubber.mdwn
blob: b651ffc99bb70a2d129c5158682c307034fac6c2 (plain)

[[!template id=plugin name=htmlscrubber core=1 author="[[Joey]]"]] [[!tag type/html]]

This plugin is enabled by default. It sanitizes the html on pages it renders to avoid XSS attacks and the like.

It excludes all html tags and attributes except for those that are whitelisted using the same lists as used by Mark Pilgrim's Universal Feed Parser, documented at http://feedparser.org/docs/html-sanitization.html. Notably it strips style and link tags, and the style attribute.

All attributes that can be used to specify an url are checked to make sure that the url is in a known, safe scheme, and to block embedded javascript in such urls.

It uses the [[!cpan HTML::Scrubber]] perl module to perform its html sanitisation, and this perl module also deals with various entity encoding tricks.

While I believe that this makes ikiwiki as resistant to malicious html content as anything else on the web, I cannot guarantee that it will actually protect every user of every browser from every browser security hole, badly designed feature, etc. I can provide NO WARRANTY, like it says in ikiwiki's [[GPL]] license.

The web's security model is fundamentally broken; ikiwiki's html sanitisation is only a patch on the underlying gaping hole that is your web browser.

Note that enabling or disabling the htmlscrubber plugin also affects some other HTML-related functionality, such as whether [[meta]] allows potentially unsafe HTML tags.


Some examples of embedded javascript that won't be let through when this plugin is active:

  • script tag test window.location='http://example.org';
  • CSS script test
  • entity-encoded CSS script test
  • entity-encoded CSS script test
  • click me