Avoid %links accumulating duplicates. (For TOVA) - ikiwiki-upstream - Unnamed repository; edit this file to name it for gitweb.

diff options

author	Joey Hess <joey@gnu.kitenet.net>	2009-05-05 23:40:09 -0400
committer	Joey Hess <joey@gnu.kitenet.net>	2009-05-06 00:27:24 -0400
commit	2a7721febd6cac1af5e7f4b4949ffe066c62c837 (patch)
tree	c0e488da71e36ce1842e2553e2cf683e49d15676 /t/html.t
parent	1c7c9e95f227a3ff7906c000ec15bb163edc463f (diff)

Avoid %links accumulating duplicates. (For TOVA)

This is sorta an optimisation, and sorta a bug fix. In one test case I have available, it can speed a page build up from 3 minutes to 3 seconds. The root of the problem is that $links{$page} contains arrays of links, rather than hashes of links. And when a link is found, it is just pushed onto the array, without checking for dups. Now, the array is emptied before scanning a page, so there should not be a lot of opportunity for lots of duplicate links to pile up in it. But, in some cases, they can, and if there are hundreds of duplicate links in the array, then scanning it for matching links, as match_link and some other code does, becomes much more expensive than it needs to be. Perhaps the real right fix would be to change the data structure to a hash. But, the list of links is never accessed like that, you always want to iterate through it. I also looked at deduping the list in saveindex, but that does a lot of unnecessary work, and doesn't completly solve the problem. So, finally, I decided to add an add_link function that handles deduping, and make ikiwiki-transition remove the old dup links.

Diffstat (limited to 't/html.t')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: