294102
This is a very usefull hack. However there is a few more feature I would find usefull.
You script cuts out short words like "the" "and" "to" but many words in english are longer than three letters but should also be cut out such as "from", "because", "also", and others. Would it be posible to create a data file of words to exclude. Or crossreferance the keywords with a dictionary file and only include nouns and verbs. Thus excluding all proverbs. I understand this is a bit more than just a hack and could be a leanthy mod. Plus it would have to be upadted with each fresh language supported. However it could be significanly benificial.
Sencondly: Would it be posible for someone out there to create a module that parsed the text on your pages a came back with a word count on each keyword used. This would be usefull to determine how well the search engines may rank you in thier listings. You would be able to alter the keyword density of your documents to optomise your ranking without the overkill of spamming keywords on pages, or the underkill of not enough keywords. I have found problems in underkill before. I once had a web design company site which ranked better under aromatherapy than under web design. This was because one of the clients was an alternative medicine group and so lots of my pages were discussing the clients site and not mine. Indeed at one point my web design site ranked higher for aroatherapy that the aromatherapy site itself. A mod like that discribed above would help predict how search engines will rank your pages before you submit them so you can correct things before it's to late. It's hard enough getting listed in search engines without worrying about listing incorectly.