[wp-hackers] Overriding get_posts() behaviour
ml_wordpress at copperleaf.org
Sun Jul 3 12:48:16 GMT 2005
I've also taken a look at Denis' plugin and have a few ideas that maybe
you guys could add. I've modified Denis' plugin on my testbed just for
fun and could send you the code if you wish. Here is a list of ideas:
1) Probably the most useful change for me was adding a filter to the
sem_search_index function so that other plugins can append their own
words to the node_content column. This could actually be a foundation
filter for the plugin, in that all search words could be added that
way: the_content, the_title, post_tags, and, in my case, data from
columns in new tables.
2) I found that with a fulltext search, if you search on 'bean', it
won't match 'beans'. I don't know if there is something in the
fulltext search that allows you to do 'like' queries. If not, maybe
that could be either a global option or something selected from an
advanced search page that lets you choose between fulltext searches and
like searches. (BTW Denis, some hosts default to InnoDB tables, so in
your create table statement you need to specify ENGINE=MyISAM so that
fulltext indexes can be created.)
3) I added some code that cleans out all funky characters, removes
all duplicate words and collapses all whitespace in the node_content
column. This can reduce the size of the field significantly, and
removing the dups is nice if you aren't doing weighted searches.
Something else to consider would be to remove all stopwords.
(Configurable from an admin page?)
4) One last idea is that perhaps an option could be to store the soundex
(or some other algorithm) for the word list so that searches are done on
that instead of the actual word.
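Ideas 3) and 4) could be sketched roughly as follows. This is only an
illustration in Python, not the modified plugin code (which is PHP). The
names clean_node_content, soundex and STOPWORDS are hypothetical, the
stopword list is a placeholder, and the soundex function follows the
common American Soundex rules rather than whatever algorithm the plugin
might end up using.

```python
import re

# Placeholder stopword list (an assumption); a real one would be configurable.
STOPWORDS = {"a", "an", "and", "the", "of", "to", "in", "is"}


def clean_node_content(text, stopwords=STOPWORDS):
    """Lowercase the text, replace non-alphanumeric runs with a space,
    then drop stopwords and duplicate words while keeping first-seen order."""
    words = re.sub(r"[^a-z0-9]+", " ", text.lower()).split()
    seen = set()
    kept = []
    for word in words:
        if word in stopwords or word in seen:
            continue
        seen.add(word)
        kept.append(word)
    return " ".join(kept)


# Letter-to-digit table for American Soundex.
_CODES = {}
for _digit, _letters in (("1", "bfpv"), ("2", "cgjkqsxz"), ("3", "dt"),
                         ("4", "l"), ("5", "mn"), ("6", "r")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word):
    """Return the 4-character American Soundex key for a word ('' if no letters)."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = _CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated code after it counts
    return (result + "000")[:4]
```

For example, clean_node_content("The beans, the BEANS & rice!!") gives
"beans rice", and soundex("Robert") and soundex("Rupert") both give
"R163". Note that plain Soundex still codes 'bean' and 'beans'
differently (B500 vs B520), so on its own it would not fix the plural
problem in 2).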
Anyway, like I said above, I'd be glad to send you guys my mods or I'd
be glad to help with parts of this if you wish.
Bill