Custom Search Engine [Not Google CSE]...

vrufusx65v

New Member
Messages
74
Reaction score
0
Points
0
What's the best way to create a Custom Search Engine for your website? I've tried google CSE and im not really liking it. I have been over to Net.Tuts and their search engine is basically what i would like to achieve in creating and developing. any ideas at all?
 

learning_brain

New Member
Messages
206
Reaction score
1
Points
0
What sort of search engine do you want? A full text? Image or something more specific?

I have a search engine which isn't reliant on google and have numerous pages.

1) A crawler - this opens up URL's and searches for <img> tags and then saves to an index which can be searched using fulltext.

2) A URL queue - the crawler is split into twp parts - a content search and a URL search. If all content is exhausted, it will go to the next available URL in the queue.

3) A search page - pretty basic - searches through the index.

4) A URL check - a page that continuously loops through the index to ensure they are still there and that the content is the same.

5) A few other maintenace pages...

It depends on what you want to achieve. Don't ever expect to get to the standard of google or any other major search engine becuase you just wont have the processing power. Try to get a Unique feature in that really appeals.
 

vrufusx65v

New Member
Messages
74
Reaction score
0
Points
0
well, from the descriptions you've given me i'd say either a URL queue or a search page. i believe a search page is what net.tuts uses, cause you type in what you want to search and then the page refreshes with the results, its the same page, the only thing that changes is the URL says (http://net.tutsplus.com/?s=search+engine). to me, that looks like a search page, and if it is, then that's what i'd like to develop...
 
Last edited:

vrufusx65v

New Member
Messages
74
Reaction score
0
Points
0
thanks for the info, i'll check out Sphinx and see what i come up with... =)
 

learning_brain

New Member
Messages
206
Reaction score
1
Points
0
If it's just based on internal textual content, Sphinx is easily the best solution. I have tried it in the past and it works great with customisable output pages as well.
 
Top