Search I
Results for: Motorcycle emptyness
Not much yet...
Update: the following was written a few days before the first working version of the search was implemented on the site. The kind of implementation I finally went with is described here.
You have reached what is to become my search page. As everybody knows, when it comes to searching, Google does it best. This doesn't mean I'm not going to try to do it myself, just that I'll probably fail miserably.
So... what will we need for a proper full-text, page-ranked search? (Can you tell where I'm getting my inspiration from?)
- A way to access all* content on this site, or failing that, a bot to crawl it.
Now, the contents of my site are non-existent at this time, so this all depends on the decisions I'll be making later. However, some things to consider are:
- Crawling takes time, and the one good reason not to use Google with site:whiteboxcomputing.com is that Google will only crawl a site this size every few months or so. This means I should either crawl regularly myself, or do a crawl on demand.
- Crawling is the kick-ass way to do it and solves a simple security problem: a bot only sees the parts of the site that a user sees.
- Depending on the way the site data is accessed, a search page will need some kind of storage for the data retrieved.
- A ranking system, somehow incorporating keywords, user-clicks-from-the-search-page, and possibly server statistics
- A way to fool the ranking system, so that users searching for 'home' go to the home page, rather than any page referring to it
- Again, depending on the way the site data is accessed, some way to separate content from mark-up
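To make the last three items a bit more concrete, here is a minimal sketch in Python. It is an illustration under assumptions, not the implementation this site ended up with: the names `TextExtractor`, `strip_markup`, `search` and the `pinned` table are all made up for the example, ranking is plain keyword counting (no click or server statistics), and "fooling" the ranking is just a lookup table that forces a chosen page to the top for a given query.

```python
from collections import Counter
from html.parser import HTMLParser
import re


class TextExtractor(HTMLParser):
    """Collects text content, skipping the bodies of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def strip_markup(html):
    """Return the visible text of an HTML fragment with whitespace collapsed."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())


def tokenize(text):
    """Lower-case the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def search(query, pages, pinned=None):
    """Rank pages (a dict of url -> html) by how often the query words occur.

    pinned is a hypothetical override table mapping a query to the URL
    that should always come first, whatever the keyword score says.
    """
    pinned = pinned or {}
    terms = tokenize(query)
    scores = {}
    for url, html in pages.items():
        counts = Counter(tokenize(strip_markup(html)))
        score = sum(counts[term] for term in terms)
        if score:
            scores[url] = score
    # Highest score first; ties are broken alphabetically by URL.
    ranked = sorted(scores, key=lambda url: (-scores[url], url))
    forced = pinned.get(" ".join(terms))
    if forced is not None:
        ranked = [forced] + [url for url in ranked if url != forced]
    return ranked
```

With a pinned entry such as `{"home": "/"}`, a search for 'home' puts the home page first even when it never mentions the word itself, and the pages that merely link to it follow in keyword order.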
Anything else?
(*) except the stuff I don't want you to see
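The crawling option, including the footnote about hidden content, could look roughly like the Python sketch below. Everything in it is an assumption made for illustration: the `crawl` function, the `fetch` callback (which returns a page's HTML or `None`) and the `excluded_prefixes` parameter are invented names, and the returned dictionary stands in for whatever storage the search page would really use.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, fetch, excluded_prefixes=()):
    """Breadth-first crawl of a single site, following links like a visitor.

    fetch(url) must return the page's HTML as a string, or None when the
    page cannot be retrieved. Paths starting with one of excluded_prefixes
    are never visited. Returns a dict mapping url -> html.
    """
    site = urlparse(start_url).netloc
    queue = deque([start_url])
    seen = {start_url}
    pages = {}
    while queue:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        pages[url] = html
        extractor = LinkExtractor()
        extractor.feed(html)
        for href in extractor.links:
            # Resolve relative links and drop any #fragment part.
            target, _fragment = urldefrag(urljoin(url, href))
            parts = urlparse(target)
            if parts.netloc != site:
                continue  # stay on this site
            if any(parts.path.startswith(p) for p in excluded_prefixes):
                continue  # the stuff that should stay out of the index
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return pages
```

Passing `fetch` in as a parameter keeps the sketch independent of how pages are actually retrieved, so the same loop could serve a scheduled crawl or a crawl on demand.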
Jul 4th, 2008