Requires Free Membership to View
A technique known as "document ranking" is being employed by current search engines. Using document ranking, documents are given scores or "rankings" in order to arrange the documents according to how relevant their content is to the search key.
Current research trends are looking into categorizing retrieved data using taxonomies. A taxonomy is a semantic-based categorization of data or services. A categorization approach usually relies on human judgment when categorization is done.
Another project is the XML-based Resource Description Framework, developed by the W3C. RDF seeks to empower search engines by standardizing Web-based metadata. To properly integrate a search engine into a J2EE environment, a number of patterns should be used including:
Model-View-Controller
Command
DAO
Session Facade
Page-by-page Iterator
Front Controller
Factory
These patterns should be employed across a four-tier architecture as depicted here.
For search engine source code, check out the FreeWais project, or the Jakarta Lucene text search engine or Jcorporate's eSearch.
SearchKeyPro is a commercial search engine written in Java by Astaware.
This was first published in June 2002

Join the conversationComment
Share
Comments
Results
Contribute to the conversation