Technical
Technical in-depth articles and information about search engines and directories. Find resources on theme search engines, term vector technology, and authority hub status sites. Details on the usage of query strings and web site and web page cloaking.

|
| Efficient Crawling Through URL OrderingThis paper studies in what order a crawler should visit the URLs it has seen, in order to obtain more "important" pages first. Obtaining important pages rapidly can be very useful when a crawler cannot visit the entire Web in a reasonable amount of time. |
|
|
| The Anatomy of a Search EngineThe Anatomy of a Large-Scale Hypertextual Web Search Engine research paper written by Sergey Brin and Lawrence Page founders of Google. In this paper they present Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext. |
|
|
| Efficient Computation of PageRankPaper discusses efficient techniques for computing PageRank, a ranking metric for hypertext documents. Several methods are discussed analyzing the convergence of PageRank based on induced ordering of the pages. |
|
|
| Cornell News: HITS web searchA Cornell University computer scientist has developed a new method of searching the World Wide Web that uses the way sites are linked to one another, rather than their text content, to find the most valuable sites on a given topic. |
|
|
| Hilltop: A Search Engine based on Expert DocumentsIn this paper we propose a new approach to authoritative ranking, which we call Hilltop. Our approach is based on the same assumptions as the other connectivity algorithms, namely that the number and quality of the sources referring to a page are a good measure of the page's quality. |
|
|
| How Search Engines Rank Web PagesExplains how search engines rank web pages by determining relevancy though analyzing keyword location, frequency and through other methods. |
|

|
| Web Robots PagesWeb Robots FAQ's, Robots Exclusion, A list of Robots, Robots Mailing List, Articles and papers, and related sites. |
|
|
| Apache 1.3 URL Rewriting GuideThis document supplements the mod_rewrite reference documentation. It describes how one can use Apache's mod_rewrite to solve typical URL-based problems webmasters are usually confronted with in practice. |
|
|
| Apache module mod_rewriteThis module uses a rule-based rewriting engine (based on a regular-expression parser) to rewrite requested URLs on the fly. |
|
|
| Jon Kleinberg's HomepageJon Kleinberg's researches algorithms which exploit the combinatorial structure of networks and information. This includes techniques for analyzing and modeling link structure in the World Wide Web. |
|