Googlebot gives the indexer everything of the pages it finds. These pages are stored in Google indexer's index database. This index is sorted alphabetically by search term. All of the index entries store a list of documents and webpages in which the term appears and the exact location within the page where it occurs. This kind of storing of data allows to fetch information from this indexer at faster rate and can answer user queries more frequently.
To improve quality of search performance and search results it displays, Google ignores common words called stop words from indexing. This stop words includes the, is, on, or, of, how, why, etc. This words can be safely ignored and doesn't affect displayed results in any sense. Indexer also converts everything into lowercase for convenience.
As we all know that Googlebot is Google’s web crawler, which finds and retrieves pages on the web (internet) and hands them off to the Google indexer which than sorts every word on every page and stores the resulting index of words in a huge database. It’s easy to imagine Googlebot as a little spider crawling across the internet, but in reality Googlebot doesn’t traverse the web at all. It functions much like your web browser we use for surfing the internet. First sending a request to a web server for a web page it wants to crawl and then downloading the entire page and handing it off to Google’s indexer for processing.
Googlebot consists of many computers requesting and fetching pages much more quickly than we can with our common web browser available. In fact, Googlebot can request thousands of different pages at the same time.
Googlebot uses two main methods to find pages on the internet:
1. through an add URL form >> www.google.com/addurl.html
2. through finding links by crawling the web.
Search engine of Google runs on a distributed network of thousands of low-cost computers and can therefore carry out fast parallel processing. Googlebot is the first and most important step in this complex processing of data and indexing. In this method parallel processors are used. Parallel processing is a method of computation in which many calculations can be performed at the same time while significantly speeding up data processing. Google has three distinct parts:
1 Googlebot, a web crawler that finds and fetches web pages all across the web (internet).
2 The google indexer that sorts every word on every page crawled by googlebot and stores the resulting index of words in a huge database of its own.
3 The query processor, which compares your search query to the information and data stored inside indexer and recommends the documents that it considers most relevant.
Earthtimes.org Google Applies For 'GPay' Mobile Payments Patent InformationWeek, NY - 1 hour ago By Richard Martin Adding to its rapidly growing suite of mobile applications and services, Google has applied for a patent for a mobile payments service ...