A metasearch engine is an Internet search system that supports unified access to multiple search engines through one search query.
A search engine visits billions of websites, using an automated program known as a spider. Information collected by these spiders is stored in a database known as an index. When a user enters a search query into a search engine, such as Google or Bing, an algorithm will determine which of the pages contained in its index are the most relevant to the search terms used in the query. Web pages that have not been included in the search engine's index will not be available in its search engine results.
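The relationship between an index and a query can be sketched in a few lines of Python. This is a toy illustration only: the page URLs and text are made up, and the ranking rule (count how many query words a page contains) is an assumption chosen for simplicity, not the algorithm of any real search engine.

```python
from collections import defaultdict

def build_index(pages):
    """Map each word to the set of page URLs that contain it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in text.lower().split():
            index[word].add(url)
    return index

def search(index, query):
    """Rank indexed pages by how many of the query's words they contain."""
    hits = defaultdict(int)
    for word in set(query.lower().split()):
        for url in index.get(word, ()):
            hits[url] += 1
    # Highest match count first; ties broken alphabetically by URL.
    return sorted(hits, key=lambda u: (-hits[u], u))

# Hypothetical pages standing in for what a spider has collected.
pages = {
    "https://example.com/cats": "cats are small mammals",
    "https://example.com/dogs": "dogs are loyal mammals",
}
index = build_index(pages)
results = search(index, "small mammals")
```

A page that was never passed to `build_index` can never appear in `results`, which mirrors the point that pages missing from the index are missing from the results.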
Metasearch engines don't send their own spiders out to search the Web, and they don't have an index of their own to draw from. Instead, a metasearch engine pulls results from a number of search engines and then applies its own algorithm to reorder them. To visitors, a search engine site and a metasearch site will most often look the same: generally a search field in the center of a large white space. Technically, a metasearch engine is not a search engine, but a web portal that aggregates search engine results through a proprietary algorithm.
The same search query will produce different results from one search engine to another. This is because they are using different indexes and each has its own proprietary algorithm. When the same search query is made on two different metasearch engines, the results will also differ because, of course, they each use their own proprietary algorithms, but also because the two metasearch engines may use data from different search engines.
When a user enters a search term into the search field of a metasearch engine and begins a search, the engine will send the request to multiple search engines. A metasearch engine is under contract to pay the search engines whose results it uses, and its reach depends on which ones those are: a metasearch engine that uses both Google and Bing will have access to a much larger pool of data than one that uses data from smaller search engines.
After the metasearch engine sends the query to the search engines that it has access to, its server will wait for the responses from each requested search engine before displaying search results to the user. In some metasearch engines, the results pages continue to be updated as results from other search engines come in.
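The fan-out step described above can be sketched as follows. This is a minimal illustration under stated assumptions: `engine_a` and `engine_b` are hypothetical stand-ins that return made-up URLs, where a real metasearch engine would call each provider's service, and the thread pool is just one possible way to query the sources concurrently.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
import time

# Hypothetical stand-ins for real search engine backends.
def engine_a(query):
    time.sleep(0.02)  # simulate network latency
    return [f"https://a.example/{query}/1", f"https://a.example/{query}/2"]

def engine_b(query):
    time.sleep(0.01)
    return [f"https://b.example/{query}/1"]

ENGINES = {"engine_a": engine_a, "engine_b": engine_b}

def fan_out(query, engines=ENGINES, timeout=5.0):
    """Send one query to every engine concurrently and collect each response."""
    results = {}
    with ThreadPoolExecutor(max_workers=len(engines)) as pool:
        futures = {pool.submit(fn, query): name for name, fn in engines.items()}
        # as_completed yields each response as soon as it arrives.
        for future in as_completed(futures, timeout=timeout):
            name = futures[future]
            try:
                results[name] = future.result()
            except Exception:
                results[name] = []  # a failed engine contributes nothing
    return results

responses = fan_out("metasearch")
```

Because `as_completed` hands back responses in arrival order, a results page that keeps updating as more engines respond could redraw inside that loop instead of waiting for the full dictionary.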
The order of the results that are displayed is determined by the metasearch engine's individual algorithm. Results may be compiled based, in part, on the popularity of the requested search engines. Thus, Google results may be given precedence over those of a lesser-known engine. Most metasearch engines also filter out duplicates so that the same URL does not appear twice in the results that are displayed.
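One way to picture the merging step is the sketch below. Actual metasearch algorithms are proprietary, so everything here is an assumption made for illustration: the engine names, the weights, the scoring rule (each URL earns its engine's weight divided by its rank), and the crude trailing-slash normalization used to spot duplicates.

```python
def merge_results(responses, weights):
    """Merge ranked URL lists from several engines into one ranked list.

    Each URL earns weight / rank from every engine that returned it, so a
    URL ranked highly by a heavily weighted engine, or returned by several
    engines, rises. Keeping one score per URL also removes duplicates.
    """
    scores = {}
    for engine, urls in responses.items():
        weight = weights.get(engine, 1.0)
        for rank, url in enumerate(urls, start=1):
            key = url.rstrip("/")  # crude normalization, an assumption
            scores[key] = scores.get(key, 0.0) + weight / rank
    # Highest score first; ties broken alphabetically by URL.
    return sorted(scores, key=lambda u: (-scores[u], u))

# Hypothetical responses from a popular engine and a lesser-known one.
responses = {
    "big_engine": ["https://example.com/a", "https://example.com/b"],
    "small_engine": ["https://example.com/b/", "https://example.com/c"],
}
weights = {"big_engine": 3.0, "small_engine": 1.0}
merged = merge_results(responses, weights)
```

With these numbers, the page that only the popular engine ranked first still leads, the page returned by both engines appears once in second place, and the page known only to the smaller engine comes last.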
Some metasearch engines allow the user to select the search engines to be used for their search.
Some search engines maintain their own indexes as well as incorporating results from other sources. For example, Yahoo has its own search engine spider and maintains its own index, but it also uses results from Bing. Prior to entering into a contract with Bing, it used Google results for a time. For the sake of categorization, the Aviva Directory will be listing them as search engines, not metasearch engines.
Because even the largest of search engines have indexed only a small portion of the Web, a metasearch engine will provide access to a larger portion of the Internet than a search engine, as it can be assumed that individual search engines have indexed different parts of the Web, although there will be many similarities in results.
We might assume that, because metasearch engines get their results from multiple search engines, the results they provide will be more helpful than those of any one individual search engine. In reality, it's a matter of personal preference.
If each individual search engine indexed a completely different part of the Web, then a metasearch engine that is able to bring them all together would be a better choice than any one search engine. In reality, it seems that there are more similarities than there are differences in the content provided in the indexes of the various search engines. Because there are some differences, you might find that one search engine will be more efficient for a particular search while another might yield better results in another type of search.
The differences between individual search engines and metasearch engines probably have more to do with the algorithms they use to order their results than with the pages that are included in their indexes.
In the early 2000s, the consolidation of the search engine market resulted in several once popular search engines being converted into metasearch engines, and some of these have gone back and forth a few times and may continue to do so.
 
 
Recommended Resources
Implemented in 2001 and distributed under the BSD license, Carrot2 is an open-source results-clustering metasearch engine. It draws its data from Bing, Apache Lucene, OpenSearch, PubMed, Apache Solr, the eTools metasearch engine, and generic XML files. Results are categorized on the fly into topical categories, which may be displayed in folders, a circular graph, or in what it refers to as a foam tree.
http://search.carrot2.org/
Beginning operations in 1996, Dogpile is a metasearch engine that utilizes data from Google, Yahoo, Bing, and other sources. Preference settings allow users to customize their searches, and suggested spellings are offered for words that may have been misspelled. Besides the Web, Dogpile includes options to search images, videos, news, and shopping. The engine will display up to fifteen recent searches, but the list resets when the browser is closed.
https://www.dogpile.com/
eTools is a Swiss-based international metasearch engine that uses data from Ask, Base, Bing, DuckDuckGo, Exalead, Faroo, Fastbot, Google, Mojeek, Moose, Search, Tiger, Wikipedia, Yahoo, and Yandex. In response to a user’s query, eTools will simultaneously query each of these data sources, then merge the results before returning them to the user. The process is described on its site, along with press reports. Support is also included for mobile devices.
https://www.etools.ch/
Begun in 1995 and operated by InfoSpace, Excite is an Internet portal that includes web-based email, news, and a customizable user homepage, as well as a metasearch engine. In the mid- to late 1990s, Excite was one of the most visited pages on the Web. Users have the option to search the Web, images, video, or news. Like most search engines, Excite displays ads at the top of its results pages, but it also may include ads in the middle and at the end.
http://www.excite.com/
Online since 2004 and owned by InfoSpace, Info.com is a metasearch engine that uses Google, Yahoo, Ask, Teoma, AlltheWeb, Inktomi, Yandex, Kanoodle, LookSmart, About.com, and other search engines and directories. Users can choose to search the Web, images, videos, or news. Ads are displayed above and below native page results, and suggested searches are listed to the side. Its End User License Agreements are published on the site.
http://www.info.com/
Founded in 1999, Monster Crawler is a metasearch engine that returns results from Yahoo Search, Google, Bing, and Ask, eliminates the duplicates, and delivers results according to its own algorithm. Acquired by Reverse Media Group in 2016, the search engine provides multimedia results, including images, audio, video, news, and local information, as well as Web results. Paid ads are displayed on top of and beneath native search results.
http://www.monstercrawler.com/
Presearch is a decentralized metasearch engine that operates through a distributed network of community-operated nodes that aggregate results from various sources. It promises search choices, quality results, privacy, and rewards for those who want to end the search monopoly, offering tokens based on usage, promotion of, and contribution to the platform. The platform is powered by thousands of user computers in over 100 countries. It plans to layer its own index on top of the platform.
https://presearch.com/
Intended for students and researchers, RefSeek searches more than five billion documents, including web pages, journals, newspapers, encyclopedias, and books, offering relevant information without the information overload presented by general search engines. Users can choose to search its index of the Web or to search documents only. A categorized directory is also featured, although there is no facility for submitting new sites for consideration.
https://www.refseek.com/
Established in 1995, WebCrawler is the oldest surviving search engine on the Web, and the first to provide full-text search. Now owned by InfoSpace, the search engine operated as a metasearch engine for several years, then began operating as an independent search engine with its own index. Users have the option of searching the Web, images, video, or news. Ads are presented at the top and at the bottom of native results, and suggested search terms are on the side.
http://www.webcrawler.com/