Search Engines Essay, Research Paper
There are currently over a billion pages of information on the Internet about every topic imaginable. The question is how can you possibly find what you want? Computer algorithms can be written to search the Internet but most are not practical because they must sacrifice precision for coverage. However, a few engines have found interesting ways of providing high quality information quickly. Page value ranking, topic-specific searches, and Meta search engines are three of the most popular because they work smarter not harder.
While no commercial search engine will make public their algorithm, the basic structure can be inferred by testing the results. The reason for this is because there would be a thousand imitation sites, meaning little or no profit for the developers. The most primitive of searches is the sequential search, which goes through every item in the list one at a time. Yet the sheer size of the web immediately rules out this possibility. While sequential might return the best results, you would most likely never see any results because of the web?s inflammatory growth rate. Even the fastest computers would take a long time, and in that time, all kinds of new pages will have been created.
Some of the older ?spiders? like Alta Vista are designed to literally roam randomly through the web using links to other pages. This is accomplished with high-speed servers with 300 connections open at one time. These web ?spiders? are content based which means they actually read and categorize the HTML on every page. One flaw of this is the verbal-disagreement problem where you have a particular word that can describe two different concepts. Type a few words in the query and you will be lucky if you can find anything relates to what you are looking for. The query words can be anywhere in a page and they are likely to be taken out of context.
Content-based searches can also be easily manipulates. Some tactics are very deceptive, for example ??some automobile web sites have stooped to writing ?Buy This Car? dozens of times in hidden fonts?a subliminal version of listing AAAA Autos in the Yellow Pages?(1). The truth is that one would never know if a site was doing this unless you looked at the code and most consumers do not look at the code. A less subtle tactic is to pay to get to the top. For example, the engine GoTo accepts payment from those who wish to be at the top of a results list because the sites at the top will get more traffic.
Lawrence Page and Sergey Brin of Google have come up with a different idea for searching called PageRank. They realized that the most popular sites are those linked the most in other pages. Here is the pseudocode algorithm for searching:
1. Parse the query
2. Convert the words into wordIDs.
3. Seek to the start of the doclist in the short barrel for every word.
4. Scan through the doclists until there is a document that matches all the search terms.
5. Compute the rank of that document for the query.
6. If we are in the short barrels and at the end of any doclist, seek to the start of the doclist in the full barrel for every word and go to step 4.
7. If we are not at the end of any doclist go to step 4.
8. Sort the documents that have matched by rank and return the top k.
Each link to a page is like a vote for that page as well as the pages linked to that page. Thus a hierarchy of pages is created and the search results are much more reliable. Lycos and Excite also use the same system, but Google goes further. ?It then looked at the position of the words on the page, the size of the fonts, and the likelihood that the words were related to each other? (1). Going the extra distance gives Google much better precision.
Google and engines like it can still be manipulated to achieve higher rankings. Anyone who creates a set of pages with links between them can fool the system and add value to their page. So the race continues to find yet another search engine. One promising way to search for something is to use a topic-specific search engine. Among the topic-specific engines are VactionSpot.com, KidsHealth.org, and epicurious.com. These engines give you better results because they are often a front-end to a database of information, they are regularly maintained and updated, and they have a narrow focus and smaller size.
It makes sense that if you do a specific search, then you are less likely to end up with irrelevant information. The good news is that you are getting high quality results in a short period of time. The only problem with topic-specific engines is finding the right one. This is where query routing comes into play. You have two types: manual and automatic. Manual routing means you find the best topic matching your query yourself which can be confusing. Automatic routing is designing an algorithm to do it for you.
One of the newer automatic routers is called Q-Pilot. Q-Pilot uses both offline and online areas for quicker access. When a user enters a query, that query is expanded to create multiple topics that are more specific. These topics are taken from a ?neighborhood? of pages and often represent another search engine?s topics. ?Q-Pilot?uses the web as its knowledge base and autonomously learns what it does not know? (2). This almost sounds like artif
Наверняка у вас есть товары или услуги, продажа которых приносит вам максимальную прибыль. Для быстрого старта в сети вам необходимо создание посадочной страницы (одностраничного сайта), на которой будет размещена информация о маржинальных товарах/услугах интернет магазина. За 8 лет опыта разработки конверсионных страниц мы выработали оптимальную структуру, которая позволит привлекать через landing page больше продаж. На такую структуру «одевается» ваш контент — фирменный стиль, тексты, фотографии, уникальные торговые предложения, после чего страница выходит в свет. Разработка лендинга и запуск в сети — до 7 рабочих дней. Стоит отметить, что в разработку самой посадочной страницы входит и написание копирайтером продающих текстов для вашего бизнеса, чтобы каждый посетитель страницы захотел совершить покупку именно у вас. Результат: качественно разработаная продающая посадочная страница, которая готова приносить вам новых клиентов.