TimesArgus.com - We Are Vermont

Google is Nuts



Toolbox

Jerry Chase - Published: April 15, 2006

If you use Google, you will want to use a different primary search engine after reading this. The Google page rank concept provides defective searches that can leave out the very information you are seeking, and ignore the work of dedicated web authors. Like most people, I never paid a whole lot of attention how google ranked sites in searches, and I just assumed that it was based on content with a few other factors tossed in to keep the scammers and spammers from rising through the ranks. I now know more, and I believe that the criteria it uses is fatally flawed and is destroying itself.

A few years back, I posted an extensive history of an area in Vermont
called "Bolton Falls" on one of my websites. That work was lauded by
the local historical society, and has remained available on the web since then with very few changes. It is a student friendly site with no advertisements and no hidden attempts to tweak the google ranking. It is simply good solid information about the area.

Yesterday, I decided to see if there was any new information on the
web about that area that I needed to include on the pages, so I
plugged "Bolton Falls" into the google search engine. As expected, I
didn't see anything that looked like new information. What was
unexpected was that my site, which is _by far_ the most informative on the subject, had dropped in rank to the ONE-HUNDRED NINETEENTH listed, right after the site that got listed for the phrase "Except maybe if Michael Bolton falls off a cliff or something"

Another extensive set of pages on my site concerns the "Mount
Mansfield Electric Railroad" Using that rare quoted search term,
my site on the subject ranks 15th, right after a couple of other secondary search engine listings that listed my site themselves, for the use of the words "snowplow" and "lumberyard"

Just to make sure that the world hadn't gone totally crackers, I did a
Yahoo search on the two phrases. My sites came up as the number one
and number three choices, as I had expected, and as had been the case
on google previously.

What this comparative search for my sites on different search engines points out is the horrendous mess that Google has made out of its search engine, degrading it from the number one choice in searching the net to something less useful than Billy-Bob's Search-o-rama and Goat Cheese Outlet.

Google depends on "PageRank." ( a registered trademark)
Quoting Google:

PageRank relies on the uniquely democratic nature of the web by using
its vast link structure as an indicator of an individual page's value.
In essence, Google interprets a link from page A to page B as a vote,
by page A, for page B. But, Google looks at more than the sheer volume
of votes, or links a page receives; it also analyzes the page that
casts the vote. Votes cast by pages that are themselves "important"
weigh more heavily and help to make other pages "important."

Important, high-quality sites receive a higher PageRank, which Google
remembers each time it conducts a search. Of course, important pages
mean nothing to you if they don't match your query. So, Google
combines PageRank with sophisticated text-matching techniques to find
pages that are both important and relevant to your search. Google goes
far beyond the number of times a term appears on a page and examines
all aspects of the page's content (and the content of the pages
linking to it) to determine if it's a good match for your query.

Close quote.

Translation: Google has changed the search engine concept from being a place where you get ranked based on the quality and quantity of your information, to one where you get ranked based on how many friends are willing to link to you and what other esoteric algorithms its programmers can dream up.

Popularity ranking does not work with many subjects, because reality exists independently of popularity, and is not achieved by common consensus. As Google grows further away from its roots, it becomes more bizarre in ranking pages. That is a failing that is further substantiated by google itself. Quote:

http://googleblog.blogspot.com/2005/09/googlebombing-failure.html

Googlebombing 'failure'
9/16/2005 12:54:00 PM

Posted by Marissa Mayer, Director of Consumer Web Products

If you do a Google search on the word [failure] or the phrase
[miserable failure], the top result is currently the White House’s
official biographical page for President Bush. We've received some
complaints recently from users who assume that this reflects a
political bias on our part. I'd like to explain how these results come
up in order to allay these concerns.

Google's search results are generated by computer programs that rank
web pages in large part by examining the number and relative
popularity of the sites that link to them. By using a practice called
googlebombing, however, determined pranksters can occasionally produce
odd results. In this case, a number of webmasters use the phrases
[failure] and [miserable failure] to describe and link to President
Bush's website, thus pushing it to the top of searches for those
phrases. We don't condone the practice of googlebombing, or any other
action that seeks to affect the integrity of our search results, but
we're also reluctant to alter our results by hand in order to prevent
such items from showing up. Pranks like this may be distracting to
some, but they don't affect the overall quality of our search service,
whose objectivity, as always, remains the core of our mission.

Close quote.

What google is attempting to justify and foist on the public as a legitimate search tool is the rule of the mob, where thought, logic, and science play second fiddle to the whim of the moment and the popular fetish. In a peculiar way, it is an anti-Snopes, and Ayn Rand's worst nightmare, where unfounded rumor can be linked to rumor, and the herd of sheep that follow the rumor give credence to the rumor by adding more hits and creating more links.

The "prank" of the "miserable failure" googlebomb exposes the issue.
Until the "prank" appelation was used, such blips were called loftily called "the mind of the web." by Google. They are no more the mind of the web or a schoolboy prank than the mind of a rock or the bombing of Pearl Harbor. The anomalies are representative of the perversities of humanity and Google's ranking system. The real miserable failure is the google search engine, and the way that it lacks the critical thinking skills of a tiny Vermont trout, which can determine what is food and what is a worm on a string designed to hook attention.

Group intelligence IS a valid theory. However, it only is valid under certain specific constraints and it creates its own problems. A group of plumbers is unlikely to choose the best site for gingham dresses. A group of kids (the largest part of the music market) is unlikely to promote Frank Sinatra sites. When a large mob co-opts a word or series of words on Google, a smaller mob gets shoved to the bottom of the rankings. I guess that works, in moderation, but certainly not as a primary search criteria.

Another fallacy of a popularity based search engine, such as Google, is that it is self-feeding and self-aggrandizing. As a site becomes more popular, more sites link to it, and hope for links from it. The quality of content soon becomes irrelevant compared to the number of links, dumping the lesser linked sites, as has happened with my websites.

As a user of the web, I have found one of the fascinating parts of the web is that it is a huge library of knowledge with minimal filtering by those groups that are constantly seeking to filter and censor what people see. Google has turned that library into a bookstore of the most popular subjects, with money and relationships as the key filters. The geeky kid who puts up a page on how he can solve the energy crunch, and who just might have a good idea, is crushed in Google rankings behind the usual suspects. The twit who has a webcam movie of his skill at flipping bottlecaps into the mouth of his nude girlfriend gets top billing.

It is easy to fall into the trap of thinking that the greatest
popularity equals the best choice in a search. However, television, motion pictures, and radio have all proven that popularity depends on dumbing down subject matter and playing instead to emotions, sex, broad slapstick and horror. Those are what make money and sell commercials, and that is where google is heading.

Galileo was not thinking the popular choice when he re-asserted that the earth rotated the sun. Pasteur was not promoting the popular choice when he told doctors to wash their hands. Popularity is a transient phenomenon. Betamax was popular at one point in time. Eight track tapes were popular at one point.

Do I think the lack of proper ranking of my web pages is Google's fault? Hell, yes.

Google designed the ranking system it uses, and it claims to be a
useful search engine. It is NOT my responsibility as a web page author to kiss Google's feet and whine "Oh please give me a better ranking, oh great Google." It is Google's responsibility to both users and web authors to provide useful and intelligent searches.

When you search using Google, it is currently behaving like an uncle who, when you mention your car having a steering problem in hopes of tapping his expertise, launches into his story of "Back when I had my 1952 Plymouth..." Neither he nor Google understand the importance of staying with a subject, and shutting up about favorite stories.

The simple solution for Google is to change the weighting of factors to provide a more balanced view. Google has taken an extreme and claimed it as proper and normal. I've learned over the years that extremes usually end up in disaster, and do no service other than showing how bad extremes are.

Galileo summarized the problem with this type of ranking long ago:
"I seem to discern the belief that in philosophizing one must support
oneself on the opinion of some celebrated author, as if our minds
ought to remain completely sterile and barren unless wedded to the
reasoning of someone else." That was not the case then, and is not the case now, yet Google holds by that credo, and hails the rule of the mediocre and those who know how to manipulate the mediocre.

When I use a search engine, I expect it to search based on content. If Google can't do that with something as obvious as the examples I just gave, then there is no reason for me to trust it to do any better on any other subject.

There are a number of alternate search engines available, such as Yahoo, Vivisimo, and others. Please do yourself a favor, and use them.








READER COMMENTS


Mr. Jerry Chase,

Your sentiments of frustration are appreciated. However, your observations are reflected and confirmed by both Mr. Larry Page and Mr. Bill Gates.

Mr. Larry Page (co-founder of Google) himself rated Google 3 out of 10.
Mr. Bill Gates (Microsoft) rates search as eons away from 'good search'.

Further, Mr. Gates recently made the profound statement that we are not experiencing /information overload/, we are experiencing /Miss Management of Mister Information/.

This directly relates to the phenomena, search.

Other than both of their billion dollar bank accounts, other than their intellectual skills in mathematics and programming, their 'insight' and 'honesty' regarding this matter deserves respect.

Both Mr. Page and Mr. Gates have to balance providing a service, that is, search and making money. Both Google and Microsoft are businesses, "to make money with the intent to profit".

That, in its self, is not an easy task. Both are equally, if not more frustrated than you, yourself!

I believe that both you and I are well familiar with the World's Most Powerful Search Engine. I personally started learning and using it at the age of 7, many decades ago.

The Page or the Gates who modifies it to the World Wide Web will be the proud owner of an ultimate search engine.

You may have taken notice that that is an objective of many of the search engine owners/creators. The issue, today, is, how?

I tend to believe that the objective will be accomplished as a group effort.

When accomplished, your information will be relatively easily found (it still takes a certain amount of gray-matter to conduct a search/do research).

The World Wide Web, W.W.W. is defined as a container of information on the internet. It was invented by and freely handed down to /the people/ by Mr. Breners-Lee. The internet by the Dept of Navy.

Now that we have both the platform for this /container of information/ and the network for distributing and making this /container of information/ available we need to learn how to manage it.

Mr. Dave None's reply suggests that that will take a long, long time. Good relevant search is NOT going to be /people/ driven. It is going to be driven by the intellects, of which there are very, very, very few among us.

Less than 1% of the 10,000,000,000 web pages found on the W.W.W. follow web standards (W3C, Mr. Breners-Lee, CEO) and are VALID web pages!!!

Web pages are created by /the people/. Over 99% of these webmasters are sloppy, invalid. These /people/ have painted their own picture and they are not masters, at all.

You may have noticed that the MSIE browser is the one browser for which creating html documents needs to be hacked. This will ultimately (need to) change.

You may have also noticed that the ONLY SERPS of the Big 3 (Google, MSN, Yahoo!) that follow web standards and are valid are MSN, creator of the MSIE browser!?!

This may be driven by Mr. Gate's project, STAND UP, since it is difficult to espouse excellence to youngsters when excellence isn't being practiced.

Web standards are no different than any other set of standards. If the dwelling in which you reside didn't follow standards, you'd have a ton of bricks hit you in the head.

Currently, the W.W.W. has 10 billion bricks falling ...!

This tells us that one of many stumbling blocks in good search is good design. This statement is highly debated today (mostly by SEO marketers who are trying to make a buck and a half). Unfortunately, their only arguement is laziness and ignorance.
-- Posted by Vladimir Toman on Sun, Apr 16, 2006, 9:18 am EST

report this comment



I have known these things for awhile now and I believe you have the wrong impression.
First off, Google is popular because we the people made it that way. They did not advertise themselves, people simply used Google, realized that they were able to find the exact information that they wanted and told friends to use it to. Pretty soon everybody started using it. Why? Because it works.
You must remember that Google is not there to tell you the answers to your questions, but rather to tell you what the internet says are the answers to your questions.
If you type in "music" as a key word and don't get back your favorite artist, you can conclude that perhaps *your* favorite artist is not everyone else's favorite artist. You should adjust your search query.
Google is a web search engine: it searches the web and Google is not responsible for any information that the web says about your key words. It is simply telling you what's out there. Period.
If the internet thinks that Bush is a misserable failure, then that's what it thinks. Does that mean that most people in the world think he is? No necessarily, just that the idea is out there on the web and is very prevailent. In fact, your article just contributed to keeping Bush's homepage as number one for "miserable failure"!
Google works and works well. It was the first company to have the "link-back" idea, and almost every other major search engine has adopted their form of it. Why? Because it works, and it works well.
That should be enough to ponder for now. I just the article connects concepts that should not be connected, namely, that what Google says the internet says is supposed to somehow be true about real life. Most of what people put out on the web has context to it and so most of the queries you give to Google will return relevent result. Those that do not are the exceptions.
-- Posted by Dave None on Sat, Apr 15, 2006, 10:30 pm EST

report this comment



You are totally right. For the most part Google works great but some content gets exploited for other reasons and great content drops to the bottom of the ocean if you don't care about back links (and why would you?!).
-- Posted by Chris Kameir on Sat, Apr 15, 2006, 9:14 pm EST

report this comment


You must be logged in to leave a comment. Register | Log In

Logout