17th July 2007

GWHG Highlight: Overwhelmed


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

Today in my Google Webmaster Help Group highlights I am going to pick a thread from each sub-group to emphasise.

Crawling, Indexing, and ranking:

The archive for this group is currently unavailable.

We’re sorry for the inconvenience. Please try again shortly.

Google Webmaster Tools:

The archive for this group is currently unavailable.

We’re sorry for the inconvenience. Please try again shortly.

Sitemap Protocol:

The archive for this group is currently unavailable.

We’re sorry for the inconvenience. Please try again shortly.

Suggestions & Feature Requests - webmaster-related only please:

The archive for this group is currently unavailable.

We’re sorry for the inconvenience. Please try again shortly.

Random Chit-Chat:

The archive for this group is currently unavailable.

We’re sorry for the inconvenience. Please try again shortly.

It’s like driving up onto a car accident, you wonder if you should call 911 or if a dozen other people already have. I’ve got to assume that someone is trying to get the hamster back on the wheel by now. I just hope they didn’t crash the mother board on the Google Groups server, because parts for Commodore 64s aren’t that easy to come by any more.

posted in GWHG, highlights | 2 Comments

13th July 2007

Site popularity: Displacement, velocity, and acceleration


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

I want to start out by saying that I haven’t wasted my time reading patents nor have any inside knowledge whether this is fact or not. It’s just pure theory, conjecture, hypothesis, observation, opinion…it’s a blog post.

We all know that one of the factors of a site’s ranking possibilities is the popularity of the site/page. The defacto measurement for this popularity is the amount of links the site has. Links drive entire economic segments of internet marketing from buy and selling, or just pure creating them. At the center of the link popularity firestorm is Google’s PageRank, measurement of the pages importance, which is purely a calculation of the total and quality of the links pointing to a page.

Douglas Fairbanks is a really popular guy, he has millions of fans that pay their last nickle to see him his movies. Unfortunately, Douglas is dead, and hasn’t made a movie since the 1930s. His popularity didn’t wain, his devoted fans were still fans, however new things came along (like sound) in the movies and they became fans of those as well. The point being, like in life, being popular is a continuous effort. You cannot reach a certain amount of fame, links, and then sit back and enjoy the ride.

Google keeps track of your links. We check for them using the link: operator, log into our webmaster tools account to check the links, go to yahoo and use their site explorer, and wait for that quarterly PageRank update. By logging the links to a site they also collect another crucial metric that is rarely discussed, time. Somewhere in Google is a database that is logging: Link X with give PR to Y page found on DD/MM/YY at HH:MM:SS. All of the online tools available track the quantity and to a lesser extent the quality of the links, however the time factor is not mentioned. Given the time factor a whole host of calculations possibilities arise, I’m going to go over some implications.

Displacement

Displacement is the total number of links a site has, it’s the distance from zero along a straight line to the total. It’s the one factor we can gather some data on ourselves by using online tools. When evaluating a sites performance problems, it’s usually the first place any forum observer goes and says things like, “You don’t have enough links, get more to rank for anything” or conversely, “I don’t know why you don’t rank you’ve got 8000 links”. The displacement of your site’s links is the sum total of all the links you’ve received, less the ones you’ve lost, for a snap-shot of the site’s health. Older sites tend to have more links, since they’ve been around a while to gain them, as do popular or trendy sites, as they tend to get them quickly. Not-so-good sites or sites about obscure subject that nobody is interested in tend to have less, new sites may have none.

Velocity

Using the time data of link acquisition another variable can be calculated, the link velocity. Velocity is defined as the rate of change of displacement, given in units of displacement per time ( MPH, m/s, ft/min) or for site popularity let’s say Links/Week, Links/Day, Links/Year, or Links/Site Age. Velocity is the rate at which your site is gaining/loosing links to it. It’s not easily viewed in any of the online tools or data given. Positive velocity is anything above gaining zero links per time period. If you’ve gotten one link in the last week, and not lost any, you’ve got positive link velocity. However if you haven’t gotten any links this week but lost 3, you’ve got a negative velocity. Velocity is great indication of how the site is currently doing, much more than displacement. For example, if you’ve got a site with 10,000 links to it, normally we’d say that site is fairly popular, but if it’s only gaining 2 links a week at the moment, it really isn’t that popular any more. Sure you still get your credit for having 10,000 links but some consideration has to be given to what your doing today.

Another calculation is overall velocity, or velocity calculated over the time frame of the entire event . Let’s consider two marathon runners. Both have run the entire distance of 26+ miles. The first runner completed the journey in 3:00 hours, the second took 7:00 hours. Our first runner has an over all velocity of 26/3 or a little over 8 -1/2 MPH, while the second ran an average of 3-3/4 MPH. In the web world they’d both have the same PageRank (26+ miles) but entirely different link velocities, one may have taken 3 years to get to a PageRank of 6 while the other has done it in 9 months.

Acceleration

Doing a further calculation on the links and time data we can come up with the link acceleration, defined as the rate of change of velocity. Knowing the acceleration of an object tells us the trend that object is taking, is it gaining or loosing velocity. So now for a web site we’ve got three parameters to look at; the total links, the velocity of gaining or loosing links, and the rate at which the site is gaining or loosing. For a given site that has 10,000 links to it, last week it may have gained 50 new links for a velocity of 50 links/week. The week prior the site gained 40 links so the site is accelerating, it’s velocity has gone up by 10 links per week per week, showing an upward trend in popularity. On the other hand, let’s take the same site with 10,000 links that has gained 50 links this week, however last week it gained 80 links. The acceleration of the site is negative 30 links/week/week. It’s still fairly popular, it’s still gaining in popularity, but the rate at which its gaining popularity has slowed down. This isn’t something is normally noticeable when doing a site evaluation, as the data just isn’t there for us to gather, unless it’s been gathered and logged with time as a constant.

Comparison

I’ve discussed how a sites link profile could be used to evaluate its current popularity and trends, but there is another consideration. Since Google has this information on all the web sites in it’s index, one would have to assume that they use it on a comparison basis. For a given search term Google whirls and buzzes and comes up with a ranking based on it’s 11 secret herbs and spices, then comes to link evaluation. The first is raw popularity, how many links point to two sites relevant to the search, the second is velocity - how fast or slow has that site been gaining links, and the 3rd is acceleration is that link gaining/loosing been going up or down. Why comparison is important is because link popularity, velocity, and acceleration do not have the same weight in the ranking algorithm for all sectors. If you are searching for the history of WWII one would assume older more popular sites would rank higher because the history of WWII hasn’t changed, the interest in the subject is pretty steady, and velocity and acceleration should follow the web as an aggregate (as the amount of all available links grows so should a sites share). This would be where an old established authority site would probably be unbeatable in the ranks. Other topics however may not have a history to consider, they are new, so velocity and acceleration would have to be considered more.

Some implications and observations

So now that I’ve hopefully got you thinking about something other than just how many links you’ve got, let’s consider the implications of such ideas in regular search behaviors.

Google has yet to celebrate its 10th anniversary and the internet does not seem to be going anywhere soon. If the algorithm was purely based on PageRank or total links eventually all the web results would settle down to a select small group. Older established sites with their millions of links would continue to get millions of links and eventually be unbeatable. Well, this is obviously not the case as new sites and trends pop up in popularity all the time. It is conceivable that in 20 years time when we’ve got a real history to look at, 30 years of Google, that there will be sites that have millions (if not billions by that time) of links but yet don’t rank for anything at all. Sites that may be popular today will still have their links, but will not gain them as before and be filtered to the bottom. Just like our friend Douglas Fairbanks, his fans didn’t stop loving him, they just starting liking other things more. I’m looking forward to the day when I can look back at the internet with my grandkids and tell them about the days when every search turned up a wiki result. And I can show them the old and busted wiki site siting there with 10 billion pages of content and 20 billion links not showing up at all in search results because no one has linked to it in the last 10 years….ahh one can dream.

It’s been observed by many that PageRank isn’t everything, and the primary proof of this is the search results page. It’s been pointed out in a million different places that a PageRank 2 page can outrank a PageRank 6 page. Other than on-page factors such as content, is the link velocity and acceleration. The PageRank 2 page may not have as many total links at the moment, but it’s been getting them at a quicker pace than the PageRank 6 page.

Another phenomena that is discussed often is the newness factor. Fresh sites and pages tend to get a bump in the SERPs then settle down into a lower rank. A new page has no history, so when an acceleration calculation is done it’s acceleration is huge. If it’s deviation and velocity were zero last week, but this week it has 10 links it has accelerated tremendously. In order for an established page with 100 links to it already have the ability to match our new pages velocity it would need to get 1000 links to it in the same week.

I’ve read in some forums the theory of don’t get “too many links too fast” I’ve always thought that was an odd theory as it’s a natural phenomena. When Apple announce the iPhone, I’d imagine it got a few links that day. However where there may be a grain of truth to it is in unnatural linking. Let’s say you decide your site is lagging so you take a break from content generation and go on a week long link building campaign for your site. You write to 100’s of sites asking for them to take a look at your content, suggesting where they could benefit from linking to a page on your site. You also go and submit to a few hundred directories, and then go buy a couple hundred link ads. Initially you’ll probably see a substantial boost in traffic and probably rankings, you may even see some more green pixels in the tool bar on the next update, so you’re happy. You go back to business as usual, and then a month later are in WebmasterWorld whining that you’ve got the too-many-links-too-fast penalty. I’d suggest that there is no such penalty, just that you’ve made your site look like its loosing popularity rather than gaining it. Sure you get some credit for having more links than it used to but when put into context with the temporal data the site looks like it had a big gain in popularity one week and then the next the popularity wained. Sure you want to outpace your competitors in link building rate, but remember slowing down in that link building is also a sign. So a link building campaign un-naturally sets the bar higher for a site, when that campaign stops you can no longer maintain the false popularity acceleration that it portrayed. So our webmaster quits whining in WebmasterWorld and moves on to other things, the site will eventually settle back into its natural link growth and probably will regain its original rankings.

Spend any time watching webmastering forums and one recurring theme you’ll see is “I’ve done nothing and all of a sudden my rankings dropped” also known as the -950 or whatever penalty. At this point many will head on over to site explorer and check out the links, the site owner will point to a bunch of great links they’ve got on Microsoft’s home page, etc. What isn’t considered is acceleration. Remember there is negative acceleration as well, or the velocity is slowing down, and there is even negative velocity, where your link total is dwindling. If your competitors are gaining links at a regular pace but you’ve just lost some, your site may appear like it’s penalized. Once again the problem lies in only looking at the link total and not knowing the link trending. If the site has 10,000 links and gains 100, it’s probably not going to be noticed by observing link: commands, 10,100 looks a lot like 10,000. On the other hand, if the site was normally getting 100 links a month, but then in one day lost 500 links an interesting thing happens. The popularity will appear pretty much the same, 9500 links looks just as good in site explorer as 10,000 links. BUT the velocity will be negative, and the acceleration will be HUGELY negative because the links were lost in a short period of time. Now 500 people rarely get together and decide to remove some links, but Google does it all the time. In Googles ever-quest to improve its ranking algorithm they are always re-evaluating which links count and don’t count. If they’ve recently discovered that 500 of your links are footer links on sites you bought them from and simply discount those links as not counting it may appear as a penalty because of negative acceleration and velocity. No amount of writing reconsideration requests is going to get the site back into the rankings, because the effect of the negative link building will be there. This also explains why some sites that suffer sudden ranking drop come back into the rankings slowly. As time goes by, that negative spike in acceleration slowly fades into the sites average. The sites natural positive acceleration will slowly show that it’s again gaining in popularity and the effects of loosing the links all at once will be eliminated.

Blogs tend to get a bad rap for being able to rank fresh post quite fast then fading into obscurity. I think this has to do with a blogs infrastructure and the link velocity and acceleration factors. When a new blog post is published it is shown on the front page of the blog, in a couple of categories, and probably in the archives. If the blog is remotely popular many sites aggregate the feed and also publish the story on their front page, categories etc. Unlike adding a new product under an existing category on your ecommerce site, a new blog post get’s tons of link pop right out of the gate. It’s link acceleration is huge. After some time goes by acceleration stops, velocity goes to zero, and displacement stagnates. The blog post then fades into a ranking position much lower than the initial publication.

The circles I travel in tend to bring me to a lot of professional SEO sites and people. One of the main tenants of being an SEO is that SEO is not a one time thing, you cannot just SEO a site and let it ride,you need continued SEO. Many have very good proof of this by being able to document sites that they used to work on, the client stopped using their services, and then slowly the site drops into the abyss. Part of an SEO process is a link building campaign. The campaign can be as white-hat as possible only generating natural links, however it is un-natural in that it outpace the sites natural abilities. Stopping this campaign will be seen as a negative acceleration and thus a slow down in link velocity for the site. The key to a good SEO campaign of link building is not to outpace the natural link building of the site by too much. When link building is stopped, it cannot stop all at once. The site needs to ween itself from link building, slowly reduce it’s link building activities until it’s normal velocity is within the deadband of the un-natural link building. At that point the site can live on its own without a negative rankings drop.

In conclusion I’d just like to sum up the fact that looking at a link total for a site is not the only indication of it’s health. There may be other factors in a sites popularity upswing or down swing in rank other than just the total links. Any un-natural link building whether by-the-rules or not can be seen in time trended algorithms.

Coming up next: How you can monitor your own link velocity and acceleration, kind-of.

posted in SEO, Webmastering | 3 Comments

12th July 2007

GWHG Highlight: MFA (adsense) vs. MFA (affiliates)


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

Before I get started on the subject at hand, I’d like to point out a new commenter on this blog, Susan M, of Google fame. I appreciate her time and insight. I’m not an A-list-party-with-googlers-all-the-time kind of blogger, but you will see that the people who regularly comment here are all very much more intelligent than me, a theory which is only backed up with your presence.

There are two quite interesting threads in GWHG right now. One has gotten a significant amount of blog airplay because Adam Lasnik made a pretty revealing comment**. [as an aside I have to volunteer that I responded** to it somewhat negatively as the banned site is no more in violation of Google quality guidelines than other very popular sites. Popularity, as we all know from going to high school, is not an indication of quality.] The one Adam is involved in was about an obvious MFA (made for adsense) site** that has been banned, the second site that hasn’t gotten any Googler play is a MFA (made for affiliates) site**, which is possibly under penalty.

I don’t have the answers for the two sites involved, but I did make a few observations while viewing them. If Google is working on cleaning up it’s index by removing sites of lower perceived value I applaud them, there is a lot of junk out there. A lot of junk that they created of course as a secondary effect of adsense. If they really want to make an indent of the junk I’d like to point out two sites that provide very little in the way of valuable content. The wiki is mostly information pulled together from other sites, and about.com is just a giant made for adsense trap taking advantage of subdomain spamming techniques. Spam doesn’t just mean using hidden text and links but also useless sites, a much more subjective assessment tha’s probably as not as easy to mechanize.

MFA (Made for Adsense)

These sites have no real purpose but to generate clicks on adsense ads. The designers put together content that will attract high paying ads (the ads you get are contextual). Part of the TOS (terms of service) of Adsense is that you are not allowed to encourage clicks or even draw undue attention to the ads. The revenue model for being a successful adsense publisher is that you need people clicking on those ads, you don’t get paid by them viewing your site. The best way to get the ads clicked is to design the site to be less fulfilling than the ads. In order to make any money on adsense you need to design the site to be good enough to generate some traffic, but be bad enough so that the viewer doesn’t get what they came for and will go looking further, hopefully through the ad. If you write the worlds most definitive article on digital cameras, answering all the users possible questions perfectly, they won’t click your digital camera ads, why would they?. If you write a vague article mentioning digital cameras enough to get some search traffic, but crappy enough that they won’t get any real answers, they are more than likely to click your ad looking for satisfaction. It’s an unfortunate fact about contextual ad publishing, the best sites as far as content don’t do well, the garbage ones do.

MFA (made for affiliates)

The model for building an affiliate site is different than getting paid for clicks. You only get paid when someone follows your affiliate link and then purchases an item. Contrary to adsense you encourage people to click the ads or follow the links. Unlike adsense you don’t get paid just for them clicking the ad, they need to purchase something, you need to close the sale to get the pay out. In affiliate driven sites, the job of the content is to inspire you the visitor to go somewhere else and purchase an item. Poor affiliate sites that are not successful may generate traffic, may generate clicks, but don’t close on the sale. The best affiliate sites give the consumer enough information to make an educated purchase decision. Affiliate marketing pretty much encourages good writing and research. The poor ones usually just copy content and republish it, those types of operations require millions of page views to be at all successful. Writing the same digital camera information site monetized by affiliate sales would require your visitors from search engines be VERY satisfied with the information they received, so satisfied in fact that they are willing to go and buy the item.

The motivation for publishing both types of sites of course is renumeration, but the methods needed to be successful in either one inspire entirely different content creation styles. I back Google up in their quest to clean up the worst MFA (adsense) sites as long as they get rid of the worst but very popular crap as well. I’d also hope they continue their assault on copied or scraped affiliate sites, we don’t need another site in the world publishing Amazon’s write up for some SEO Books. On the other hand, if I am looking for some lawn care products I hope I find a site like that one, which provides a 3rd party point of view on many related products. It’s information I cannot find on Amazon’s site.

(Like the adsense and affiliate link drops? Ironic isn’t it?)

** Sorry for the nofollow, but I don’t link to places that have a policy of not linking out. Add me to the what we are reading blogroll (or any google domain for that matter) and I’ll be sure to remove all of the nofollows. :)

posted in GWHG, Webmastering, highlights | 1 Comment

10th July 2007

GWHG Highlight: Javascript


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

Google Groups pfingo wonders:

When i click on the cache, i get a google error.

And Webado sharply notes.

Disable javascript and then go and visit your homepage at http:// www . pfingo . com/

You will see a blank page but viewing the source code you will see this: [code]

This is 404 page (not found) , so this is all that a robot will see.

For your human visitors you have the javascript redirection which is totally useless for robots.

Google is not a person. She doesn’t view your website with firefox or internet explorer, which also means when crawling your site your java script is not going to be executed. If you use that script to redirect your visitor, google is not going to see it.

When designing your site you must not only consider how it looks in many browsers but how it works with features like Java and Flash turned off. Not all people, including Google, browse with these features on. Using allows you to download add-ons to disable javascript, view as IE, turn off images, highlight external links, etc.

posted in SEO, highlights | 0 Comments

10th July 2007

GWHG Highlight: Hidden text and the reconsideration request


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

Google GroupsA thread was started on July 3, 2007 by the owner of a site who believes that Google has stopped indexing his/her site because:

About three weeks ago I turn[ed] a cookies feature on which would help to prevent abuse of the site. I believe this also cause all bots to stop crawling the site.

Google does mention that the use of cookies could be problematic, specially if it’s required to properly see the site.

Use a text browser such as Lynx to examine your site, because most search engine spiders see your site much as Lynx would. If fancy features such as JavaScript, cookies, session IDs, frames, DHTML, or Flash keep you from seeing all of your site in a text browser, then search engine spiders may have trouble crawling your site

Had the cookies caused a problem it could have been diagnosed by using the Lynx browser.

That’s not why I am pointing out this thread.

Googler MattD steps in and points out some “old” pages of the site that contain a significant amount of hidden text (click link to view the hidden text). Noteworthy in this discussion is the fact that MattD went beyond normal protocol and provided site specific information. The danger of doing this is that everyone may expect this sort of person treatment, which isn’t feasible and is the wrong assumption, but it also is a great milestone and example that should be held up as model for others to learn from. From this example I drew the following opinions.

  1. It’s good to have an idea of what you may have done to get in trouble, but don’t let that idea get in the way of other possibilities. Often having multiple people look at the site will get you differing views that you the owner who is often too close to the site and wouldn’t see as a problem.
  2. We don’t know how MattD knew what the site was in trouble for, was it a manual review or a signal in some of their wonder tools? Either way they know. Remember Susan mentioned that a review of your site will probably include a deeper look at it’s over-all practices.
  3. When submitting your reconsideration request you must be forthright and include ALL discretions, even the old ones specially the old ones. More than likely a ban or penalty is not from what you did last night but from a while ago, a review of the entire site is in order along with a recount of all the changes.
  4. It is entirely possible that the site and or pages ranking was affected by the hidden text, after reconsideration the site may not regain its original position since that effect is now gone.
  5. If you are penalized its because Google has decided that you were attempting to fool the search algorithm. If when you submit a reconsideration request that is incomplete and doesn’t include all problems, that could also be considered an attempt to deceive, though Adam Lasnik has said multiple reconsideration requests are not seen as a signal to be held against you. I wouldn’t assume that filing a 2nd or 3rd request would be aggregated with the previous one, more than likely a different person is reviewing it. If I were to submit an additional request with more information I’d include the previous statements as well
  6. This is always a problem with a 3rd party looking at a site. We are not always given all of the information available, access to all of the sites pages on the server, or knowledge of what was done before. We only see the state the site is in now and without a context in which to put that in. Google on the other hand is the king of data storage and can contrast and compare multiple various previous incarnations.

posted in SEO, highlights, reconsideration request | 2 Comments

2nd July 2007

pa rum pum pum pum


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

drummer.jpg

…I am a poor boy too, pa rum pum pum pum. I have no gift to bring, pa rum pum pum pum. That’s fit to give the King…

My displeasure with Google’s handling of the Webmasters Help Group is well known, and I am weary from trying to influence change, so weary in fact that I’ve been pretty much reduced to occasional lurking with a random post or two. There are others though, much more disciplined and forgiving than I am. For these people still holding on to the dream, I commend them, and though I am poor in resources and cannot thank them financially, I would like to offer them a link drop, for whatever that is worth.

So congrats to the following for trying to keep at least some signal in the noise filled shell of its former great self Google Webmasters Help Group.

Webado - Web Hosting and Design in Canada

Cass-Hacks - Powerful XHTML DHTML presentation and accessibility tools

Phil Payne - Website rescue, redesign and maintenance

Dockarl - BlixKrieg Wordpress Theme

JohnMu - Search Engine Tools

Sebastian - Links, Links, Links…

Red Cardinal - Search Engine Optimisation Ireland

Who did I miss?

posted in GWHG | 8 Comments

27th June 2007

I’m confused


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

Since Adam posted that more Googlers soon to be ignoring the Google Webmaster Help Group where should us regulars send people for answers from Google?

Is it seoroundtable blog comments, webmasterworld where they strain to figure out what sites people are talking about, maybe Matt Cutts blog where the real estate industry goes to complain about fraudulent link exchanges, or do we have to fork over ass-loads of money to sit in on secret sessions?

Now that threadwatch is dying it may be time for someone to start a googlerwatch site, as we cannot count on them bothering with the official channels any more. We tried to set up a not-for-profit site chronicling what Googlers were saying in the Official Help Group, but they stopped talking, and the heck if I’m going to funnel visitors to ad filled A-list blogs.

posted in GWHG, Webmastering | 0 Comments

19th June 2007

Why Link: doesn’t work in Google


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

The question “Why doesn’t Google show all my links?” gets asked all the time in forums. Generally the poster is really asking, “What do I need to do to make Google recognize my links?”, somehow inferring that they have a problem because Google doesn’t show their links.

2nd thing first

The answer to the second question is that they don’t show your links because they don’t show your links by design, its not an indication that anything is wrong with your site. For example the link operator show I have 85 links to this blog, whereas Webmaster Tools reports over 2000.

First off let’s take a look at their on-line documentation, for the definition of the link operator they say:

The query [link:] will list webpages that have links to the specified webpage. For instance, [link:www.google.com] will list webpages that have links pointing to the Google homepage. Note there can be no space between the “link:” and the web page url.

This functionality is also accessible from the Advanced Search page, under Page Specific Search > Links.

This is where most of the confusion begins. Being that we trust Google to be the #1 search engine that it is, when they tell us “will list webpages that have links” we believe them. Unfortunately the reality is that it “will list [SOME] webpages that have links.” Which is further explained in this official blog post, where they say:

You asked, and we listened: We’ve extended our support for querying links to your site to much beyond the link: operator you might have used in the past. Now you can use webmaster tools to view a much larger sample of links to pages on your site that we found on the web. Unlike the link: operator, this data is much more comprehensive and can be classified, filtered, and downloaded. All you need to do is verify site ownership to see this information.

As I indicated above using a verified account in Webmaster Tools will give you a lot more information, but there is a caveate. Not all of those links will count for anything. They don’t show every link out there, and the ones they do may be nofollowed, as explained by Matt Cutts:

Do not assume just because you see a backlink that it’s carrying weight. I’m going to say that again: Do not assume just because you see a backlink that it’s carrying weight. Sometime in the next year, someone will say “But I saw an insert-link-fad-here backlink show up in Google’s backlink tool, so it must count. Right?” And then I’ll point them back here, where I say do not assume just because you see a backlink that it’s carrying weight.

First thing 2nd

To answer the first question, “Why doesn’t Google show all my links?,” let’s imagine if they did perfectly with each and every page that links to you.  Further imagine that they only show the relative links that carry weight, and even better they rank them in order of importance.

Imagine that you’re the owner of www.sockpuppets.com and you want to check your backlinks.  You go to [fictitious] Google and type in link:sockpuppets.com and it shows you 2875 sites that link to you.  You’ve got all the information on which sites linked to you, and what sites obviously don’t that you think should.

While perusing the internet for a gift for his niece a less than scrupulous  web designer finds your site, he likes it so much he thinks he’ll copy it as imitation is the sincerest form of flattery.  First he downloads your site and all of the contents, replacing the items you sell with affiliate links to other sites.  Then he heads over to godaddy.com and buys sockpuppets.info for $1.99 and uploads the copy of your site to his fresh new domain.

After that is done he heads over to [fictitious] Google an types in link:sockpuppets.com and gets the complete list of sites that link to yours, imports those sites into a database, and loads up his favorite mass mailing script, and sends off a letter to 2875 sites that goes something like this:

 Dear Webmaster of {sitename},

We thank you for your past support of www.sockpuppets.com through a link to our site.  We never rest at sockpuppets and are always striving to improve our and thus your user’s experience.  As part of our continued development we have recently launched our new and improved site on the newer top level domain www.sockpuppets.info.

During this transition the older version at sockpuppets.com will still be available, but that will be phased out next quarter.

We hope we can continue to count on your support and ask that you take a brief moment to update your link to our new domain of www.sockpuppets.info.

If you have any questions at all please feel free to send me an email.

Thank you very much,

John Smith
Webmaster
Webmaster@sockpuppet.info

Sure not all sites will respond but a good percentage will.  In Google’s view those long standing links that you’ve worked so hard to build and maintain have just changed their vote from your site to this new hot site on the Internet.  It won’t be long before the copied site is outranking your own site.

Now this little fictional story may be over simplified but the lesson is true.  The reason Google doesn’t show the links to your site is that others could use that information to hurt you.  By requiring you to verify ownership of the site, or at least control of it, they are assuring that only the right people can get this information.

posted in Google, Webmastering | 4 Comments

19th June 2007

Where everyone knows your name…


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

I need a new forum to hang my hat in. I’ve enjoyed the Google Webmaster Help Group for quite a while now, but it’s loosing its focus. The place has been inundated with retards trolls, and left for dead by Google*. No longer is it the wonderful place it once was that had insightful official information. That official information is now pretty much exclusively fed through non-official A-list SEO bloggers now or at their conferences around the bar.

I really enjoy the search engine standards discussion the most, so I’m trying to find a new active forum. I figure if Google isn’t going to be answering any more question officially I might as well make my contributions somewhere where some who needs the money is gaining from it. I get the feeling that Google is busy counting their money and not looking to improve their webmaster relations or be bothered with supporting us little people.

I’ve tried Digital Point and WebmasterWorld. WMW has a higher quality of posters, but you can’t discuss real world issues there so their search engine forums are worthless. Digital point will allow spirited discussion, but there are WAY too many idiotic questions that the intelligent stuff get’s hard to find. Been lurking about cre8asiteforums but the volume is so low its hard to get a discussion going, maybe after I get to know some of the players it will improve.

Any other suggestions? If you’ve got a favorite please post a comment so they at least get a link, and perhaps a new contributer (me). I love helping people and learning, not arguing and reading lies, so moderation is a must.

* I was putting together my final statistics of the group, but I got depressed and sad, so I scrapped it. Not only is Googler participation nil, but the once strong regulars are backing away. It’s hard not to when hourly the same trolls spew their lies unattested.

posted in GWHG, SEO, Webmastering | 10 Comments

8th June 2007

Your neighborhood


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

Car on blocksYour site is judged not only on the content it provides, the sites that link to you, but also the sites you link to. The subject of those links help set the theme of your site but also define your web neighborhood, which in most cases you want to keep good.

We’ve been told in the Webmaster Guidelines to avoid bad neighborhoods.

In particular, avoid links to web spammers or “bad neighborhoods” on the web, as your own ranking may be affected adversely by those links

If you must link to such a terrible place, then by all means use the comment-spam-paid-link-announcing-bad-neighborhood-no-page-rank-passing nofollow attribute in the links.

In the old days before electricity Google used to announce to the world that a site sucked by gray barring the visible PageRank or simply delisiting the entire site.

When judging a site you used to be able to visit it and check out it’s visible PageRank run a site command and see that it’s indexed and be pretty well assured that the site was doing fine. With the emergence of penalties instead of bans, PageRank not changing, and gray bars showing up for all kinds of pages the job of being a good linker has become more difficult.

With that said, I’ve come up with a short quick list of things that can be done to check if the site is still in good graces with Google. Some of these items are based on pure conjecture by forum members ( the infamous -30 and -950 debaters) so any one failure should not be a clear indication of the sites linkability but may be cause for more investigation.

  1. Check for PageRank, if the site is new this may not exist.
  2. Check number of pages indexed using the site command.
    1. Compare this number to the URLs in their sitemap if possible.
    2. Crawl the site with Gsitecrawler (which used to have sitelinks but doesn’t now???) to see how many URLs they actually have, compare with indexed.
  3. Check the supplemental count. Is it in-line with what you’d expect for the main pages visible PageRank. A PageRank 6 site should be able to hold more than 3 pages in the real index!. :)
  4. Search for the domain name, see if they turn up as the first result.
  5. Does the home page show up first in the site command?
  6. Most importantly, and often overlooked, if you are linking to site because they are the authority on chickens, search for chickens. Where do they fall in the results?
  7. Building on #6 check the ranking for the keywords and combinations of keywords that the site is obviously targeting.
  8. Not all bad sites have been caught, yet, so do a good check for compliance with the Webmaster Guidelines. While the site may pass the sniff test on all other fronts, if its got hidden links and text on it, it may be banned tomorrow and you don’t want to be linking to it.
  9. Check their link profile using Site Explorer. If 90% of their links are from directories, irrelevant sites, or forum signatures you know you’ve got something suspicious going on.
  10. Check the sites basic maintenance quality. A site that has a good robots.txt file, no conical issues, doesn’t use session IDs, etc is probably more aware of search engine standards than one that doesn’t.
  11. Check the META tags. Not that this is a sign of being banned, but for me a sign of quality. If the site has 23 META tags from 100 “keywords” to “google-pray” and “revisit-after” you know you are dealing with an amateur that may walk into trouble in the future.
  12. Send Matt Cutts an email and ask him to check the site out. He usually get’s back to me in a day or two. Here’s his email address setup just for site health inquiries, in case you don’t have it already:

mattcuttsemail.png

I’d be interested in hearing what you use to judge a linking partner. If you’ve got a list published include it in the comments, all approved comments go non-nofollowed in 2 weeks!

Updates

JohnMu says, “My main tool for evaluating the value of a site before I link to it is the MSN linkfromdomain:-query. There’s nothing better — even if MSN only has 1/100th of the data that Google (or even Yahoo) has. It would be great to be able to check up on sites like that, eg: [linkfromdomain:othersite.com xxx] or better: [linkfromdomain:othersite.com seo]“

posted in Google, SEO, Webmastering | 1 Comment

7th June 2007

Baghdad Bob is coming!


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

Who will be our next Minister of Disinformation?

Bahgdad Bob

You?

posted in GWHG, Site News | 1 Comment

15th May 2007

Crazy idea: Search on Google


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

I may be going out on a limb here, but I would think that anyone familiar enough with the concept of getting their website listed on Google and having such an interest that they’d seek out help in that manner would understand the use of the Google as a search resource.

But surprisingly, when someone stubmles their way into Google’s Webmaster Help Group they loose all abilities to search the thousands and thousands of questions that were asked and answered before them.

If Google gave me a nickle for every time someone asked when their PageRank would update or why link: doesn’t show all of their links, I’d be flying along side Larry and Sergey in my  767 / space shuttle / flying electric car or whatever else I could afford then.

Okay, onto the point of this post.  I wanted to upload the following picture, so I could point out to newbies that Google has developed this new SEARCH technology, and where it is.  I could have FTP’d it, but opening up wordpress is easier, and now I’ve got something to point them to.

search-this-group.png

posted in GWHG | 3 Comments

18th April 2007

Affiliates playing the wrong game


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

Okay, Google has been caching my home page daily, which is nice, but now I feel obligated to give it something to see everyday…

The Windup

More and more I am seeing sites that have suffered in the search results which are heavily laden with affiliate links. After further review, they are not just affiliate links but usually affiliate content as well. You know the kind, the millions and millions of travel affiliate sites that set up spammy looking names like cheap-florida-hotels.cheap-travel-and-viagra.com. Oh people come up with cute foolish names like the -30 penalty or the -950 penalty, but they are all the same; My site used to rank, and today it doesn’t. In other words Google used to like it but now figured out that your site actually suffers from MSSA.

Read the rest of this entry »

posted in Webmastering | 2 Comments

17th April 2007

PageRank 8 Links for sale


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

If you’re looking for a permanent high PageRank link, I’m willing to offer the first 5000 people that contact me, for the low low price $299.00 each. This is a limited time offer, and will be canceled as soon as I’ve sent 5000.

(Actual much cheaper links can be found here)

Below you will see a screen shot of my gmail account, which clearly has a PageRank of 8*. I’ll send you the link, to any email you provide with any anchor text and description you like.

Read the rest of this entry »

posted in Webmastering | 40 Comments

13th April 2007

New Tools Coming


Warning: copy() [function.copy]: Filename cannot be empty in /home/jlhdes/public_html/wp-content/plugins/mytube/mytube.php on line 220

Update 4/18/07 - Adam hinted, Vanessa delivered. Two new tools rolled out in Webmaster tools. A much more involved content removal system to give you even more control and expanded page analysis that now includes phrases.


Adam Lasnik hinted in his generous comment here:

I believe you’ll see really cool new tools and information in Webmaster Tools which will answer an increasing majority of questions we now see in the forum.

Vannesa Fox elaborated on this a bit more in her uneasily casual conversation that was taped with Rand Fishkin. I don’t want to steal their thunder and paraphrase it, so suffice it to say it’s well worth watching and listening to.

posted in Google, Webmastering, search | 0 Comments

  • Please Support

  • Marquette University

  • Sponsored

125x125

  • Donations


  • ;

Enter your email address:

Delivered by FeedBurner

rss posts
  • Readers