Showing posts with label search engine spiders. Show all posts
Showing posts with label search engine spiders. Show all posts

May 17, 2010

What a Search Engines Spider Does?

The first thing that you need to understand is what a search engine "spider" is, and how it works. A "spider" (also known as a "robot" or "crawler") is a software program that search engines use to find what’s out there on the ever-changing web.
There are many types of spider in use, but for now, we’re only interested in the one that actually "crawls" the web finding pages. This is a somewhat oversimplified picture, but basically this program starts at a website, loads the pages, and follows the hyperlinks on each page. In this way, the theory goes, everything on the web will eventually be found, as the spider crawls from one website to another. Search engines may run thousands of instances of their web-crawling spider programs simultaneously, on multiple servers.
When a "crawler" visits one of your web pages, it loads the page’s contents into a database. Once a page has been fetched, the text of your page is loaded into the search engine’s index, which is a massive database of words, and where they occur on different web pages.
So there are really three steps. It starts with crawling (fetching pages), then indexing (breaking them down into words for the index), and a final step where the links (web page addresses / URLs) that are found get fed back into the crawling program to be retrieved.
When the spider (some of them will check later to verify that a page really is offline) doesn't find a page, it will eventually be deleted from the index. This is one reason why it’s important to use a reliable web hosting provider.

May 14, 2010

Black Hat SEO

Black Hat SEO search engine optimization is customarily defined as techniques that are used to get higher search rankings in an unethical manner. These black hat SEO techniques usually include one or more of the following characteristics:
  • Breaks search engine rules and regulations
  • Creates a poor user experience directly because of the black hat SEO techniques utilized on the Web site
  • Unethically presents content in a different visual or non-visual way to search engine spiders and search engine users.
A lot of what is known as black hat SEO actually used to be legit, but some folks went a bit overboard and now these techniques are frowned upon by the general SEO community at large. These black hat SEO practices will actually provide short-term gains in terms of rankings, but if you are discovered utilizing these spammy techniques on your Web site, you run the risk of being penalized by search engines. Black hat SEO basically is a short-sighted solution to a long-term problem, which is creating a Web site that provides both a great user experience and all that goes with that.
Black Hat SEO Techniques To Avoid
- Keyword stuffing: Packing long lists of keywords and nothing else onto your site will get you penalized eventually by search engines. Learn how to find and place keywords and phrases the right way on your Web site with my article titled Learn Where And How To Put Keywords In Your Site Pages.
- Invisible text: This is putting lists of keywords in white text on a white background in hopes of attracting more search engine spiders. Again, not a good way to attract searchers or search engine crawlers.
- Doorway Pages: A doorway page is basically a “fake” page that the user will never see. It is purely for search engine spiders, and attempts to trick them into indexing the site higher. Read more about doorway pages.
Black Hat SEO is tempting; after all, these tricks actually do work, temporarily. They do end up getting sites higher search rankings; that is, until these same sites get banned for using unethical practices. It’s just not worth the risk. Use efficient search engine optimization techniques to get your site ranked higher, and stay away from anything that even looks like Black Hat SEO.