findarticlesplus.com
Harnessing the Power of Robots.txt

 
Once we have a website up and running, we need to make sure that all visiting search engines can access all the pages we want them to look at.

Sometimes we may want search engines not to index certain parts of the site, or we may even want to ban particular search engines from the site altogether.

This is where a simple little two-line text file called robots.txt comes in.

Robots.txt resides in your website's main directory (on Linux systems this is often your /public_html/ directory) and looks something like the following:

User-agent: *
Disallow:

The first line names the 'bot' the rule applies to (an asterisk means every bot). The second line lists the part of the site that bot is not allowed to visit; leaving it empty, as above, means nothing is off limits.
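To see how those two lines are interpreted, here is a minimal sketch using Python's standard urllib.robotparser module. The helper name `allowed`, the bot name "anybot", the /private/ directory and the example.com URLs are all made up for illustration; they do not come from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents used only for this illustration.
ALLOW_ALL = "User-agent: *\nDisallow:\n"
BLOCK_PRIVATE = "User-agent: *\nDisallow: /private/\n"


def allowed(robots_txt, agent, url):
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    page = "http://example.com/private/notes.html"
    # An empty Disallow line blocks nothing.
    print(allowed(ALLOW_ALL, "anybot", page))
    # A Disallow path blocks everything underneath it, but nothing else.
    print(allowed(BLOCK_PRIVATE, "anybot", page))
    print(allowed(BLOCK_PRIVATE, "anybot", "http://example.com/index.html"))
```

Checking a draft this way before uploading it is a cheap way to catch a stray slash that would otherwise lock every bot out.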

If you want to handle multiple 'bots', simply repeat the above lines for each one.
For example:

User-agent: googlebot
Disallow:

User-agent: askjeeves
Disallow: /

This will allow Google (user-agent name GoogleBot) to visit every page and directory, while at the same time banning Ask Jeeves from the site completely.
To find a 'reasonably' up-to-date list of robot user-agent names, visit http://www.robotstxt.org/wc/active/html/index.html
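The two-bot example can be checked the same way. The sketch below feeds those exact lines to Python's standard urllib.robotparser and reports what each bot may fetch. The function name `access_report`, the extra bot "somebot" and the example.com URL are assumptions added for illustration. Real crawlers may match user-agent names differently from this parser, so treat the result as a sanity check rather than a guarantee.

```python
from urllib.robotparser import RobotFileParser

# The multi-bot example from the article, as one string.
ROBOTS_TXT = """\
User-agent: googlebot
Disallow:

User-agent: askjeeves
Disallow: /
"""


def access_report(robots_txt, agents, url):
    """Map each agent name to True/False: may it fetch `url`?"""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


if __name__ == "__main__":
    report = access_report(
        ROBOTS_TXT,
        ["googlebot", "askjeeves", "somebot"],  # "somebot" is hypothetical
        "http://example.com/page.html",
    )
    for agent, ok in report.items():
        print(agent, "allowed" if ok else "blocked")
```

Note that a bot with no matching entry, like the made-up "somebot" here, is treated as allowed, because nothing in the file tells it otherwise.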

Even if you want to allow every robot to index every page of your site, it's still very advisable to put a robots.txt file on your site. It will stop your error logs filling up with "not found" entries from search engines requesting a robots.txt file that doesn't exist.

For more information, see the full list of resources about robots.txt at http://www.websitesecrets101.com/robotstxt-further-reading-resources
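If you manage several sites, a small script can drop in the allow-all file wherever one is missing. This is only a sketch: the function name `ensure_robots_txt` is made up, and the demo writes to a temporary folder standing in for your real web directory.

```python
import tempfile
from pathlib import Path

# The allow-all rules shown at the start of the article.
ALLOW_ALL = "User-agent: *\nDisallow:\n"


def ensure_robots_txt(web_root):
    """Create an allow-all robots.txt in `web_root` unless one already exists.

    Returns the path to the file. An existing file is never overwritten.
    """
    target = Path(web_root) / "robots.txt"
    if not target.exists():
        target.write_text(ALLOW_ALL, encoding="ascii")
    return target


if __name__ == "__main__":
    # A temporary directory stands in for something like /public_html/.
    with tempfile.TemporaryDirectory() as web_root:
        path = ensure_robots_txt(web_root)
        print(path.read_text(encoding="ascii"))
```

Because the function leaves an existing file alone, it is safe to run against a site that already has hand-written rules.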

Author: Bruce Hearder
 
Author Bio:

Bruce Hearder owns and runs www.online-money101.com. Sign up for the Online-Money101 newsletter and learn the simple techniques that everyday people use to make money on the web, every single day. Visit www.online-money101.com today.

 
 
 

© 2006-2008 www.findarticlesplus.com All Rights Reserved Worldwide.