Webmaster Forums - Webmaster forum for HTML, PHP, ASP, CSS and more

05-27-2006, 03:42 AM   #1
JR1forums
Junior Member
 
Join Date: May 2006
Posts: 3
robots.txt

I have seen on archive.org that some sites have a robots.txt file on their server that prevents archiving. Two questions: How do I set up a robots.txt file on a Linux server? And will it also block past archives, or only future captures of my website (i.e. will past snapshots still be available)? Thanks!
06-02-2006, 07:18 AM   #2
Irka
Junior Member
 
Join Date: Jun 2006
Location: Vietnam
Posts: 1
Re: robots.txt

A robots.txt file tells search engine spiders and other crawlers which parts of your website they may not fetch. The thing that can help you prevent your website from being archived is the robots meta tag:

Quote:
<META name="robots" content="nofollow, index">

This meta tag tells robots (crawlers and spiders) whether they may index the web page, follow the links on it, or both. It can also tell them not to index the page or not to follow its links.

<META name="robots" content="noindex, nofollow">
Robots are not allowed to index the web page or follow its links.

<META name="robots" content="index, nofollow">
Robots are allowed to index the web page but not to follow its links.

<META name="googlebot" content="noindex, nofollow">
This one applies to Googlebot only: it is not allowed to index the web page or follow its links.

<META name="robots" content="all">
Robots are allowed to follow links and index the web page.

<META name="robots" content="none">
Robots are not allowed to index the web page or follow its links (same as "noindex, nofollow").

(Quoted from my website.)
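
On the robots.txt half of the original question: robots.txt is just a plain text file served from the root of the site (so it is reachable as /robots.txt), and it works the same way on a Linux server as anywhere else. Below is a minimal sketch, assuming archive.org's crawler identifies itself as ia_archiver (please check archive.org's own FAQ before relying on that name). The Python wrapper is only there to check what the rules do, using the standard library's urllib.robotparser; example.com and the helper name is_allowed are placeholders, not anything from this thread.

```python
from urllib.robotparser import RobotFileParser

# Contents you would save as "robots.txt" in the web root.
# Assumption: the archive crawler's user agent is "ia_archiver".
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /

User-agent: *
Disallow:
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if user_agent may fetch url under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "http://example.com/page.html"  # placeholder URL
    for agent in ("ia_archiver", "Googlebot"):
        print(agent, "allowed:", is_allowed(ROBOTS_TXT, agent, page))
```

An empty "Disallow:" line means nothing is blocked for that group, so here only the archive crawler is shut out. Whether already archived copies disappear as well depends on archive.org's policy, not on the file itself, so that part is worth confirming in their FAQ.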
06-02-2006, 08:26 AM   #3
ApeXX
Member
 
Join Date: May 2006
Location: 127.0.0.1
Posts: 108
Re: robots.txt

There is also some very good information on meta tags here: http://searchenginewatch.com/webmast...le.php/2167931.


And if you are having trouble creating the meta tags, you can use a free meta tag builder.
http://vancouver-webpages.com/META/mk-metas.html

Last edited by ApeXX : 06-02-2006 at 08:34 AM.

All times are GMT -4.