SEOClerks

What Is The Robots.txt File?



stayonline
Web site owners use the /robots.txt file to give instructions about their site to web robots; this is called the Robots Exclusion Protocol.

It works like this: a robot wants to visit a Web site URL, say http://www.example.com/welcome.html. Before it does so, it first checks for http://www.example.com/robots.txt, and finds:

User-agent: *
Disallow: /

The "User-agent: *" means this section applies to all robots. The "Disallow: /" tells the robot that it should not visit any pages on the site.
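
As a minimal sketch of the check a well-behaved robot performs, Python's standard-library urllib.robotparser can parse the two lines above and report whether a URL may be fetched. The robot name "ExampleBot" and the helper is_allowed are made up for illustration, and the rules are parsed from a string here instead of being downloaded from the site.

```python
from urllib.robotparser import RobotFileParser

# The robots.txt content from the example above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # "ExampleBot" is a hypothetical robot name used only for illustration.
    url = "http://www.example.com/welcome.html"
    print(is_allowed(ROBOTS_TXT, "ExampleBot", url))  # prints False
```

A robot that honors the protocol would skip the page when this returns False. Nothing forces a robot to run such a check at all.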

There are two important considerations when using /robots.txt:

  • Robots can ignore your /robots.txt. Malware robots that scan the web for security vulnerabilities, and email address harvesters used by spammers, will pay no attention to it.
  • The /robots.txt file is publicly available. Anyone can see which sections of your server you don't want robots to use.

So don't try to use /robots.txt to hide information.
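
To see why, here is a rough sketch of how little effort it takes to read the "hidden" sections out of a robots.txt file. The paths /admin/ and /backup/ and the helper disallowed_paths are hypothetical, chosen only to illustrate the point.

```python
from typing import List

# A hypothetical robots.txt; the paths below are invented for illustration.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/    # meant to be kept away from robots
Disallow: /backup/
"""


def disallowed_paths(robots_txt: str) -> List[str]:
    """Collect every non-empty Disallow value, in file order."""
    paths = []
    for line in robots_txt.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "disallow" and value.strip():
            paths.append(value.strip())
    return paths


if __name__ == "__main__":
    print(disallowed_paths(SAMPLE_ROBOTS_TXT))  # prints ['/admin/', '/backup/']
```

Anyone who downloads the file gets the same list, so a Disallow line points people at a sensitive path instead of protecting it.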




spyindiadanish
Thanks for sharing such a nice memory....




jaysh4922
robots.txt is used in on-page SEO to give instructions to web robots, also known as web wanderers, crawlers, or spiders. A robot is a program that traverses websites automatically, which is how popular search engines like Google index a website and its content.




johnniewalk
The robots.txt file is an option you can use, for example alongside a search engine's webmaster tools, when you want to tell search engines whether or not to crawl a particular image.




josephinek1
robots.txt:

It is a plain-text file placed at the root of a website (not an HTML tag in a page's source) that tells search engine spiders which files they may or may not crawl.




patco444
It's a file that tells Google and other search engines which inner pages of your website they may crawl.


