Computing.Net > Forums > Web Development > web crawler robots

Computer Problems? Computing.Net has over 1,000,000 posts about all things technology related! Over 90% answered within 24 hours! Click here to start participating now! Also, be sure to check out the New User Guide.

web crawler robots

Reply to Message Icon

Name: patcoola
Date: April 18, 2006 at 01:41:56 Pacific
OS: Windows XP pro
CPU/Ram: P4m 2ghz/1gb DDR2-RAM
Comment:

the google crawler and other crawlers are crawling my website under both the domain name and the host user account url, because of the double urls the google crawler discards the pages it crawled, how do i set a robot.txt or .htaccess to do not crawl the host user url and only crawl the domian url?

ex: http://myhost.com/myusername/
// bad
http://www.mydomian.com/
// good



Sponsored Link
Ads by Google
Reply to Message Icon

Related Posts

See More







Post Locked

This post is quite old and has been locked from receiving new replies. Please create a new posting instead.


Go to Web Development Forum Home


Sponsored links

Ads by Google


Results for: web crawler robots

creating Web Crawler www.computing.net/answers/webdevel/creating-web-crawler/1791.html

Search engine for CMS?? www.computing.net/answers/webdevel/search-engine-for-cms/2808.html

Regarding search on GOOGLE www.computing.net/answers/webdevel/regarding-search-on-google/1235.html