###############################
#
# robots.txt file for American Sole
# updated AS.com 11/6/08 sc
#
# addresses all robots by using the wildcard *
#
User-agent: *
# list folders robots are not allowed to index
# these are based on the osCommerce robots.txt file contribution
# These settings will keep spiders from indexing your unwanted pages
# This assumes that your OSC install is in your web site's ROOT directory
# i.e., http://www.yoursite.com/index.php <- Use if this brings up your OSC main page
Disallow: /admin
Disallow: /account.php
Disallow: /advanced_search.php
Disallow: /checkout_confirmation.php
Disallow: /checkout_payment.php
Disallow: /checkout_payment_address.php
Disallow: /checkout_process.php
Disallow: /checkout_shipping.php
Disallow: /checkout_shipping_address.php
Disallow: /checkout_success.php
Disallow: /create_account.php
Disallow: /login.php
Disallow: /password_forgotten.php
Disallow: /popup_image.php
Disallow: /shopping_cart.php
#
# my directories to disallow
#
Disallow: /admin
Disallow: /dvl
Disallow: /includes
Disallow: /internal
Disallow: /cgi-bin
Disallow: /DOCS
Disallow: /download
Disallow: /entropybanner
Disallow: /extras
Disallow: /ext
Disallow: /tmp
##### have static pages for these #####
###Disallow: /index.php

# IF YOU DO NOT WISH TO HAVE THE GOOGLE IMAGE BOT SCAN YOUR DOMAIN FOR IMAGES
# THEN YOU CAN INCLUDE THE FOLLOWING IN YOUR ROBOTS FILE.
# I FOUND THAT MY BANDWIDTH USAGE DROPPED BY A MASSIVE AMOUNT AFTER I GOT RID
# OF THE GOOGLE IMAGE BOT. ALL I HAD WAS IMAGE HUNTERS STEALING PRODUCT SHOTS
# AND NOT EVEN BROWSING THE SITE.
User-agent: Googlebot-Image
Disallow: /
#
# list specific files robots are not allowed to index
#
# Disallow: /tutorials/meta_tags.html
# Disallow: /tutorials/custom_error_page.html
#
# End of robots.txt file
#
###############################