## GENERAL SETTINGS
## Enable robots.txt rules for all crawlers
User-agent: *
Sitemap: http://www.hoelzel-biotech.com/sitemap.xml
## DEVELOPMENT RELATED SETTINGS
## Do not crawl development files and folders: CVS, svn directories and dump files
Disallow: /CVS
Disallow: /*.svn
Disallow: /*.idea
Disallow: /*.sql
Disallow: /*.tgz
## GENERAL MAGENTO SETTINGS
## Do not crawl Magento admin page
Disallow: /admin/
## Do not crawl common Magento technical folders
Disallow: /*reviews
Disallow: /app/
Disallow: /downloader/
Disallow: /errors/
Disallow: /includes/
Disallow: /lib/
Disallow: /lhc/
Disallow: /old_tools/
Disallow: /sztools/
Disallow: /tools/
Disallow: /pkginfo/
Disallow: /shell/
Disallow: /var/
Disallow: /report/
Disallow: /tmp/
## Do not crawl common Magento files
Disallow: /api.php
Disallow: /cron.php
Disallow: /cron.sh
Disallow: /error_log
Disallow: /get.php
Disallow: /install.php
Disallow: /LICENSE.html
Disallow: /LICENSE.txt
Disallow: /LICENSE_AFL.txt
Disallow: /README.txt
Disallow: /RELEASE_NOTES.txt
## MAGENTO SEO IMPROVEMENTS
## Do not crawl sub category pages that are sorted or filtered.
Disallow: /*?
Disallow: /*?dir*
Disallow: /*?dir=desc
Disallow: /*?dir=asc
Disallow: /*?limit=all
Disallow: /*?mode*
## Do not crawl the second home page copy (example.com/index.php/). Use this rule only if Magento SEO URLs are activated.
Disallow: /index.php/
## Do not crawl links with session IDs
Disallow: /*?SID=
## Do not crawl checkout and user account pages
Disallow: /checkout/
Disallow: /onestepcheckout/
Disallow: /customer/
Disallow: /customer/account/
Disallow: /customer/account/login/
Disallow: /sendfriend/
Disallow: /tag/
Disallow: /review/
Disallow: /de/checkout/
Disallow: /de/onestepcheckout/
Disallow: /de/customer/
Disallow: /de/customer/account/
Disallow: /de/customer/account/login/
Disallow: /de/sendfriend/
Disallow: /de/tag/
Disallow: /de/review/
Disallow: /en/checkout/
Disallow: /en/onestepcheckout/
Disallow: /en/customer/
Disallow: /en/customer/account/
Disallow: /en/customer/account/login/
Disallow: /en/sendfriend/
Disallow: /en/tag/
Disallow: /en/review/
## Do not crawl search pages and non-SEO-optimized catalog links
Disallow: /catalogsearch/
Disallow: /catalog/product_compare/
Disallow: /catalog/category/view/
Disallow: /catalog/product/view/
Disallow: /pfullscreen/
Disallow: /quicklook/
Disallow: /de/catalogsearch/
Disallow: /de/catalog/product_compare/
Disallow: /de/catalog/category/view/
Disallow: /de/catalog/product/view/
Disallow: /de/pfullscreen/
Disallow: /de/quicklook/
Disallow: /en/catalogsearch/
Disallow: /en/catalog/product_compare/
Disallow: /en/catalog/category/view/
Disallow: /en/catalog/product/view/
Disallow: /en/pfullscreen/
Disallow: /en/quicklook/
## Do not crawl availability check
## Disallow: /productalert/add/stock/product_id/
## SERVER SETTINGS
## Do not crawl common server technical folders and files
Disallow: /cgi-bin/
Disallow: /cleanup.php
Disallow: /apc.php
Disallow: /memcache.php
Disallow: /phpinfo.php

## IMAGE CRAWLERS SETTINGS
## Extra: use these rules only if you do not wish Google and Bing to index your images
User-agent: Googlebot-Image
Disallow: /

User-agent: msnbot-media
Disallow: /
Disallow: /*.php$