# robots.txt file
# www.mainegeneral.org
# Version 4.1-ALL, 11/25/09
# Adjusted 11-26-2009
# BASIC robots.txt file for sitemaker websites.
# All robots allowed, some pages excluded
# Must be named "robots.txt" and be in the root folder.

# ALL ROBOTS
User-agent: *
Disallow:
Crawl-delay: 5
Disallow: /blank
Disallow: /BLANK
Disallow: /print
Disallow: /PRINT
Disallow: /secure

# GOOGLEBOT
User-agent: Googlebot
Disallow:
Disallow: /blank
Disallow: /BLANK
Disallow: /print
Disallow: /PRINT
Disallow: /secure
Disallow: /?src=
Disallow: /body.cfm/
Disallow: /body.cfm.cfm
Disallow: /Body
Disallow: /Wide_body

# GSA
User-agent: gsa-crawler
Disallow:
Disallow: /blank
Disallow: /BLANK
Disallow: /print
Disallow: /PRINT
Disallow: /secure
Disallow: /1