Robots.txt Tester
Inspect how smartcopons.com controls crawler access, blocked paths, sitemap references and AI crawler rules.
Robots.txt Status
Robots.txt Status
Present
Score
68
/100
ยท Needs review
View Full Robots.txt
Robots.txt Content Preview
User-agent: * # Block WordPress admin (keep ajax allowed) Disallow: /wp-admin/ Allow: /wp-admin/admin-ajax.php # Block RSS/Atom feeds Disallow: /feed/ Disallow: */feed/ Disallow: */feed/rss2/ # Block WordPress system pages Disallow: /wp-login.php Disallow: /wp-json/ Disallow: /wp-content/plugins/ # Block user account & form pages Disallow: /my-account/ Disallow: /checkout/ Disallow: /submit-coupon/ # Block short URL patterns (not real content) Disallow: /str/ Disallow: /cpn/ # Block search results Disallow: /search/ # Block URL parameters (duplicate pages) Disallow: /*?fsp_sid= Disallow: /*?utm_source= Disallow: /*?noamp= Disallow: /*?amp= Disallow: /*?sout= # Main Site Sitemap: https://smartcopons.com/sitemap_index.xml # Subdomain Sitemaps Sitemap: https://iherbdiscountcode.smartcopons.com/sitemap.xml Sitemap: https://namshidiscountcode.smartcopons.com/sitemap.xml Sitemap: https://noondiscountcode.smartcopons.com/sitemap.xml Sitemap: https://kinguindiscountcode.smartcopons.com/sitemap.xml Sitemap: https://magrabidiscountcode.smartcopons.com/sitemap.xml Sitemap: https://vogaclosetdiscountcode.smartcopons.com/sitemap.xml Sitemap: https://us.smartcopons.com/sitemap.xml Sitemap: https://sa.smartcopons.com/sitemap.xml Sitemap: https://travel.smartcopons.com/sitemap.xml # AI SEARCH CRAWLERS (ALLOWED) User-agent: GPTBot Allow: / User-agent: OAI-SearchBot Allow: / User-agent: ChatGPT-User Allow: / User-agent: ClaudeBot Allow: / User-agent: PerplexityBot Allow: / # AI TRAINING CRAWLERS (BLOCKED) User-agent: CCBot Disallow: / User-agent: anthropic-ai Disallow: / User-agent: Bytespider Disallow: / User-agent: Google-Extended Disallow: /
User-agent Rules
| User-agent(s) | Allowed paths | Disallowed paths |
|---|---|---|
| * |
|
|
| gptbot |
|
No explicit Disallow rules. |
| oai-searchbot |
|
No explicit Disallow rules. |
| chatgpt-user |
|
No explicit Disallow rules. |
| claudebot |
|
No explicit Disallow rules. |
| perplexitybot |
|
No explicit Disallow rules. |
| ccbot | No explicit Allow rules. |
|
| anthropic-ai | No explicit Allow rules. |
|
| bytespider | No explicit Allow rules. |
|
| google-extended | No explicit Allow rules. |
|
Blocked and Allowed Paths
| Blocked paths |
|
|---|---|
| Allowed paths |
|
| Crawl-delay | No Crawl-delay directive detected. |
Sitemaps Detected
- https://smartcopons.com/sitemap_index.xml
- https://iherbdiscountcode.smartcopons.com/sitemap.xml
- https://namshidiscountcode.smartcopons.com/sitemap.xml
- https://noondiscountcode.smartcopons.com/sitemap.xml
- https://kinguindiscountcode.smartcopons.com/sitemap.xml
- https://magrabidiscountcode.smartcopons.com/sitemap.xml
- https://vogaclosetdiscountcode.smartcopons.com/sitemap.xml
- https://us.smartcopons.com/sitemap.xml
- https://sa.smartcopons.com/sitemap.xml
- https://travel.smartcopons.com/sitemap.xml
AI Crawler Policy
At least one AI crawler (such as GPTBot, ChatGPT-User, ClaudeBot, PerplexityBot or Google-Extended) appears to be blocked by robots.txt.
Issues Found
- At least one user-agent has Disallow: / which blocks the entire site.
- Sitemap URL appears to be unreachable: https://namshidiscountcode.smartcopons.com/sitemap.xml
- Wildcard (*) patterns should typically be used with explicit paths, e.g. /path/*.
- Blocking CSS or JS may prevent search engines from rendering pages correctly.
Recommendations
- Avoid blocking the entire site (Disallow: /); restrict only sensitive or low-value paths instead.
- Review your AI crawler policy for GPTBot, ChatGPT-User, ClaudeBot, PerplexityBot and Google-Extended to ensure it matches your content strategy.
- Ensure important pages, CSS and JavaScript assets are crawlable so search engines can fully render your site.