Robots.txt Tester

Inspect how zainulabideen-portfolio.netlify.app controls crawler access, blocked paths, sitemap references and AI crawler rules.

Preview

Score: 100

  • Document your AI crawler policy explicitly in robots.txt so future bots know how to treat your content.
  • Ensure important pages, CSS and JavaScript assets are crawlable so search engines can fully render your site.

Get your full report + exact fixes

See what’s hurting your SEO and how to fix it step by step.

  • Full breakdown
  • Actionable fixes
  • Prioritized next steps

No spam. One email with your report and next steps.

Robots.txt Status

Robots.txt Status Present Score 100 /100 · Strong
Domain zainulabideen-portfolio.netlify.app
Last analyzed June 12, 2026

View Full Robots.txt

Robots.txt Content Preview

# ============================================================================
#  robots.txt — Zain Ul Abideen Portfolio
#  https://zainulabideen-portfolio.netlify.app
# ============================================================================

# ─── Sitemap (must come first for some crawlers) ────────────────────────────
Sitemap: https://zainulabideen-portfolio.netlify.app/sitemap.xml

# ─── Default: allow everything for all crawlers ──────────────────────────────
User-agent: *
Allow: /
Disallow: /api/
Disallow: /*?*utm_
Disallow: /*?*ref=
Disallow: /*?*fbclid=
Disallow: /*?*gclid=

# ─── Explicit allowlists for public media (helps page-rendering bots) ────────
Allow: /images/
Allow: /models/
Allow: /assets/
Allow: /poppins.woff2
Allow: /sitemap.xml
Allow: /robots.txt
Allow: /site.webmanifest
Allow: /humans.txt
Allow: /.well-known/security.txt

# ============================================================================
#  Search-engine crawlers — priority bots
# ============================================================================

# Google
User-agent: Googlebot
Allow: /

User-agent: Googlebot-Image
Allow: /images/
Allow: /assets/

User-agent: Googlebot-Mobile
Allow: /

User-agent: Googlebot-News
Allow: /

User-agent: AdsBot-Google
Allow: /

User-agent: Mediapartners-Google
Allow: /

# Bing
User-agent: Bingbot
Allow: /

User-agent: AdIdxBot
Allow: /

# Yahoo / Slurp
User-agent: Slurp
Allow: /

# DuckDuckGo
User-agent: DuckDuckBot
Allow: /

# Yandex
User-agent: YandexBot
Allow: /

# Baidu
User-agent: Baiduspider
Allow: /

# Naver
User-agent: Yeti
Allow: /

# Seznam
User-agent: SeznamBot
Allow: /

# Apple Spotlight / Siri
User-agent: Applebot
Allow: /

# ============================================================================
#  Social / preview crawlers (for OG cards, rich previews)
# ============================================================================
User-agent: LinkedInBot
Allow: /

User-agent: facebookexternalhit
Allow: /

User-agent: Facebot
Allow: /

User-agent: Twitterbot
Allow: /

User-agent: WhatsApp
Allow: /

User-agent: TelegramBot
Allow: /

User-agent: Discordbot
Allow: /

User-agent: SkypeUriPreview
Allow: /

User-agent: Slackbot
Allow: /

User-agent: Slackbot-LinkExpanding
Allow: /

User-agent: Pinterest
Allow: /

User-agent: redditbot
Allow: /

# ============================================================================
#  AI / LLM crawlers — explicitly allowed (helps surface portfolio in AI answers)
# ============================================================================
User-agent: GPTBot
Allow: /

User-agent: OAI-SearchBot
Allow: /

User-agent: ChatGPT-User
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: Claude-Web
Allow: /

User-agent: anthropic-ai
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: Perplexity-User
Allow: /

User-agent: Google-Extended
Allow: /

User-agent: Meta-ExternalAgent
Allow: /

User-agent: Meta-ExternalFetcher
Allow: /

User-agent: cohere-ai
Allow: /

User-agent: Bytespider
Allow: /

User-agent: Amazonbot
Allow: /

User-agent: Diffbot
Allow: /

User-agent: DuckAssistBot
Allow: /

User-agent: YouBot
Allow: /

User-agent: Kagibot
Allow: /

# ============================================================================
#  Archive crawlers
# ============================================================================
User-agent: ia_archiver
Allow: /

User-agent: archive.org_bot
Allow: /

# ============================================================================
#  Block known aggressive / low-value scrapers
# ============================================================================
User-agent: AhrefsBot
Crawl-delay: 10

User-agent: SemrushBot
Crawl-delay: 10

User-agent: MJ12bot
Crawl-delay: 10

User-agent: DotBot
Crawl-delay: 10

User-agent Rules

User-agent(s) Allowed paths Disallowed paths
*
  • /
  • /images/
  • /models/
  • /assets/
  • /poppins.woff2
  • /sitemap.xml
  • /robots.txt
  • /site.webmanifest
  • /humans.txt
  • /.well-known/security.txt
  • /api/
  • /*?*utm_
  • /*?*ref=
  • /*?*fbclid=
  • /*?*gclid=
googlebot
  • /
No explicit Disallow rules.
googlebot-image
  • /images/
  • /assets/
No explicit Disallow rules.
googlebot-mobile
  • /
No explicit Disallow rules.
googlebot-news
  • /
No explicit Disallow rules.
adsbot-google
  • /
No explicit Disallow rules.
mediapartners-google
  • /
No explicit Disallow rules.
bingbot
  • /
No explicit Disallow rules.
adidxbot
  • /
No explicit Disallow rules.
slurp
  • /
No explicit Disallow rules.
duckduckbot
  • /
No explicit Disallow rules.
yandexbot
  • /
No explicit Disallow rules.
baiduspider
  • /
No explicit Disallow rules.
yeti
  • /
No explicit Disallow rules.
seznambot
  • /
No explicit Disallow rules.
applebot
  • /
No explicit Disallow rules.
linkedinbot
  • /
No explicit Disallow rules.
facebookexternalhit
  • /
No explicit Disallow rules.
facebot
  • /
No explicit Disallow rules.
twitterbot
  • /
No explicit Disallow rules.
whatsapp
  • /
No explicit Disallow rules.
telegrambot
  • /
No explicit Disallow rules.
discordbot
  • /
No explicit Disallow rules.
skypeuripreview
  • /
No explicit Disallow rules.
slackbot
  • /
No explicit Disallow rules.
slackbot-linkexpanding
  • /
No explicit Disallow rules.
pinterest
  • /
No explicit Disallow rules.
redditbot
  • /
No explicit Disallow rules.
gptbot
  • /
No explicit Disallow rules.
oai-searchbot
  • /
No explicit Disallow rules.
chatgpt-user
  • /
No explicit Disallow rules.
claudebot
  • /
No explicit Disallow rules.
claude-web
  • /
No explicit Disallow rules.
anthropic-ai
  • /
No explicit Disallow rules.
perplexitybot
  • /
No explicit Disallow rules.
perplexity-user
  • /
No explicit Disallow rules.
google-extended
  • /
No explicit Disallow rules.
meta-externalagent
  • /
No explicit Disallow rules.
meta-externalfetcher
  • /
No explicit Disallow rules.
cohere-ai
  • /
No explicit Disallow rules.
bytespider
  • /
No explicit Disallow rules.
amazonbot
  • /
No explicit Disallow rules.
diffbot
  • /
No explicit Disallow rules.
duckassistbot
  • /
No explicit Disallow rules.
youbot
  • /
No explicit Disallow rules.
kagibot
  • /
No explicit Disallow rules.
ia_archiver
  • /
No explicit Disallow rules.
archive.org_bot
  • /
No explicit Disallow rules.
ahrefsbot No explicit Allow rules. No explicit Disallow rules.
semrushbot No explicit Allow rules. No explicit Disallow rules.
mj12bot No explicit Allow rules. No explicit Disallow rules.
dotbot No explicit Allow rules. No explicit Disallow rules.

Blocked and Allowed Paths

Blocked paths
  • /api/
  • /*?*utm_
  • /*?*ref=
  • /*?*fbclid=
  • /*?*gclid=
Allowed paths
  • /
  • /images/
  • /models/
  • /assets/
  • /poppins.woff2
  • /sitemap.xml
  • /robots.txt
  • /site.webmanifest
  • /humans.txt
  • /.well-known/security.txt
Crawl-delay 10.0 seconds

Sitemaps Detected

AI Crawler Policy

No explicit blocks were detected for common AI crawlers (GPTBot, ChatGPT-User, ClaudeBot, PerplexityBot, Google-Extended).

Recommendations

  • Document your AI crawler policy explicitly in robots.txt so future bots know how to treat your content.
  • Ensure important pages, CSS and JavaScript assets are crawlable so search engines can fully render your site.

Analyze this site with other tools

Want a website that actually generates leads?

Start a conversion-focused website project with a team that builds fast, SEO-optimized sites for real businesses.