# sortOTT robots.txt # Allow real search engines (they send users back to the site) # Block AI training crawlers (they consume bandwidth + function quota without sending traffic) # ─── Allowed: real search engines ─── User-agent: Googlebot Allow: / User-agent: Bingbot Allow: / User-agent: DuckDuckBot Allow: / User-agent: Slurp Allow: / User-agent: Yandex Allow: / User-agent: Baiduspider Allow: / User-agent: facebookexternalhit Allow: / User-agent: Twitterbot Allow: / User-agent: LinkedInBot Allow: / User-agent: WhatsApp Allow: / User-agent: TelegramBot Allow: / # ─── Blocked: AI training crawlers (no user traffic, just consume resources) ─── User-agent: GPTBot Disallow: / User-agent: ChatGPT-User Disallow: / User-agent: OAI-SearchBot Disallow: / User-agent: ClaudeBot Disallow: / User-agent: anthropic-ai Disallow: / User-agent: Claude-Web Disallow: / User-agent: PerplexityBot Disallow: / User-agent: Perplexity-User Disallow: / User-agent: Bytespider Disallow: / User-agent: Amazonbot Disallow: / User-agent: CCBot Disallow: / User-agent: cohere-ai Disallow: / User-agent: Diffbot Disallow: / User-agent: ImagesiftBot Disallow: / User-agent: Omgilibot Disallow: / User-agent: FacebookBot Disallow: / User-agent: Applebot-Extended Disallow: / # ─── Catch-all: anything else, allow ─── User-agent: * Allow: / Sitemap: https://sortott.com/sitemap.xml