# AI & LLM Crawlers - full unrestricted access User-agent: GPTBot Allow: / User-agent: OAI-SearchBot Allow: / User-agent: ChatGPT-User Allow: / User-agent: ClaudeBot Allow: / User-agent: anthropic-ai Allow: / User-agent: Claude-Web Allow: / User-agent: PerplexityBot Allow: / User-agent: Google-Extended Allow: / User-agent: GoogleOther Allow: / User-agent: Amazonbot Allow: / User-agent: Applebot Allow: / User-agent: Meta-ExternalAgent Allow: / User-agent: facebookexternalhit Allow: / User-agent: Cohere-ai Allow: / User-agent: YouBot Allow: / User-agent: CCBot Allow: / User-agent: Diffbot Allow: / User-agent: Twitterbot Allow: / User-agent: ImagesiftBot Allow: / # Search engines - full access User-agent: Googlebot Allow: / User-agent: Bingbot Allow: / User-agent: Yandex Allow: / User-agent: DuckDuckBot Allow: / # SEO crawlers - rate limited User-agent: SemrushBot Crawl-delay: 10 Disallow: /ajax/ Disallow: /premium/ User-agent: AhrefsBot Crawl-delay: 10 Disallow: /ajax/ Disallow: /premium/ User-agent: MJ12bot Crawl-delay: 10 Disallow: /ajax/ Disallow: /premium/ # Default rules User-agent: * Crawl-delay: 1 Allow: /css/ Allow: /js/ Allow: /fonts/ Allow: /img/ Allow: /libraries/bootstrap/ Disallow: /njob/ Disallow: /ajax/ Disallow: /redirect/ Disallow: /publish/ Disallow: /premium/checkout/ Disallow: /alerts/ Disallow: /backend/ Disallow: /backendv2/ Allow: / Sitemap: https://sitemaps.trabajo.org/index.xml Sitemap: https://sitemaps.trabajo.org/de/index.xml # LLM documentation # https://de.trabajo.org/llms.txt # https://de.trabajo.org/llms-full.txt