OpenAI

Advertiser Guidance for Allowing OpenAI Web Crawlers

Updated: 3 days ago

Why does OpenAI use web crawlers?

We use crawlers to validate the safety of web pages submitted as ads on ChatGPT. When you submit an ad, OpenAI may visit the landing page to ensure it complies with our policies. We may also use content from the landing page to determine when it’s most relevant to show the ad to users.

Which OpenAI crawlers should you allow?

You must allow OAI-AdsBot. We recommend allowing both OAI-AdsBot and OAI-SearchBot

OpenAI crawlers fail to crawl my website. What should I do?

Most websites have multiple layers of protection before a crawler can successfully access a webpage. We recommend working with your engineering/security team to validate that OpenAI crawlers can pass through each of the following layers:

1. robots.txt

Overview: The robots.txt file tells crawlers whether they are permitted to access certain parts of your website. OpenAI crawlers respect these rules. If access is disallowed in robots.txt, crawling will stop immediately.


Recommendation: Review your robots.txt configuration and confirm that OpenAI crawlers are explicitly allowed to access the relevant pages and paths.

User-agent: OAI-SearchBot

Allow: /

User-agent: OAI-AdsBot

Allow: /

2. Web Protection / Bot Mitigation

Overview: Many websites use services such as Cloudflare, Akamai, or other web protection providers to defend against DDoS attacks, scraping, and unauthorized traffic. These systems can mistakenly block legitimate crawlers, often returning 403 Forbidden errors. Because OpenAI crawlers can resemble automated traffic patterns, they may be denied unless specifically allowlisted.


Recommendation: Review your web protection or firewall configuration and allowlist OpenAI crawler traffic where possible, ideally based on our crawler user agents. Your engineering or infrastructure team should also inspect any automated bot mitigation rules that could be triggering false positives.

3. Human Verification / Anti-Bot Logic

Overview: Some websites implement additional application-level checks to verify that a visitor is human (for example: CAPTCHAs, JavaScript challenges, behavioral analysis, or session validation). Since OpenAI crawlers are automated systems, these checks may block access even if the crawler successfully passes earlier layers.


Recommendation: Review any human-verification or anti-automation logic implemented within your application and ensure OpenAI crawlers are exempted where appropriate, ideally by allowlisting our crawler user agents.

A Note on Stable IP Ranges

Some security systems require crawler traffic to originate from stable, publicly documented IP ranges before traffic can be reliably allowlisted.

Because crawler infrastructure may evolve over time, your engineering team should avoid relying solely on short-term IP observations from logs. Instead, we recommend validating traffic through a combination of: user-agent identification, verified bot programs (where supported), firewall allowlists, robots.txt behavior, and provider-level bot verification systems.

If you must allow a stable list of IP ranges, please reference:

A Note on Rate Limiting

Large batch uploads or sudden spikes in crawler traffic can sometimes trigger automated rate limiting or bot protection systems.

If you suspect rate limiting is occurring, ask your engineering team to review:

  • HTTP response codes (especially 429 Too Many Requests)

  • Firewall or CDN logs

  • Bot mitigation events

  • Request throttling rules

  • Traffic analytics around the time the crawler attempted access

This can help identify whether requests are being intentionally slowed or blocked by infrastructure protections.

You may also consider uploading ads across more time in smaller batches.

A Note on Cloudflare

OAI-AdsBot is now officially verified and allowlisted by Cloudflare.

Was this article helpful?