Can AI find lululemon.com?

Reportlululemon.com·checked 2026-05-21 20:32 UTC·methodology v0.1 (preview)·canaifind.com/r/QVjecwCT

PartialSome fundamentals in place; high-leverage gaps identified.

This is a static-scan check (robots.txt + llms.txt + schema.org + headers). Live engine probes across ChatGPT, Claude, Gemini, and Perplexity arrive in a future build — currently in queue. Real visibility lives in category and comparison queries, which we measure with a 100-prompt stratified set on the Audit tier.

╴ Check your own domain

Same scan, free, no signup. Results in ~5 seconds at your own permanent canaifind.com/r/{slug} URL.

AI crawler robots.txt audit

§1 of 4

OpenAI

GPTBot	Training crawler for future OpenAI models.	? Unknown (fetch blocked)
OAI-SearchBot	ChatGPT Search index. Disallowing makes you invisible to ChatGPT Search.	? Unknown (fetch blocked)
ChatGPT-User	User-initiated retrieval. Ignores robots.txt by design.	— Ignores robots.txt

Anthropic

ClaudeBot	Training crawler for Anthropic models.	? Unknown (fetch blocked)
Claude-User	Retrieves pages when a Claude user asks about them. Respects robots.txt (unlike OpenAI's ChatGPT-User).	? Unknown (fetch blocked)
Claude-SearchBot	Search index for Claude. Disallowing reduces Claude search quality.	? Unknown (fetch blocked)
claude-code	Claude Code CLI / IDE retrieval. Documentation-targeted.	? Unknown (fetch blocked)

Perplexity

PerplexityBot	Perplexity indexing. Disallowing removes you from Perplexity retrieval.	? Unknown (fetch blocked)
Perplexity-User	User-initiated retrieval. Ignores robots.txt by design.	— Ignores robots.txt

Google

Google-Extended	Training opt-out for Gemini / Bard. Disallowing opts you out of Google AI training.	? Unknown (fetch blocked)
GoogleOther	Catch-all for non-Search Google crawlers.	? Unknown (fetch blocked)

Structured data & discovery files

§2 of 4

Artifact	Status	Note
llms.txt	✗ Missing	Anthropic Claude respects this; Google has confirmed it does not; OpenAI is unconfirmed.
llms-full.txt	✗ Missing	Optional full-content companion file.

Artifact	Status	Note
schema.org Organization	✗ Missing	Entity anchor for the sameAs graph.
schema.org FAQPage	✗ Missing	2.7× citation rate vs without (Relixir 2025) — highest-leverage single fix.
schema.org Article	✗ Missing	For editorial pages.
schema.org HowTo	✗ Missing	For tutorials.
schema.org SoftwareApplication	✗ Missing	For product pages.
Person (author entity)	✗ Missing	E-E-A-T signal on bylines.

HTTP headers

§3 of 4

Could not fetch the homepage (HTTP 0). Skipping HTTP header checks.

Top findings

§4 of 4

1
Could not fetch robots.txt.
The request for lululemon.com/robots.txt failed: the origin did not respond within 5s. We cannot make claims about per-crawler access until we can read the file. AI retrieval crawlers running from datacenter IPs may face the same outcome.
Med
2
Could not fetch the homepage.
The request for https://lululemon.com/ failed: the origin did not respond within 5s. AI retrieval crawlers may face the same outcome from datacenter IPs — we can't audit schema.org markup until the page is reachable.
Med

╴ Share this report

This report has a permanent URL: canaifind.com/r/QVjecwCT. Screenshot, drop in Slack, quote-tweet, or send to whoever's going to ask. That's how this tool finds the next person who needs it.

AI crawler robots.txt audit

Structured data & discovery files

HTTP headers

Top findings

Could not fetch robots.txt.

Could not fetch the homepage.