AugmentClaude

SitemapKit

Discover and extract all URLs from any website's sitemaps.

Installation

  1. Make sure Claude is on your device and in your terminal.

    Skills load from ~/.claude/skills/ when Claude Code starts up — so you need it on your machine first. If you don't have it yet, install it once with the command below, then run claude in any terminal to verify.

    One-time setup
    npm i -g @anthropic-ai/claude-code

    Already have it? Skip ahead.

  2. Paste into Claude Code or into your terminal.

    This copies the whole skill folder into ~/.claude/skills/sitemapkit-aiskillstore/ — the SKILL.md plus any scripts, reference docs, or templates the skill ships with. Safe default: works for every skill.

    Faster alternative (instruction-only skills)

    Skips the clone and grabs only the SKILL.md file. Don't use this if the skill ships Python scripts, reference markdowns, or asset templates — they won't be downloaded and the skill will fail when it tries to load them.

    Quick install (SKILL.md only)
    Sign up to copy
  3. Restart Claude Code.

    Quit and reopen Claude Code (or any other agent that loads from ~/.claude/skills/). New skills are picked up on startup.

  4. Just ask Claude.

    Skills auto-activate when your request matches the skill's description — no slash command needed. Trigger phrases live in the skill's own frontmatter; you can read them in the “What this skill does” section above.

Prefer to read the source first? Open on GitHub.

When Claude uses it

Discover and extract sitemaps from any website using SitemapKit. Use this skill whenever the user wants to find pages on a website, get a list of URLs from a domain, audit a site's structure, crawl a sitemap, check what pages exist on a site, or do anything involving sitemaps or site URL discovery — even if they don't explicitly say "sitemap". Requires the sitemapkit MCP server configured with a valid SITEMAPKIT_API_KEY.

What this skill does

SitemapKit

Use the SitemapKit MCP tools to discover and extract URLs from any website's sitemaps.

Tools available

  • discover_sitemaps — finds all sitemap files for a domain (checks robots.txt, common paths, sitemap indexes). Use this first when you just want to know what sitemaps exist.
  • extract_sitemap — fetches all URLs from a specific sitemap URL. Use when the user gives you a direct sitemap URL.
  • full_crawl — discovers all sitemaps for a domain and returns every URL across all of them in one call. Use this when the user wants the complete list of pages on a site.

When to use which tool

User saysUse
"find sitemaps for X" / "does X have a sitemap?"discover_sitemaps
"extract URLs from X/sitemap.xml"extract_sitemap
"get all pages on X" / "crawl X" / "list all URLs on X"full_crawl

Usage guidelines

  • Always pass a full URL including protocol: https://example.com
  • full_crawl and discover_sitemaps only use the domain — paths are ignored
  • extract_sitemap needs the exact sitemap URL, e.g. https://example.com/sitemap.xml
  • Default max_urls is 1000. If the user wants more, pass a higher value (up to plan limit)
  • If truncated: true appears in the result, tell the user there are more URLs and suggest increasing max_urls
  • Check meta.quota.remaining in the response — if it's low, warn the user proactively

Error handling

ErrorWhat to tell the user
UnauthorizedAPI key is missing or invalid. Get one at https://app.sitemapkit.com/settings/api
Monthly quota exceededPlan limit reached. Upgrade at https://sitemapkit.com/pricing
Rate limit exceededToo many requests per minute. Wait and retry — the response includes a retryAfter timestamp

Example interactions

"What pages does stripe.com have?" → Call full_crawl with url: "https://stripe.com", present the URL list.

"Find all sitemaps for shopify.com" → Call discover_sitemaps with url: "https://shopify.com", list the sitemap URLs found and which sources they came from (robots.txt, common paths, etc.).

"Extract https://example.com/sitemap-posts.xml" → Call extract_sitemap with url: "https://example.com/sitemap-posts.xml", present the URLs with lastmod dates if available.

"How many pages does vercel.com have?" → Call full_crawl, report totalUrls and whether the result was truncated.

Related skills