AugmentClaude

Prometheus Configuration

Sets up Prometheus scraping, recording rules, and alerting with proven conventions.

Installation

  1. Make sure Claude is on your device and in your terminal.

    Skills load from ~/.claude/skills/ when Claude Code starts up — so you need it on your machine first. If you don't have it yet, install it once with the command below, then run claude in any terminal to verify.

    One-time setup
    npm i -g @anthropic-ai/claude-code

    Already have it? Skip ahead.

  2. Paste into Claude Code or into your terminal.

    This copies the whole skill folder into ~/.claude/skills/prometheus-configuration-wshobson/ — the SKILL.md plus any scripts, reference docs, or templates the skill ships with. Safe default: works for every skill.

    Faster alternative (instruction-only skills)

    Skips the clone and grabs only the SKILL.md file. Don't use this if the skill ships Python scripts, reference markdowns, or asset templates — they won't be downloaded and the skill will fail when it tries to load them.

    Quick install (SKILL.md only)
    Sign up to copy
  3. Restart Claude Code.

    Quit and reopen Claude Code (or any other agent that loads from ~/.claude/skills/). New skills are picked up on startup.

  4. Just ask Claude.

    Skills auto-activate when your request matches the skill's description — no slash command needed. Trigger phrases live in the skill's own frontmatter; you can read them in the “What this skill does” section above.

Prefer to read the source first? Open on GitHub.

When Claude uses it

Teaches Claude how to configure Prometheus for metric collection across infrastructure and applications: scrape configuration, service discovery, recording rules, alert rules, relabeling, retention, and high-availability setups. Reach for it when standing up monitoring, wiring up metric scraping, or building out alerting. Includes naming conventions, best-practice defaults, and curl-based troubleshooting commands for inspecting targets, config, and queries.

What this skill does

What it does: Gives Claude a reference for configuring Prometheus to collect, store, and alert on metrics from infrastructure and applications.

  • Covers the core workflow: scrape configuration, service discovery, recording rules, and alert rule design
  • Bakes in conventions like consistent metric naming (prefix_name_unit), 15-60s scrape intervals, relabeling for cleanup, and retention sizing
  • Recommends scaling and reliability patterns including multiple Prometheus instances, federation, and Thanos/Cortex for long-term storage
  • Provides troubleshooting commands to check scrape targets, verify the active config, and test PromQL queries against the API (e.g. query=up)
  • Points to a deeper references/details.md for worked examples, and links related skills for Grafana dashboards, SLOs, and distributed tracing

Related skills