OSINT

Wayback CDX URL inventory API

Build a deduplicated, filtered inventory of every historical URL the Internet Archive has captured for a domain, using the keyless Wayback CDX API. Each URL is returned with its first and last capture timestamps and its snapshot count, alongside summaries by status code and file extension. Unlike a lookup of the single closest snapshot, this maps the full archived crawl surface; the value added over the raw CDX output is the normalization and aggregation. It answers queries such as 'all archived URLs for this domain', 'historical pages of X', 'wayback URL inventory', and 'what paths did this site expose over time'.
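To make the aggregation step concrete, here is a minimal Python sketch of how per-URL first/last capture and snapshot counts could be derived from raw CDX rows. It is not the service's implementation: the function name `aggregate_cdx_rows` and the four-field row layout (timestamp, original URL, MIME type, status code) are assumptions chosen for illustration, and the output keys simply mirror the example response on this page.

```python
from collections import Counter
from urllib.parse import urlsplit
import posixpath


def aggregate_cdx_rows(rows):
    """Collapse capture rows into one entry per URL.

    rows: iterable of (timestamp, url, mime, status) string tuples
    (hypothetical layout). Timestamps are 14-digit YYYYMMDDhhmmss
    strings, so plain string comparison orders them chronologically.
    """
    per_url = {}
    status_counts = Counter()
    total = 0
    for timestamp, url, mime, status in rows:
        total += 1
        status_counts[status] += 1
        entry = per_url.get(url)
        if entry is None:
            per_url[url] = {
                "url": url, "mime": mime, "status": status,
                "firstCapture": timestamp, "lastCapture": timestamp,
                "snapshots": 1,
            }
            continue
        entry["snapshots"] += 1
        if timestamp < entry["firstCapture"]:
            entry["firstCapture"] = timestamp
        if timestamp >= entry["lastCapture"]:
            # Report MIME type and status from the most recent capture.
            entry["lastCapture"] = timestamp
            entry["mime"] = mime
            entry["status"] = status

    # Count file extensions once per unique URL; URLs without one are skipped.
    ext_counts = Counter()
    for url in per_url:
        ext = posixpath.splitext(urlsplit(url).path)[1].lstrip(".").lower()
        if ext:
            ext_counts[ext] += 1

    return {
        "uniqueUrls": len(per_url),
        "totalSnapshots": total,
        "statusCounts": dict(status_counts),
        "topExtensions": [{"ext": e, "count": c} for e, c in ext_counts.most_common()],
        "urls": list(per_url.values()),
    }
```

Whether the real endpoint counts extensions per URL or per snapshot, and which capture supplies `mime` and `status`, is not stated in the listing; the choices above are one reasonable reading.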

Price: $0.01 per request
Method: POST
Route: /v1/osint/wayback-urls
Status: Live
MIME type: application/json
Rate limit: 60/minute
Cache: 3600s public
Tags: osint, wayback, cdx, archive, urls, inventory, internet-archive, recon
API URL: https://x402.hexl.dev/v1/osint/wayback-urls
Example request
{
  "domain": "example.com",
  "limit": 100,
  "status": "200"
}
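The request above can be assembled with the Python standard library as follows. This is a sketch under assumptions: the helper name `build_request` is hypothetical, and the request is only constructed, not sent, because the listing attaches a per-request price and this section does not describe the payment or authentication steps a live call may need.

```python
import json
import urllib.request

API_URL = "https://x402.hexl.dev/v1/osint/wayback-urls"


def build_request(domain, limit=100, status=None, mime=None):
    """Build (but do not send) a POST request for the inventory endpoint."""
    payload = {"domain": domain, "limit": limit}
    # Optional filters are only included when the caller supplies them.
    if status is not None:
        payload["status"] = status
    if mime is not None:
        payload["mime"] = mime
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_request("example.com", limit=100, status="200")
```

Passing `req` to `urllib.request.urlopen` would issue the call; handle a non-2xx response before trusting the body.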
Example response
{
  "domain": "example.com",
  "uniqueUrls": 2,
  "totalSnapshots": 3,
  "returned": 2,
  "statusCounts": {
    "200": 3
  },
  "topExtensions": [
    {
      "ext": "html",
      "count": 2
    }
  ],
  "urls": [
    {
      "url": "http://example.com/",
      "mime": "text/html",
      "status": "200",
      "firstCapture": "20100101000000",
      "lastCapture": "20210101000000",
      "snapshots": 2
    }
  ]
}
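The `firstCapture` and `lastCapture` values in the response above are 14-digit Wayback timestamps in the form YYYYMMDDhhmmss. A short Python sketch for turning them into datetimes and measuring how long a URL stayed in the archive; the helper names are hypothetical, and treating the timestamps as UTC is an assumption based on Wayback convention rather than anything this listing states.

```python
from datetime import datetime, timezone


def parse_capture(ts):
    """Parse a 14-digit capture timestamp (assumed UTC) into a datetime."""
    return datetime.strptime(ts, "%Y%m%d%H%M%S").replace(tzinfo=timezone.utc)


def capture_span_days(entry):
    """Whole days between the first and last capture of one URL entry."""
    first = parse_capture(entry["firstCapture"])
    last = parse_capture(entry["lastCapture"])
    return (last - first).days


entry = {
    "url": "http://example.com/",
    "firstCapture": "20100101000000",
    "lastCapture": "20210101000000",
}
span = capture_span_days(entry)
```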
Input schema
{
  "type": "object",
  "required": [
    "domain"
  ],
  "properties": {
    "domain": {
      "type": "string",
      "examples": [
        "example.com"
      ]
    },
    "limit": {
      "type": "number",
      "default": 100,
      "examples": [
        100
      ]
    },
    "status": {
      "type": "string",
      "examples": [
        "200"
      ]
    },
    "mime": {
      "type": "string",
      "examples": [
        "text/html"
      ]
    }
  }
}
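A client can check a payload against the input schema above before spending a paid request. The following Python sketch hand-codes only the constraints the schema states (required `domain`, plus the declared types); the function name `validate_payload` and its error messages are hypothetical, and the server may apply further rules that are not documented here.

```python
def validate_payload(payload):
    """Return a list of problems; an empty list means the payload looks valid."""
    if not isinstance(payload, dict):
        return ["payload must be an object"]

    errors = []
    if "domain" not in payload:
        errors.append("domain is required")

    # Field types as declared in the input schema.
    expected = {
        "domain": str,
        "limit": (int, float),
        "status": str,
        "mime": str,
    }
    for key, types in expected.items():
        if key not in payload:
            continue
        value = payload[key]
        # bool is a subclass of int in Python but is not a JSON number.
        if isinstance(value, bool) or not isinstance(value, types):
            errors.append(f"{key} has the wrong type")
    return errors
```

Note that `status` is declared as a string, so a filter for HTTP 200 is sent as `"200"`, not the number `200`.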
Output schema
{
  "type": "object",
  "additionalProperties": true
}