Catalog/ref-stopwords

Reference

Stopword list (multi-language) API

Return a curated stopword list for an ISO-639-1 language (11 languages incl. en/es/fr/de/zh/ja/ar/ru), and optionally strip stopwords from supplied text returning the filtered tokens + removal stats. The MORE: a multi-language stopword corpus plus an actual tokenize-and-filter step for NLP/RAG pipelines in one call. Answers 'give me English stopwords', 'remove stopwords from this text', 'French stopword list', 'clean this text for indexing'.

Price$0.01per request
MethodPOST
Route/v1/ref/stopwords
StatusLive
MIME typeapplication/json
Rate limit60/minute
CacheNo cache
refstopwordsnlpragtokenizetext-cleaninglanguagestop-words
API URLhttps://x402.hexl.dev/v1/ref/stopwords
Integration docs
Example request
{
  "language": "en",
  "text": "the quick brown fox"
}
Example response
{
  "language": "en",
  "count": 102,
  "stopwords": [
    "a",
    "an",
    "and",
    "are",
    "as"
  ],
  "text": {
    "tokenCount": 4,
    "removed": 1,
    "filteredTokens": [
      "quick",
      "brown",
      "fox"
    ],
    "filtered": "quick brown fox"
  }
}
Input schema
{
  "type": "object",
  "required": [
    "language"
  ],
  "properties": {
    "language": {
      "type": "string",
      "examples": [
        "en"
      ]
    },
    "text": {
      "type": "string",
      "examples": [
        "the quick brown fox"
      ]
    }
  }
}
Output schema
{
  "type": "object",
  "additionalProperties": true
}