Module Snowball

type encoding =
  | ISO_8859_1
  | ISO_8859_2
  | KOI8_R
  | UTF_8

Type of encodings.

val pp_encoding : Format.formatter -> encoding -> unit
type t

Type of stemmers.

module Language : sig ... end
val languages : Language.t list

Languages available for stemming.

val porter : Language.t
val create : ?encoding:encoding -> Language.t -> t

create ?encoding language creates a stemmer which can be used to stem words via stem.

NOTE: it is important to release a t (via remove) when you are done stemming.

val remove : t -> unit

remove stemmer destroys the underlying structure used to stem words.
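One way to make sure remove is always called, even when the stemming code raises, is to wrap the stemmer's lifetime in Fun.protect from the standard library (OCaml 4.08 or later). The following is only a sketch: with_stemmer is a hypothetical helper that is not part of this module, and the code assumes the snowball library is installed and linked.

```ocaml
(* Assumes the snowball library is installed and linked. *)

(* [with_stemmer] is a hypothetical helper, not provided by Snowball.
   It creates a stemmer, passes it to [f], and calls [Snowball.remove]
   afterwards, whether [f] returns normally or raises. *)
let with_stemmer ?encoding language f =
  let stemmer = Snowball.create ?encoding language in
  Fun.protect
    ~finally:(fun () -> Snowball.remove stemmer)
    (fun () -> f stemmer)

let () =
  let stemmed =
    with_stemmer ~encoding:Snowball.UTF_8 Snowball.porter (fun stemmer ->
        Snowball.stem stemmer "running")
  in
  print_endline stemmed
```

With this pattern the stemmer never escapes the callback, so it is hard to keep using it by accident after it has been released.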

val stem : t -> string -> string

stem stemmer word returns the stem of word, computed by stemmer according to the language it was created for.
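As a rough illustration of how create, stem and remove fit together, the sketch below stems a few words with the porter language. The word list and the explicit choice of UTF_8 are arbitrary, and the code assumes the snowball library is installed and linked.

```ocaml
(* Assumes the snowball library is installed and linked. *)
let () =
  (* Create a stemmer for the Porter language, with an explicit encoding. *)
  let stemmer = Snowball.create ~encoding:Snowball.UTF_8 Snowball.porter in
  (* Arbitrary sample words, chosen only for illustration. *)
  let words = [ "connection"; "connected"; "connecting" ] in
  List.iter
    (fun word -> Printf.printf "%s -> %s\n" word (Snowball.stem stemmer word))
    words;
  (* Release the underlying structure once stemming is finished. *)
  Snowball.remove stemmer
```

The same stemmer value is reused for every word here; it is released once, after the last call to stem.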