|
Stemming is the process of extracting stem (base word) from the
given word. This base word needs not to be in the root
(meaningful) form. A word can be a combination of affixes and
stem. An affix can be a prefix, postfix or infix. In the
developed Urdu Stemmer Assas-band, foreign words are not
handled. The task of Urdu Stemmer is to extract stem, prefix and
postfix from the given word. Assas-band only returns meaningful
form of the stem which means that the base form is converted
into root by attaching required character(s). Assas-band
distinguishes between stem of masculine and feminine forms. For
example the stem of لڑکیاں (girls) is لڑکی (girl), whereas the
stem of لڑکوں (boys) is لڑکا (boy). |
|