It calculates the proportion of related words that had different stems.

understemming_index(words, stems)

Arguments

words

is a data.frame containing a column word a a column group so the function can identify groups of words.

stems

is a character vector with the stemming result for each word