Word filter that groups words?

Posted by Legend on Stack Overflow See other posts from Stack Overflow or by Legend
Published on 2010-04-19T15:38:39Z Indexed on 2010/04/19 15:43 UTC
Read the original article Hit count: 154

Is there any library that achieves the following:

Convert

Microsoft Windows 98
Microsoft Windows XP
Windows 7
Windows Ultimate
Desktop Windows

to

Windows 4

The complicated part here is to recognize that "Desktop Windows" is an anomaly here and not count it. If nothing is added before the word "Windows", perhaps it can be counted but if there is something else and the suffix does not match any popular suffix, it can still be counted.

Maybe I am a little vague here but perhaps someone could have an idea about what I am talking about here. Any suggestions?

© Stack Overflow or respective owner

Related posts about natural-language

Related posts about language-agnostic