Extracting ""((Adj|Noun)+|((Adj|Noun)(Noun-Prep)?)(Adj|Noun))Noun"" from Text (Justeson & Katz, 1995)

Posted by ssuhan on Stack Overflow See other posts from Stack Overflow or by ssuhan
Published on 2011-01-06T01:26:13Z Indexed on 2011/01/06 1:54 UTC
Read the original article Hit count: 580

I would like to query if it is possible to extract ((Adj|Noun)+|((Adj|Noun)(Noun-Prep)?)(Adj|Noun))Noun proposed by Justeson and Katz (1995) in R package openNLP?

That is, I would like to use this linguistic filtering to extract candidate noun phrases.

I cannot well understand its meaning.

Could you do me a favor to explain it or transform such representation into R language.

Many thanks.

Maybe we can start the sample code from:

library("openNLP")

acq <- "This paper describes a novel optical thread plug gauge (OTPG) for internal thread inspection using machine vision. The OTPG is composed of a rigid industrial endoscope, a charge-coupled device camera, and a two degree-of-freedom motion control unit. A sequence of partial wall images of an internal thread are retrieved and reconstructed into a 2D unwrapped image. Then, a digital image processing and classification procedure is used to normalize, segment, and determine the quality of the internal thread."

acqTag <- tagPOS(acq)

acqTagSplit = strsplit(acqTag," ")

© Stack Overflow or respective owner

Related posts about r

    Related posts about text-parsing