-
Notifications
You must be signed in to change notification settings - Fork 8
Open
Labels
Description
Why are the following parses considered to have different lengths? I'm guessing it has something to do with a punctuation filter.
GOLD=
(ROOT (S (CC And) (NP (NP (NNS rents)) (PP (IN on) (NP (NP (NNP Beverly) (NNP Hills)
(POS ')) (NNP Rodeo) (NNP Drive)))) (ADVP (RB generally)) (VP (VBP do) (RB n't)
(VP (VB exceed) (NP (NP (RB about) ($ $) (CD 125)) (NP (DT a) (JJ square)
(NN foot))))) (. .)))
TEST=
(ROOT (S (CC And) (NP (NP (NNS rents)) (PP (IN on) (NP (NNP Beverly) (NNP Hills))))
('' ') (NP (NNP Rodeo) (NNP Drive)) (ADVP (RB generally)) (VP (VBP do) (RB n't)
(VP (VB exceed) (NP (NP (QP (IN about) ($ $) (CD 125))) (NP (DT a) (NN square)
(NN foot))))) (. .)))
In this case, I think the TEST parse drops the token ('' ')
, but the GOLD parse does not because it is has a possessive tag.