Examples for enhanced CONJ tags


When entering the enhanced CONJ tags, they must be followed by a comma (for instance, not just INIT, but INIT,). There is no space between the enhanced tag and the comma. In what follows, the commas has been omitted for simplicity.


Subtags

Type Subtag
Clause-initial INIT
Word-level PRE, WORD
Phrase-level ADJP, ADVP, FRAG, INTJP, NP, PP, QP, QTP, VP (rare), WADJP, WADVP, WNP, WPP
Intermediate-level ADJX, ADVX, NX
Clause-level CP, IP, RRC
Gapping GAP
Uncertain X

Clause-initial conjunction

CONJ_INIT indicates that the scope of conjunction is the entire token.
( (IP-MAT (CONJ_INIT And)
          (ADVP-TMP (ADV then))
	  (NP-SBJ (D the) (N rain))
          (VBD started)))

( (IP-MAT (CONJ_INIT For)
          (NP-SBJ (PRO we))
	  (HVP have)
          (ADVP-TMP (ADV always))
	  (VBN known)
          (NP-ACC (PRO it))))

CONJ_INIT is also used for full (= ungapped) parenthetical clauses.

( (IP-MAT (CONJ_INIT For)
          (NP-SBJ (PRO we))
	  (HVP have)
          (ADVP-TMP (ADV always))
	  (, ,)
	  (IP-MAT-PRN (CONJ_INIT and)
	  	      (NP-SBJ (PRO I))
	  	      (HVP have)
		      (VBN pointed)
		      (NP-ACC (D this))
		      (RP out)
		      (ADVP (ADV repeatedly))
	  (, ,)
	  (VBN known)
          (NP-ACC (PRO it))))

Word-level conjunction

CONJ_WORD indicates that the scope of conjunction is the next word (regardless of category).
(XP (X ...) (CONJ_WORD and) (X ...))				← like categories

(XP (X ...) (CONJ_WORD and) (Y ...))				← unlike categories

(ADJP (VAN interested) (CONJ_WORD and) (ADJ enthusiastic))

(NP (NS sticks) (CONJ_WORD and) (NS stones))

(NP-SBJ (NPR John) (CONJ_WORD and) (PRO I))

(ADVP-DIR (ADV hither) (CONJ_WORD and) (ADV thither))

(IP-MAT (NP-SBJ (PRO I))
	(VBD (VBD came) (CONJ_WORD and) (VBD saw) (CONJ_WORD and) (VBD conquered)))

(CP-ADV (VP (VB (VB come) (CONJ_WORD and) (VB go)))
        (C though)
        (IP-SUB he may ...))

(WNP (WPRO who) (CONJ_WORD or) (WPRO what))

In the cases of BOTH ... AND, EITHER ... OR, and NEITHER ... NOR, the first member (BOTH, EITHER, NEITHER) is tagged CONJ_PRE.

(NP (CONJ_PRE both) (NS sticks) (CONJ_WORD and) (NS stones)) (ADVP-DIR (CONJ_PRE either) (ADV hither) (CONJ_WORD or) (ADV thither))

Phrasal conjunction

CONJ_ADJP, CONJ_ADVP, etc. indicates that the scope of conjunction is an entire phrase (ADJP, ADVP, FRAG, INTJP, NP, PP, QP, QTP, VP, WADJP, WADVP, WNP, WPP).

In cases of unlike category conjunction, the CONJ subtag indicates the category of the first conjunct and, by convention, the category that will end up dominating the entire conjunction structure in the parse. In other words, in this case, the CONJ looks "up" the tree to the conjunct that precedes it.

(XP (XP ...)
    (CONJP (CONJ_XP and)			← like category coordination
           (XP ...)))

(XP (XP ...)
    (CONJP (CONJ_XP and)			← unlike category coordination - note CONJ_XP
           (YP ...)))

(ADJP (ADJP (ADJ lonely))
      (CONJP (CONJ_ADJP and)			← CONJ_ADJP even though first conjunct is a single word
             (ADJP (ADJ far)
	   	   (PP (P from)
		       (NP (N home))))))

(ADJP (ADJP (ADJ far)
	    (PP (P from)
		(NP (N home))))
            (CONJP (CONJ_ADJP and)		← CONJ_ADJP even though second conjunct is a single word
                   (ADJP (ADJ lonely))))

(INTJP (INTJP (INTJ Jiminy) (INTJ crickets))
       (CONJP (CONJ_INTJP and)
              (INTJP (INTJ holy) (INTJ shmoly))))

( (IP-MAT (QTP (QTP (NP (NEG Not) (D that)))
               (CONJP (CONJ_QTP or)
                      (QTP (ADJ hot) (D that))))	← not a normal constituent - hence QTP
          (NP-SBJ (PRO He))
  	  (VBD muttered)))

(NP (NP (D the) (ADJ big) (N boy))
    (CONJP (CONJ_NP and)
           (NP (D the) (ADJ little) (N girl))))

(PP (PP (RP out)
        (P of)
        (NP (D the) (N frying) (N pan)))
    (CONJP (CONJ_PP and)
           (PP (P into)
	       (NP (D the) (N fire)))))

(NP (NP (D a) (N Republican))
    (CONJP (CONJ_NP and)			← unlike category coordination - note CONJ_NP
           (ADJP (ADJ proud)
	   	 (PP (P of)
		     (NP (PRO it))))))

(CP-ADV (VP (VP (VB arrive)			← rare because VP is rare
	    	(ADVP-TMP (ADV late)))
            (CONJP (CONJ_VP but)
                   (VP (VB leave)
		       (ADVP-TMP (ADV early)))))
        (C though)
        (IP-SUB he may ...))

(WADVP (WADVP (WADV how) (WADV often))
       (CONJP (CONJ_WADVP and)			← unlike category coordination - note CONJ_WADVP
       	      (WPP (P under)
	      	   (WNP (WD what) (NS circumstances)))))

(WNP (WNP (WD which) (N person))
     (CONJP (CONJ_WNP or)
            (WNP (WD which) (N organization))))

(WPP (WPP (P under)
	  (WNP (WD what) (NS circumstances)))
     (CONJP (CONJ_WPP and)			← unlike category coordination - note CONJ_WPP
     	    (WADVP (WADV how) (WADV often))))

Intermediate-level conjunction

CONJ_ADJX, CONJ_ADVX, CONJ_NX indicates that the scope of conjunction is an intermediate projection level (ADJX, ADVX, NX). The CONJ label looks "down" the tree to the conjunct that follows it.

There shouldn't be any cases of unlike category conjunction with intermediate-level conjuncts.

(NP (D the) (ADJ big) (N boy)	     		← CONJ_NX, not CONJ_WORD because of the (potentially) shared premodifiers
    (CONJP (CONJ_NX and)
           (NX (ADJ little) (N girl))))

(NP (D the) (ADJ big) (N boy)
    (CONJP (CONJ_NX and)
           (NX (N girl))))

(WNP (WD which) (N person)
     (CONJP (CONJ_NX or)
            (NX (N organization))))

Clause-level conjunction

CONJ_CP, CP_IP, CONJ_RRC indicates conjunction of full clauses.
( (IP-MAT (NP-SBJ (PRO He))
          (VBD said)
	  (CP-THT (CP-THT (C that / 0)			← same whether C in first conjunct is overt or not
	  	          (IP-SUB (NP-SBJ (PRO he))	
		      	          (BED was)
			          (VAG coming)))
                  (CONJP (CONJ_CP but)
		  	 (CP-THT (C that)
			         (IP-SUB (NP-SBJ (PRO he))	
		      	                 (HVD had)
			                 (VBN eaten))))))))

In cases of ambiguity between conjoining two IPs or conjoining two CPs containing silent complementizers, the default is the two-IP structure, in accordance with the default of "minimize structure".

( (IP-MAT (NP-SBJ (PRO He))
          (VBD said)
	  (CP-THT (C that / 0)				← same whether C is overt or not
	  	  (IP-SUB (IP-SUB (NP-SBJ (PRO he))	
		                  (BED was)
			          (VAG coming)))
                          (CONJP (CONJ_IP but)		← CONJ_IP, not CONJ_CP !
			         (IP-SUB (NP-SBJ *con*)
		      	                 (HVD had)
			                 (VBN eaten))))))))

( (IP-MAT (NP-SBJ (NPR John))
          (, ,)
	  (IP-PPL (IP-PPL (VAG walking)
    	                  (PP (P along)
	                      (NP (D the) (N beach))))
                  (CONJP (CONJ_IP and)
                         (IP-PPL (VAG noticing)
		                 (NP-ACC (D a) (N glint)))))
          (, ,)
          (VBD found)
	  (NP-ACC (D the) (N treasure))))

(NP (Q all) (NS students)
    (RRC (RRC (VAG majoring)
    	      (PP (P in)
	          (NP (N linguistics))))
         (CONJP (CONJ_RRC and)
                (RRC (VAG graduating)
		     (NP-TMP (D this) (N year))))))

Conjunction in connection with gapping

CONJ_GAP indicates conjunction of gapped material.
( (IP-MAT (IP-MAT-1 (NP-SBJ (PRO I))
                    (HVP have)
	            (VBN stood)
	            (PP (P on)
	                (NP (D the) (N beach))))
	  (CONJP (CONJ_GAP and)
	  	 (IP-MAT=1 (VBN heard)
		 	   (NP-ACC (D the) (N ocean))))))

(IP-INF-1 (TO to)
          (VB pick)
	  (RP up)
	  (IP-INF-PRN=1 (CONJ_GAP and)
	  	        (VB put)
		        (RP down))
	  (NP-ACC (D the) (N book)))

We'll discuss any cases of backwards gapping as they come up (they are rare).

Various structures are systematically ambiguous between conjunction + gapping versus conjunction of (possibly extraposed) ungapped constituents. The default is to avoid gapping.

The default covers difficult cases like "Fanny, of course, belonging to it, and little Edward, for that day", where it is unclear whether the scope of "for that day" extends to Fanny.

In some cases, however, gapping is the only analysis. This is particularly common in connection with expressions like "especially", "in particular", "rather", and so on. We take these to attach high (following our default for high attachment).

Note, however, that negation in the X NOT Y, X AND NOT Y, X BUT NOT Y construction and focus particles attach low.

(IP (IP-1 to eat beans,)
    (CONJP (CONJ_GAP or)
           (IP=1 (ADVP rather,)
                 (NP peas))))
rather than:
(IP to
    eat 
    (NP (NP beans,)
        (CONJP (CONJ_NP or)
               (NP rather, peas))))

Uncertainty

If you're unsure of which CONJ subtype is appropriate, use CONJ_X or add a comment in the tagged file.

Defaults

Special cases

To be dealt with

FRAG

FS = conjunction at the beginning of a false start