PAROLE* Morphosyntactic Tagset for Irish

latest version https://www.scss.tcd.ie/SLP/parole.htm (Sept 2012)

*The PAROLE tagset is a common tagset developed for 14 European languages during the EU-funded LE-PAROLE project (1996-1999).

Revised 2004-2006
Updated for Spoken Language 2011 (table 13)


The tables below give a full description of the part-of-speech mark-up tags used in the linguistically annotated Corpus of Irish.


For example:
<w tag="Ncmsc" base="doras" std="dhoras">dhoras</w>

This consists of two parts:

  1. <w>dhoras</w> is the basic word tag. A start tag <w> and end tag </w> surround each word, in this case 'dhoras'.
  2. The start tag <w> contains three attributes, each of which describes the word in question.
    • tag="Ncmsc" is the morphosyntactic description of the word "dhoras". It is an attribute-value pair, where the attribute name is "tag" and, in this case, its value is "Ncmsc". The meaning of the PAROLE tag values can be looked up in the tables below.
      In the case of "dhoras", "Ncmsc" indicates a Noun (N), common noun (c), masculine grammatical gender (m), singular number (s), common (c) case (i.e. nom/acc/dat); semantic gender is not used, and the word is not marked for contrast (e.g. dhoras-sa). (Semantic gender would be relevant to a word like 'cailín' (girl), whose grammatical gender is masculine but whose semantic gender is feminine.)
    • base="doras" is the lemma or dictionary headword associated with this inflected word 'dhoras'.
    • std="dhoras" is the modern standard spelling of the word, which in modern texts will be the same as the inflected word. This attribute is mainly relevant to texts with pre-standardised spelling, e.g. <w tag="Ncmsc" base="máistir" std="mháistir"> mhaighistir </w>, where the older spelling 'mhaighistir' is associated with its modern standard equivalent 'mháistir'.
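
The three attributes described above can be read with any XML parser. The following is a minimal sketch, assuming each word is available as a well-formed <w> element with quoted attribute values; the helper name read_word is hypothetical and not part of the corpus tooling.

```python
import xml.etree.ElementTree as ET

# The annotated word from the example above.
markup = '<w tag="Ncmsc" base="doras" std="dhoras">dhoras</w>'


def read_word(xml_fragment):
    """Return the surface form and the three attributes of one <w> element.

    Hypothetical helper for illustration only.
    """
    element = ET.fromstring(xml_fragment)
    return {
        # Surface form as it appears in the text; padding spaces are dropped.
        "form": (element.text or "").strip(),
        "tag": element.get("tag"),    # morphosyntactic description
        "base": element.get("base"),  # lemma / dictionary headword
        "std": element.get("std"),    # modern standard spelling
    }


word = read_word(markup)
print(word)
```

For a text with pre-standardised spelling, "form" and "std" differ, so comparing the two is one way to find words whose spelling was normalised.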

REVISIONS to PAROLE COMMON MORPHOSYNTACTIC TAGSET:
Additional features for Irish are marked with a *
Additional values for Irish which were added to existing features are underlined.

The following tables describe the information associated with the various parts of speech.

 1. Noun            9. Conjunction
 2. Verb           10. Numerals
 3. Adjective      11. Interjection
 4. Pronoun        12. Unique Membership Class
 5. Determiner     13. Residuals (incl. Spoken)
 6. Article        14. Punctuation
 7. Adverb         15. Abbreviation
 8. Adposition     16. Copula
                   17. Verbal Particle


1. NOUN
1. POS          N
2. Type         c = common; p = proper; s = substantive; v = verbal
3. Gender       f = fem.; m = masc.
4. Number       s = sing.; p = pl.
5. Case         c = com.; g = gen.; v = voc.; d = dative
6. Sem-Gender   n/a
*7. Contrast    e = emphatic
*8. Derived     n = de-nominal

Example :
<w msd=Npfsc>Feabhra</w>
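
Because each character of a tag fills one numbered position, a noun tag can be decoded by walking the tag and the table in parallel. The sketch below transcribes the NOUN table into a Python list; the names NOUN_POSITIONS and decode_noun are hypothetical, and the sketch assumes that trailing positions may simply be left off, as in the example tag above.

```python
# Positions 2-8 of a noun tag, transcribed from the NOUN table.
NOUN_POSITIONS = [
    ("Type", {"c": "common", "p": "proper", "s": "substantive", "v": "verbal"}),
    ("Gender", {"f": "feminine", "m": "masculine"}),
    ("Number", {"s": "singular", "p": "plural"}),
    ("Case", {"c": "common", "g": "genitive", "v": "vocative", "d": "dative"}),
    ("Sem-Gender", {}),  # listed as n/a in the table
    ("Contrast", {"e": "emphatic"}),
    ("Derived", {"n": "de-nominal"}),
]


def decode_noun(tag):
    """Map each character of a noun tag to the feature at that position.

    Hypothetical helper: codes not found in the table are returned unchanged.
    """
    if not tag.startswith("N"):
        raise ValueError("not a noun tag: " + tag)
    features = {"POS": "Noun"}
    # zip() stops at the shorter sequence, so omitted trailing positions
    # are simply absent from the result.
    for code, (name, values) in zip(tag[1:], NOUN_POSITIONS):
        features[name] = values.get(code, code)
    return features


print(decode_noun("Npfsc"))
print(decode_noun("Ncmsc"))
```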


2. VERB
1. POS           V
2. Type          m = main
3. Mood          i = indic.; s = subj.; m = imper.; c = cond.
4. Tense         p = pres.; s = past; h = past hab.; f = future; g = pres. hab.
5. Person        1 = first; 2 = second; 3 = third; 0 = autonomous
6. Number        s = sing.; p = pl.
7. Gender        n/a
*8. Dependency   d = dependent; r = relative; n = negative
*9. Contrast     e = emphatic

Example :
<w msd=Vmip>fáiltíonn</w>
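
The VERB table can also be used to check that a tag only contains codes allowed at each position. This is a rough sketch rather than an official validator: the names VERB_POSITIONS and is_valid_verb_tag are hypothetical, and two conventions are assumed from the examples in this document, namely that trailing positions may be omitted (as in Vmip) and that a hyphen may stand in for a position with no value.

```python
# Allowed codes for positions 2-9 of a verb tag, taken from the VERB table.
# The empty string for Gender reflects the "n/a" entry in the table.
VERB_POSITIONS = [
    "m",      # 2. Type
    "ismc",   # 3. Mood
    "pshfg",  # 4. Tense
    "1230",   # 5. Person
    "sp",     # 6. Number
    "",       # 7. Gender (n/a)
    "drn",    # 8. Dependency
    "e",      # 9. Contrast
]


def is_valid_verb_tag(tag):
    """Return True if every character of the tag is allowed at its position.

    Assumption: "-" is accepted anywhere as a placeholder for "no value".
    """
    if not tag.startswith("V") or len(tag) - 1 > len(VERB_POSITIONS):
        return False
    return all(
        code == "-" or code in allowed
        for code, allowed in zip(tag[1:], VERB_POSITIONS)
    )


for candidate in ("Vmip", "Vmip3s-d", "Vmiz"):
    print(candidate, is_valid_verb_tag(candidate))
```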


3. ADJECTIVE
1. POS         A
2. Type        q = qualificator; v = verbal
3. Degree      p = positive; c = comparative
4. Gender      f = fem.; m = masc.
5. Number      s = sing.; p = pl.
6. Case        c = com.; g = gen.; v = voc.
*7. Contrast   e = emphatic


Example :
<w msd=Aqp>glan</w>
<w msd=Aqc>mó</w>
<w msd=Av>briste</w>


4. PRONOUN
1. POS         P
2. Type        p = personal; x = reflexive; q = interrog.; i = indefinite; r = prepositional; d = demonstrative
3. Person      1 = first; 2 = second; 3 = third; 0 = null
4. Gender      f = fem.; m = masc.
5. Number      s = sing.; p = pl.
6. Case        n = nom.
7. Possessor   e = emphatic


Example :
<w msd=Pp3msn>sé</w>
<w msd=Pp3msne>seisean</w>
<w msd=Px>féin</w>
<w msd=Pi>ceachtar</w>
<w msd=Pr1-s>agam</w>

<w msd=Pq>cé</w>
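
In the example Pr1-s the hyphen appears to hold the place of a position that carries no value (here Gender), so that the following code still lands on the right position (Number). The sketch below decodes pronoun tags on that assumption, which is inferred from the examples rather than stated in the tables; the names PRONOUN_POSITIONS and decode_pronoun are hypothetical.

```python
# Positions 2-7 of a pronoun tag, transcribed from the PRONOUN table.
PRONOUN_POSITIONS = [
    ("Type", {
        "p": "personal", "x": "reflexive", "q": "interrogative",
        "i": "indefinite", "r": "prepositional", "d": "demonstrative",
    }),
    ("Person", {"1": "first", "2": "second", "3": "third", "0": "null"}),
    ("Gender", {"f": "feminine", "m": "masculine"}),
    ("Number", {"s": "singular", "p": "plural"}),
    ("Case", {"n": "nominative"}),
    # The table heads position 7 "Possessor", but the only value listed
    # there is e = emphatic, so a neutral key is used here.
    ("Position7", {"e": "emphatic"}),
]


def decode_pronoun(tag):
    """Decode a pronoun tag; a hyphen is read as 'no value' (assumption)."""
    if not tag.startswith("P"):
        raise ValueError("not a pronoun tag: " + tag)
    features = {}
    for code, (name, values) in zip(tag[1:], PRONOUN_POSITIONS):
        features[name] = None if code == "-" else values.get(code, code)
    return features


for example in ("Pp3msn", "Pp3msne", "Pr1-s", "Px"):
    print(example, decode_pronoun(example))
```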

5. DETERMINER
1. POS         D
2. Type        d = demonstrative; p = possessive; q = quantifier; w = interrogative
3. Person      1 = first; 2 = second; 3 = third
4. Gender      f = fem.; m = masc.
5. Number      s = sing.; p = pl.
6. Case        n/a
7. Possessor   n/a

Example :
<w msd=Dd>seo</w>
<w msd=Dp3>a</w>
<w msd=Dw>cé</w>


6. ARTICLE
1. POS      T
2. Type     d = definite
3. Gender   f = fem.; m = masc.
4. Number   s = sing.; p = pl.
5. Case     g = gen.


Example :
<w msd=Td-s>an</w>
<w msd=Td-p>na</w>
<w msd=Tdfsg>na</w>

7. ADVERB
1. POS        R
2. Type       g = general; d = direction; i = intensifier; q = interrogative; t = temporal; l = locative
3. Degree     b = base; c = comparative; s = superlative
4. Function   m = mod.; s = spe.
5. Wh-ness    n/a

Example :
<w msd=Rg>síos</w>
<w msd=Rr>mar</w>
<w msd=Rq>conas</w>
<w msd=Rd>siar</w>

8. ADPOSITION
1. POS         S
2. Type        p = preposition
3. Formation   c = compound (multi-word); a = with article; p = with poss. det.; r = with rel. part.; d = with degree part.; o = with obj. pron.
4. Gender      (none listed)
5. Number      s = sing.; p = pl.

Example :
<w msd=Sp>le</w>
<w msd=Spa>sa</w>


9. CONJUNCTION
1. POS         C
2. Type        c = coordinate; s = subordinative
3. Ctype       w = with copula
4. Coord-pos   s = past tense

Example :
<w msd=Cc>agus</w>
<w msd=Cs>go</w>


10. NUMERALS
1. POS      M
2. Type     c = cardinal; o = ordinal; n = numeral; p = personal; r = roman; s = operator
3. Gender   n/a
4. Number   n/a
5. Case     n/a

Example :
<w msd=Mc>trí</w>
<w msd=Mo>chéad</w>
<w msd=Mp>triúr</w>
<w msd=Mr>iii</w>
<w msd=Ms>=</w>

   

11. INTERJECTION  
1. POS   I

Example :
<w msd=I>Ora</w>

12. UNIQUE MEMBERSHIP CLASS
1. POS             U
2. Particle Type   v = vocative; c = comparative/superl.; a = adverbial; m = numeral; d = degree; p = patronym; w = copular particle
3. B-Function      (none listed)

Example :
<w msd=Uv>a</w>
<w msd=Ua>go</w>
<w msd=Up>Uí</w>



13. RESIDUALS
1. POS            X
2. Type           a = acronym; d = date; e = e-mail address; l = list item; n = number; t = toponym; w = website address; x = unknown; y = symbol (changed from s in 2011); f = foreign; s = spoken
3. Sub-category   with f (foreign): e = English; ev = E. verb; en = E. noun; enp = E. noun proper; enpd = E. noun prop. definite; ea = E. adjective; er = E. adverb; ed = E. determiner; ec = E. conjunction; en = E. number
                  with s (spoken): c = communicator; ce = communicator - English; e = event (cough, laugh, etc.); f = filler (em, mmm, ahem, etc.); p = phonetic fragment (e.g. m-, dr-, etc.); x = indecipherable

Example :
<w msd=Xf>chevalier</w>
<w msd=Xfen>studio</w>
<w msd=Xfenpd>Bahamas</w>
<w msd=Xe>sunny@galaxy.com</w>
<w msd=Xsc>mar a déarfá</w>
<w msd=Xsce>you know</w>
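
Residual tags differ from the other classes in that the sub-category can be several characters long (e.g. enpd) and its meaning depends on the Type before it. A minimal sketch of how such a tag might be split is given below; all names are hypothetical. Note that the table lists the code en twice (E. noun and E. number); the sketch keeps the noun reading, which is the one used in the Xfen example.

```python
# Type codes (position 2), transcribed from the RESIDUALS table.
RESIDUAL_TYPES = {
    "a": "acronym", "d": "date", "e": "e-mail address", "l": "list item",
    "n": "number", "t": "toponym", "w": "website address", "x": "unknown",
    "y": "symbol", "f": "foreign", "s": "spoken",
}

# Sub-category codes, which only apply after f (foreign) or s (spoken).
SUBCATEGORIES = {
    "f": {
        "e": "English", "ev": "English verb",
        "en": "English noun",  # the table also lists en = E. number
        "enp": "English proper noun",
        "enpd": "English proper noun, definite",
        "ea": "English adjective", "er": "English adverb",
        "ed": "English determiner", "ec": "English conjunction",
    },
    "s": {
        "c": "communicator", "ce": "communicator, English",
        "e": "event", "f": "filler", "p": "phonetic fragment",
        "x": "indecipherable",
    },
}


def decode_residual(tag):
    """Split a residual tag into its type and optional sub-category."""
    if not tag.startswith("X") or len(tag) < 2:
        raise ValueError("not a residual tag with a type: " + tag)
    type_code, sub_code = tag[1], tag[2:]
    result = {"type": RESIDUAL_TYPES.get(type_code, "unrecognised")}
    if sub_code:
        table = SUBCATEGORIES.get(type_code, {})
        result["subcategory"] = table.get(sub_code, "unrecognised")
    return result


for example in ("Xf", "Xfen", "Xfenpd", "Xe", "Xsc", "Xsce"):
    print(example, decode_residual(example))
```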

14. PUNCTUATION
1. POS    F
2. Type   e = sentence final; i = sentence internal; a = quote/par init.; z = quote/par fin.; b = hyphen/underscore/dash; q = ?

Example :
<w msd=Fi>;</w>
<w msd=Fb>-</w>

<w msd=Fq>?</w>

       
      

15. ABBREVIATION 
1. POS   Y


Example :
<w msd=Y>lch</w>

16. COPULA
1. POS           W
2. Tense/Mood    p = pres/fut; s = past/cond
3. Clause Type   d = dependent; r = relative (direct); s = relative (indirect)
4. Mood          i = indic.; s = subjunct.; q = interrog.
5. Neg/Aff       n = negative
6. Person        3 = third pers. pronoun

17. VERBAL PARTICLE
1. POS     Q
2. Type    q = interrog.; n = neg.
3. Mood    m = imperative; q = interrog.; r = relative; s = subjunct.
4. Tense   s = past

Example :
<w msd=Q-r>a</w>
<w msd=Q>ní</w>
