The
tables below give a full description of the part-of-speech mark-up tags used in the linguistically annotated Corpus of Irish
eg:<w
tag="Ncmsc" base="doras" std="dhoras">dhoras</w>
This
consists of two parts:
- <w>dhoras</w>
is the basic word tag. A start tag <w>
and end tag </w> surround each word, in this case
'dhoras'.
- The start tag <w> contains three attributes each of whose values relate to the word in question.
- tag="Ncmsc"
is the morphosyntactical
description of the word "dhoras". It is in
the form of an attribute and value pair, where the name
of the attribute is "tag" and in this case its
value is "Ncmsc". The meaning of the PAROLE tag values
can be looked up in the tables below.
In the case of "dhoras", "Ncmsc" tells
that it is a Noun(N), common noun(c), masculine grammatical gender(m), singular number(s),
common(c) case (i.e nom/acc/dat), semantic gender is not used,
and the word is not marked for contrast (e.g. dhoras-sa). (Semantic gender would be relevant to a word like 'cailín' (girl) whose grammatical gender is masculine but whose semantic gender is feminine).
- base="doras" is the lemma or dictionary headword associated with this inflected word 'dhoras'.
- std="dhoras" is the modern standard spelling of the word, which in modern texts will be the same as the inflected word. This tag is mainly of relevance to texts with pre-standardised spelling, e.g. <w
tag="Ncmsc" base="máistir" std="mháistir"> mhaighistir </w> where the older spelling 'mhaighistir' is associated with the modern standard equivalent 'mháistir'.
REVISIONS to PAROLE COMMON MORPHOSYNTACTIC TAGSET:
Additional features for Irish are marked with a *
Additional values for Irish which were added to existing features are underlined.
The following tables describe the information associated with
the various parts of speech.
1.
NOUN |
1.
POS |
2.
Type |
3.
Gender |
4.
Number |
5.
Case |
6.
Sem-Gender |
*7.
Contrast |
*8.
Derived |
N |
c
= common
p = proper
s = substant.
v
= verbal |
f
= fem
m = mas |
s
= sing. p = pl. |
c
= com.
g = gen.
v = voc.
d = dative |
n/a |
e
= emphatic |
n = de-nominal |
Example :
<w msd=Npfsc>Feabhra</w>
2.
VERB |
1.
POS |
2.
Type |
3.
Mood |
4.
Tense |
5.
Person |
6.
Number |
7.
Gender |
*8.
Dependency |
*9.
Contrast |
V |
m
= main
|
i
= indic .
s = subj.
m = imper
c = cond.
|
p
= pres.
s = past
h = past hab
f = future
g = pres. hab |
1
= first
2 = sec.
3 = third
0 = auto
|
s
= sing
p = pl. |
n/a |
d
= dependent
r = relative
n = negative |
e
= emphatic |
Example :
<w msd=Vmip>fáiltíonn</w>
3.
ADJECTIVE |
1.
POS |
2.
Type |
3.
Degree |
4.
Gender |
5.
Number |
6.
Case |
*7. Contrast |
A |
q
= qualificator
v = verbal
|
p
= positive
c = comparative |
f
= fem.
m = masc. |
s
= sing
p = pl. |
c
= com.
g = gen.
v = voc. |
e
= emphatic |
Example :
<w msd=Aqp>glan</w>
<w msd=Aqc>mó</w>
<w msd=Av>briste</w>
4.
PRONOUN |
1.POS |
2.
Type |
3.
Person |
4.
Gender |
5.
Number |
6.
Case |
7.
Posessor |
P |
p
= personal
x = reflexive
q = interrog.
i = indefinite
r = prepositional
d = demonstrative |
1
= first
2 = second
3 = third
0 = null |
f
= fem.
m = masc. |
s
= sing.
p= pl. |
n = nom |
e
= emphatic |
Example :
<w msd=Pp3msn>sé</w>
<w msd=Pp3msne>seisean</w>
<w msd=Px>féin</w>
<w msd=Pi>ceachtar</w>
<w msd=Pr1-s>agam</w>
<w msd=Pq>cé</w>
5.
DETERMINER |
1.
POS |
2.
Type |
3.
Person |
4.
Gender |
5.
Number |
6.
Case |
7.
Posessor |
D |
d
= demonstrative
p = possessive
q = quantifier
w = interrogative |
1
= first
2 = second
3 = third |
f
= fem.
m = masc. |
s
= sing
p = pl. |
n/a |
n/a |
Example :
<w msd=Dd>seo</w>
<w msd=Dp3>a</w>
<w msd=Dw>cé</w>
6.
ARTICLE |
1.
POS |
2.
Type |
3.
Gender |
4.
Number |
5.
Case |
T |
d
= definite |
f
= fem.
m = masc. |
s
= sing
p = pl. |
g = gen. |
Example :
<w msd=Td-s>an</w>
<w msd=Td-p>na</w>
<w msd=Tdfsg>na</w>
7.
ADVERB |
1.
POS |
2.
Type |
3.
Degree |
4.
Function |
5.
Wh-ness |
R |
g
= general
d = direction
i = intensifier
q = interrogative
t = temporal
l = locative |
b
= base
c = comparative
s = superlative |
m
= mod.
s = spe.
|
n/a |
Example :
<w msd=Rg>síos</w>
<w msd=Rr>mar</w>
<w msd=Rq>conas</w>
<w msd=Rd>siar</w>
8.
ADPOSITION |
1.
POS |
2.
Type |
3.
Formation |
4.
Gender |
5.
Number |
|
S |
p
= preposition |
c = compound (multi-word)
a = with article
p = with poss. det.
r = with rel. part.
d = with degree part.
o = with obj. pron.
|
|
s
= sing
p = pl.
|
|
Example :
<w msd=Sp>le</w>
<w msd=Spa>sa</w>
9.
CONJUNCTION |
1.
POS |
2.
Type |
3.
Ctype |
4.
Coord-pos |
C |
c
= coordinate
s = subordinative |
w
= with copula
|
s
= past tense |
Example :
<w msd=Cc>agus</w>
<w msd=Cs>go</w>
10.
NUMERALS |
1.
POS |
2.
Type |
3.
Gender |
4.
Number |
5.
Case |
M |
c
= cardinal
o = ordinal
n = numeral
p = personal
r = roman
s = operator |
n/a |
n/a |
n/a |
Example :
<w msd=Mc>trí</w>
<w msd=Mo>chéad</w>
<w msd=Mp>triúr</w>
<w msd=Mr>iii</w>
<w msd=Ms>=</w>
11.
INTERJECTION |
1.
POS |
I |
Example :
<w msd=I>Ora</w>
12.
UNIQUE MEMBERSHIP CLASS |
1.
POS |
2.
Particle Type |
3.
B-Function |
U |
v = vocative
c = comparative/superl.
a = adverbial
m = numeral
d = degree
p = patronym
w = copular particle |
|
Example :
<w msd=Uv>a</w>
<w msd=Ua>go</w>
<w msd=Up>Uí</w>
13.
RESIDUALS |
1.
POS |
2
Type |
3 Sub-category |
X |
a = acronym
d = date
e = e-mail address
l = list item
n = number
t = toponym
w = website address
x = unknown
y = symbol (changed from s 2011) |
|
|
f = foreign |
e = English
ev = E. verb
en = E. noun
enp = E. noun proper
enpd = E. noun prop definite
ea = E. adjective
er = E. adverb
ed = E. determiner
ec = E. conjunction
en = E. number
|
|
s = spoken |
c = spoken - communicator
ce = spoken - communicator - English
e = spoken - event (cough, laugh etc.)
f = spoken - filler(em, mmm, ahem etc,)
p = spoken - phonetic fragment (e.g. m-, dr- etc.)
x = spoken - indecipherable |
Example :
<w msd=Xf>chevalier</w>
<w msd=Xfen>studio</w>
<w msd=Xfenpd>Bahamas</w>
<w msd=Xe>sunny@galaxy.com</w>
<w msd=Xsc>mar a déarfá</w>
<w msd=Xsce>you know</w>
14.
PUNCTUATION |
1.
POS |
2.
Type |
F |
e
= sentence final
i = sen. Internal
a = quote/par init.
z= quote/par fin.
b= hyphen/ underscore/ dash
q = ? |
Example :
<w msd=Fi>;</w>
<w msd=Fb>-</w>
<w msd=Fq>?</w>
15.
ABBREVIATION |
1.
POS |
Y |
Example :
<w msd=Y>lch</w>
16.
COPULA |
1.
POS |
2.
Tense/Mood |
3.
Clause Type |
4.
Mood |
5.
Neg/Aff |
6.
Person |
W |
p
= pres/fut
s = past/cond |
d = dependant
r = relative (direct)
s = relative (indirect)
|
i
= indic.
s = subjunct.
q = interrog. |
n
= negative |
3 = third pers. pronoun |
17.
VERBAL PARTICLE |
1.
POS |
2.
Type |
3.
Mood |
4.
Tense |
Q |
q
= interrog.
n = neg
|
m = imperative
q
= interrog.
r = relative
s = subjunct.
|
s
= past |
<w msd=Q-r>a</w>
<w msd=Q>ní</w>
Ar
Ais | Back
|