Upload
vominh
View
212
Download
0
Embed Size (px)
Citation preview
Table of Contents
Acknowledgements………………………………………………………………….….2
Introduction………………………………………………………………………….....3
1 Hungarian Vowel Harmony……………………………………………………........5
1. 1 The Hungarian Vowel Inventory………………………………………………........5
1. 2 Vowel Harmony……………………………………………………………….........6
1. 2. 1 The Role of Vowels in Harmony……………………………………………........7
1. 2. 2 The Role of Vowel Harmony in the Selection of Suffixes………………….........8
1. 2. 3 Stems and Harmony................................................................................................9
1.2.4 The Behaviour of Neutral (Transparent) Vowels...................................................14
1.2.4.1 The Vowel e.........................................................................................................15
2 Research and Data Analysis......................................................................................17
2. 1 Vacillating Stems – Data from Online Corpora.......................................................17
2. 2 Relative Frequency of Suffixes – Data from the HNC.............................................22
2. 3 Vacillating Stems and the Role of Paradigmatic Uniformity...................................23
2. 4 A Brief Comparison of Various Data.......................................................................27
3 Optimality Theory and Vacillating Stems………………………………………....29
3.1 The Basic Principles of Optimality Theory……………………………………..….29
3.2 An Optimality Theoretic Analysis of Vacillating Stems………………………..….32
3.2.1 Partially Ranked Constraints……………………………………………………..32
3.2.2 Unranked Constraints………………………………………………………….....45
Conclusion……………………………………………………………………………..54
Bibliography...................................................................................................................56
2
Acknowledgements First of all, I would like to say thanks to my mentor and supervisor Dr. Szilárd
Szentgyörgyi, who aroused my interest in linguistics, especially in English and
Hungarian phonology. He encouraged me to partake in the Students’ Research Circle
Movement and to write my thesis on this topic. He also supported me in continuing my
study and work in this field after graduation, too.
I am greatly indebted to Zara Wanlass from the University of Iowa for providing
me with the raw data of her analysis and thus helping me with my research.
I am also very grateful to some of my fellow students and co-founders of the
Linguistics Research Circle (also known as NyeKK), Anna Csernák, Gergely Harrach
and Krisztina Szöllősi and my phonetics and phonology seminar leader from the
Institute of German Studies, Balázs Huszka, who also encouraged me to do individual
research and supported me throughout this process.
A special thanks goes to Anna Csernák, who acted almost like a second
supervisor to me when I began to write my conference paper, sacrificing a lot of her
time and energy to help me with my research. She never hesitated to provide me with
books, articles or even some good ideas whenever I needed it and she never let me give
it up.
Last but not least, I would like to thank my fellow students and professors from
the Institute of English and American Studies for making the years spent at this
university a pleasant and rewarding experience.
3
Introduction
Vowel harmony is a much-studied area of the Hungarian language. This paper studies a
phenomenon that proves to be difficult not only for foreign learners but can also be
problematic for native speakers. There are certain words that contain a back harmonic
vowel followed by one or more neutral (transparent) vowels. These are referred to as
vacillating stems because they can take back as well as front suffixes (e.g. fotelban /
fotelben, ’armchair-iness’, hidrogénnak / nek ’hydrogen-iness’). Both forms are equally
grammatical and used by native speakers. Some speakers use only the back or the front
variant of the same suffixed stem, whereas others tend to vacillate between the two
variants or use the back variant with some suffixes and the front one with other suffixes.
Vacillating stems have been subject to a great deal of research recently. For
example, with the help of the ’wug’-test (Hayes & Londe 2005) Hungarian speakers
were asked to attach suffixes to words that do not exist in the Hungarian language but
do not violate its phonotactic rules. Students in the Padányi Secondary School in
Veszprém were also given a questionnaire with sample sentences containing vacillating
stems in both forms (i.e. each word with a front and a back suffix). Students were asked
to choose the form that they consider more natural (Csernák 2006). Research was
carried out in online corpora, namely in the Hungarian National Corpus and the Magyar
Webkorpusz on the Szószablya project website (Wanlass 2007).
It is a widespread assumption that vacillating stems, as opposed to harmonic and
neutral stems, do not show uniformity within one paradigm. The purpose of our data
collection and statistical analysis was to elucidate whether these stems in fact behave
inconsistently in the paradigm or show some kind of uniformity. Therefore the focus of
our data analysis was the paradigm of each collected and tested vacillating stem.
According to McCarthy (2005), inflectional paradigms (like the paradigms of
vacillating stems – the suffixed forms) greatly differ from derivational paradigms. In an
inflectional paradigm each member has an inpact on the other members and each
member can become the base of an assimilation process, ensuring paradigmatic
uniformity. Therefore, there is no need for one particular base for paradigmatic
uniformity (as opposed to derivational paradigms (Benua 1997). The other purpose of
our analysis was to find out whether there is no need for one base (i.e. a form to which
other members of the paradigm tend to conform), if it is possible to talk about
paradigmatic uniformity in the case of vacillating stems.
4
The study in the paper is based on a survey carried out in the Padányi Secondary School
in the autumn of 2005 (Csernák, 2006) and data collected from online corpora. Suffixed
forms of vacillating stems were collected from the Hungarian National Corpus and the
Magyar Webkorpusz on the Szószablya project website (Wanlass, 2007).1 The relative
frequency of suffixes was tested on the Hungarian National Corpus.
The phenomenon will be dealt with in the framework of Optimality Theory
(Prince & Smolensky 1993). According to this theory, the surface representation of a
word is the result of language-specific ranking of universal constraints. This paper
attempts to model the distribution of front and back suffixes with the help of partially
ranked constraints in the first part of the optimality theoretic analysis and with unranked
constraints in the second part.
The structure of the paper is the following. In section 1, we provide an overview
of Hungarian vowel harmony, including vacillating stems and their vowels and the
peculiar behaviour of the vowel e. In section 2, we discuss the surveys and analyse their
data. In section 3, we give a short description of the main principles of Optimality
Theory, then we attempt to give an optimality theoretic account for the behaviour of
vacillating and anti-harmonic stems based on the data collected from various sources.
1 I am greatly indebted to Zara Wanlass (University of Iowa) for providing me with the raw data of
her research. All errors are mine.
5
1 Hungarian Vowel Harmony
1. 1 The Hungarian Vowel Inventory
The Hungarian vowel inventory based on the description of (Ringen &Vago 1998: 394)
is the following:
Front Back
[- rounded] [+ rounded] [- rounded] [+ rounded]
(1)
short long short long long short long
High (close) i í ü ü u ú
Mid é ö ő
o ó
Low (open) e á a
The places of articulation of the fourteen vowels are shown in the above table. The
features front and back refer to the horizontal movement of the tongue. We differentiate
high, mid and low vowels according to the vertical position of the tongue. The
distinctive features [+ rounded] and [- rounded] refer to the involvement of lips in the
articulation. The fourteen vowels can be divided into seven long-short pairs. The [+
high] vowel pairs only differ in length as opposed to [- high] vowels, which differ both
in length and quality.
The contrast between the duration (length) of the mid vowel pairs o-ó and ö-ő is
phonologically relevant: they do not only differ in length but also in closeness. The
most striking contrast is to be found in the case of the a-á and e-é vowel pairs.
However, several words that contain these vowels show long-short alternation in the
singular-plural forms (nyár ’summer Sg’- nyarak ’summer Pl”, kéz ’hand-Sg’ – kezek
’hand-Pl’), just like ordinary vowel pairs that do not really differ in quality (víz ’water-
Sg’ – vizek ’water Pl”, tő ’stem-Sg’ – tövek ’stem-Pl”). This proves that they are long-
short pairs regardless of the difference in quality. (Nádasdy & Siptár 1994: 63-64)
É is a central vowel as far as the vertical position of the tongue is concerned.
However, from a phonological point of view, it does not belong to that group, being the
long counterpart of e. Moreover, it behaves as the front pair of á in suffixes (-nál / -nél,
-vá / -vé), therefore it might as well be classified as an underlyingly low vowel. Length
also plays an important role in the sonority sequence: the long member of the long-short
6
vowel pairs is more sonorant (except for the e-é pair – then the short one). Lower
vowels are more sonorous than higher ones; rounded vowels are more sonorant than
their unrounded counterparts (Ringen 1999). However, others claim the opposite
(Kassai 1998: 126).2
1. 2 Vowel Harmony
In several languages certain segments within one particular domain (e. g. a word or a
syllable) must have certain distinctive features that are identical. This phenomenon is
called harmony in phonology. Harmony can involve consonants or vowels. (For
example, there are languages in which only obstruents are involved.)
The most widely accepted means of describing harmony is spreading. It means
that a feature in question is spreading from a segment to the other adjacent segments
that are able to have this feature. Spreading might be obstructed by the so-called
blocking segments. If a consonant or a vowel (in our case: a vowel) behaves as a
blocking segment, the adjacent vowels to the right will be placed into another domain,
i.e. they will not be involved in the harmony. As opposed to blocking segments, neutral
(transparent) segments will not take part in the process.
As mentioned earlier, in Hungarian only vowels take place in this process. There
are two basic types of harmony: fronting (palatal) harmony and rounding (labial)
harmony. The domain of fronting harmony is the word. It involves the stem and the
suffixes, but complex words do not belong here as the appearance of a new root (e.g.
láb][fej) signifies the beginning of a new domain. Words with prefixes do not count,
either, because they also belong to more than one harmonic domain.
Fronting harmony is to be observed in the majority of Hungarian words –
usually, we find only front (születéseteknek ’your birth-dat’) or back
(boldogtalanságotoknak ’your sadness-dat’) vowels within one word. However, there
are words that contain both front and back vowels (hotel ‘hotel’, allűr ‘allure’, maci
‘teddy bear’, fiú ‘boy’) and certain suffixes are non-harmonic by nature (e.g. fazoníroz
’trim’, bodorít ’curl’, kalapért ’to / fot the hat’, mindenkor ’everytime’).3
2 On Hungarian vowels see (Bolla: 1982), (Bolla: 1995), (Gósy: 2004), (Kassai: 1998), (Kenesei: 2000), (Szende: 1976) 3 For more details on Hungarian vowel harmony see (Benus (to appear), (Dienes: 1997), (Oostendorp: 2004), (Rebrus 2000), (Reiss: 2003), (Ringen & Szentgyörgyi: 2004)
7
1. 2. 1 The Role of Vowels in Harmony
As far as harmony is concerned, Hungarian vowels are divided into two groups: neutral
(transparent) vowels and harmonic vowels. Front unrounded vowels like e, é, i, í are
neutral – most ’mixed’ words contain at least one of these vowels. They do not play a
role in vowel harmony. For instance, the noun maci ’teddy bear’ will take a back suffix
(macinak ’teddy bear-dat’) because the only harmonic vowel in the stem (that plays a
role in the process) is a back vowel. All [+ back] vowels are harmonic, whereas only
rounded [- back] vowels (ö,ő,ü,ű) are harmonic. Unrounded [- back] vowels are neutral.
The vowel e has a special role: sometimes it behaves as a harmonic vowel, sometimes
as a neutral vowel. (Nádasdy & Siptár 1994: 96)
The following table shows the role of vowels in harmony (the % indicates that the
vowel in question sometimes behaves as a harmonic vowel, sometimes as a neutral
vowel):
(3)
HARMONIC ROLE HORIZONTAL MOVEMENT OF THE
TONGUE HARMONIC
VOWELS
NON-HARMONIC
VOWELS
FRONT VOWELS ö, ő, ü, ű, (e %)
BACK VOWELS a, á, o, ó, u, ú
i, í, é, (e %)
(Nádasdy & Siptár 1994: 96)
8
1. 2. 2 The Role of Vowel Harmony in the Selection of Suffixes
Hungarian vowel harmony is governed by the stem vowel. It has a definite direction and
the harmonic value of the suffix depends on the harmonic value of the stem vowel. It is
only possible if suffixes are able to alternate in accordance with the stem. (Nádasdy and
Siptár 1994: 96)
Examples of alternating suffixes are the following (Siptár & Törkenczy 2000: 65):
(4)
ú ű (láb-ú ’-legged’, fej-ű ’headed’)
u ü (ház-unk ‚our house’, kert-ünk ’our garden’)
ó ő (vár-ó ’waiting’, kér-ő ’asking / suitor’)
o e ö (ház-hoz ’house-all’, kert-hez ’garden-all’,vödör-höz ’bucket-all’)
á é (vár-ná ’would wait for it’, kér-né ’would ask for it’)
á e (ház-ban ’house-iness’, kert-ben ’garden-iness’)
Non-alternating suffixes contain either only neutral vowels (hatig ’till six o’clock’,
katonaként ’as a soldier’, Kovácsék ’the Kovács (Smith) family’) or back harmonic
vowels that do not harmonise with the stem vowel (ötkor ’at five o’clock’, fazoníroz ’to
trim’, csekkol ’to check’) (Kertész 2003). No non-alternating suffix contains an e.
(Nádasdy & Siptár 1994: 108)
9
1. 2. 3. Stems and Harmony
Stems are divided into the following groups: If the last vowel in the word is harmonic,
the stem is called a harmonic stem. (I) If the last vowel is neutral (transparent), the stem
is neutral. (II) There are simple (A) and complex (B) stems within each group.
IA Simple harmonic stems: the last vowel of the stem is harmonic. They might contain
further harmonic vowels that harmonise with the last vowel as far as frontness is
concerned. They might contain neutral vowels, but not in the last syllable. Suffix
harmony is governed by the frontness of harmonic vowels:
(5) IA-f: TŰZ-type
[- back]
tűz 'fire' tűznek 'fire-dat' tűztől 'fire-abl'
tükör 'mirror' tükörnek 'mirror-dat' tükörtől 'mirror-abl'
öröm 'joy' örömnek 'joy-dat' örömtől 'joy-abl'
szemölcs 'verruca' szemölcsnek 'verruca-dat' szemölcstől 'verruca-abl'
rézsű 'slope' rézsűnek 'slope-dat' rézsűtől 'slope-abl'
(6) IA-b: HÁZ-type
[+ back]
ház 'house' háznak 'house-dat' háztól 'house-abl'
kupa 'goblet' kupának 'goblet-dat' kupától 'goblet-abl'
koszorú 'wreath' koszorúnak 'wreath-dat' koszorútól 'wreath-abl'
bika 'bull' bikának 'bull-dat' bikától 'bull-abl'
csíra 'germ' csírának 'germ-dat' csírától 'germ-abl'
példa 'example' példának 'example-dat' példától 'germ-abl'
hernyó 'caterpillar' hernyónak 'caterpillar-dat' hernyótól 'caterpillar-abl'
10
IB Complex harmonic stems: the last vowel of the stem is harmonic, but the stem
contains other harmonic vowels that do not harmonise with the last vowel as far as
frontness is concerned. The stem might contain further neutral vowels (but not in the
last syllable). These stems are also referred to as ‘disharmonic’. Suffix harmony is
governed by the frontness of the last harmonic vowel:
(7) IB-f: SOFŐR-type
[- back]
sofőr 'car driver' sofőrnek 'car driver-dat' sofőrtől 'car-driver-abl'
allűr 'allure' allűrnek 'allure-dat' allűrtől 'allure-abl'
kosztüm 'costume' kosztümnek 'costume-dat' kosztümtől 'costume-abl'
(8) IB-b: NÜANSZ-type
[- back]
nüansz 'nuance' nüansznak 'nuance-dat' nüansztól 'nuance-abl'
amőba 'amoeba' amőbának 'amoeba-dat' amőbától 'amoeba-abl'
pözsó 'Peugeot' pözsónak 'Peugeot-dat' pözsótól 'Peugeot-abl'
IIA Simple neutral stems: the stem contains neutral vowels only. Since neutral vowels
are front, these stems will take front suffixes.
(9) IIA-f: VÍZ-type
[- back]
víz 'water' víznek 'water-dat' víztől 'water-abl'
szegény 'poor' szegénynek 'poor-dat' szegénytől 'poor-abl'
kert 'garden' kertnek 'garden-dat' kerttől 'garden-abl'
11
There are approximately 60 simple neutral stems that always take back suffixes. These
are referred to as antiharmonic. The vast majority of the words in question contain i, í,
some of them contain é (cél ’goal’, héj ’peel’, derék ’waist’). Some of these stems end
in –a but the a is often lost as a consequence of suffixation (tiszt-a ’clear’, ritka-a ’rare’,
ném-a ’mute’, bén-a ’lame’). Fiú ’boy’ is a special stem that will take a back suffix
after the /u/ is deleted (fiam ’my son’). The behaviour of férfi ’man’ is also peculiar: it is
basically a vacillating stem (it can take back suffixes as well as front suffixes – férfinak
/ nek, -tól / -től), nevertheless, there are only férfias ’masculine’ and férfiak ’man-pl’;
*férfies and *férfiek are non-existent forms.
(10) IIA-b: HÍD-type
[+ back]
híd 'bridge' hídnak 'bridge-dat' hídtól 'bridge-abl'
cél 'goal' célnak 'goal-dat' céltól 'goal-abl'
derék 'waist' deréknak 'waist-dat' deréktól 'waist-abl'
IIB Complex neutral stems: the last vowel is neutral, but the stem also contains (a)
harmonic vowel(s). Suffix harmony is governed by the harmonic vowel that precedes
the neutral vowel of the last syllable. In the case of ÜVEG-type stems, both the neutral
and the last harmonic vowel are front:
(11) IIB-f: ÜVEG-type
[- back]
üveg 'glass' üvegnek 'glass-dat' üvegtől 'glass-abl'
rövid 'short' rövidnek 'short-dat' rövidtől 'short-abl'
örmény 'Armenian' örménynek 'Armenian-dat' örménytől 'Armenian-abl'
12
The transparent nature of neutral vowels manifests itself in its entirety in the case of
PAPÍR-type stems:
(12) IIB-b: PAPÍR-type
[+ back]
papír ’paper’ papírnak ’paper-dat’ papírtól ’paper-abl’
dózis 'dose' dózisnak 'dose-dat' dózistól 'dose-abl'
kordé 'barrow' kordénak 'barrow-dat' kordétól 'barrow-abl'
tányér 'plate' tányérnak 'plate-dat' tányértól 'plate-abl'
kávé 'coffee' kávénak 'coffee-dat' kávétól 'coffee-abl'
haver 'buddy' havernak 'buddy-dat' havertől 'buddy-abl'
balek 'dupe' baleknak 'dupe-dat' balektól 'dupe-abl'
maszek ‘private sector’ maszeknak ‘private sector-dat' maszektól ‘private sector-abl'
What was mentioned above can be summarised with the help of the following table
(Nádasdy & Siptár 1994: 109). (Vacillating stems are indicated by %.)
13
(13)
CONSTRUCTION SUFFIXATION Last vowel Vowels Front Back
I Simple harmonic stems (contain one or more harmonising vowels)
IA-f: TŰZ Tükör ’mirror’ , kürt ’horn’, köszörű ’grinder’, szemölcs ’verruca’, gőz ’steam’ ripők ’rogue’, rézsű ’slope’, revü ’revue’, esztendő ’year’, kesztyű ’glove’
IA-b: HÁZ kupa ’cup’, város ‚town’, koszorú ’wreath’ bika ’bull’, hernyó ’caterpillar, izom ’muscle’, tégla ’brick’, opera ’opera’, bitó ’gallows’, patika ’chemist’s’, stílus ’style’, beton ’concrete’, konkurencia ’concurrence’
I HARMONIC STEMS (the last vowel is harmonic) (a, á, o, ó, u, ú, ö, ő, ü, ű, (% e))
IB Complex harmonic stems (disharmonic stems) (contain several not harmonising vowels)
IB-f: SOFŐR attitűd ’attitude’, operatőr ‘cinematographer’, allűr ’allure ’, kosztüm ’costume’ kódex ’codex’, október ’october’, oxigén ’oxygen’ %dzsungel ’jungle’, %konkrét’concrete’, %analízis ‘analysis’
IB-b: NÜANSZ amőba ’amoeba’, ökonómia ’economy’, bürokrácia ’bureaucracy’ cölibátus ’celibacy’, ökumenikus ’oecumenical’
IIA Simple neutral stems (contain only neutral vowels)
IIA-f: VÍZ rét ’field’, szegény ’poor’, rekettye ’genista’, bili ’pot’, fillér <one hundredth of a Hungarian forint> kemping ’camping’, % férfi ’man’
IIA-b: HÍD (antiharmonic stems) cél ’goal’, derék ’waist’ % férfi ’man’
II NEUTRAL STEMS (the last vowel is neutral) (i, í, é, (% e))
IIB Complex neutral stems (contain harmonic vowels as well)
IIB-f: ÜVEG Rövid ’short’ , tőzeg ’peat’ örmény ’Armenian’, gyülevész ’mob’, hübrisz ’hubris’
IIB-b: PAPÍR kordé ’barrow’, kuvik ’sparrow owl’, csiricsáré ’gaudy’, föderatív ’federative’
14
1. 2. 4 The Behaviour of Neutral (Transparent) Vowels
Not all neutral (transparent) vowels behave in the same way. Monosyllabic words
containing front unrounded vowels usually take front suffixes (e.g. cím + nek ’title-
dat’). If a neutral vowel is followed by a harmonic vowel in the stem, the transparency
of the neutral vowel cannot be proven, because the suffix is determined by the last
harmonic vowel. (Nádasdy & Siptár 1994: 105) There are approximately 60 stems
containing neutral vowels that take back suffixes, as mentioned above (e.g. híd + nak
’bridge-dat’). (Nádasdy & Siptár 1994: 103)
Two factors can influence the choice of suffixes in the case of at least disyllabic
stems. The first one is called the height effect, based on the height of the rightmost
vowel: the phonologically low vowel [ε] occurs with front suffixes more often than the
mid vowel [e:], which occurs with front suffixes more often than the high vowels [i] and
[i:]. The radír-type complex neutral stems will take back suffixes (radír + nak). If the
last syllable contains an é, the stem is likely to vacillate between back and front suffixes
(konkrét + nak/nek ’concrete’). Stems that contain an e as the last vowel are the most
likely to take front suffixes (dzsungel + nak / nek ’jungle’). The second generalisation is
referred to as the count effect. For example, BNN (back + neutral + neutral) stems take
front suffixes more often than BN (back + neutral) stems (Hayes & Londe: 6).
Generally, high vowels are the most transparent, thus the least harmonic. The
mid vowel é is less transparent and more harmonic than the high vowels i and í. The
low vowel e is the most harmonic and the least transparent. The hierarchy of
transparency is thus: i > í > é > e. (Kontra & Ringen 1986: 2)
Not only quality but quantity can also influence the choice of suffixes. If a back
vowel in a stem is followed by several transparent vowels, the stem will vacillate in
most of the cases:
(14)
back vowel + i + i aszpirin 'aspirin' + nak / nek
back vowel + i + é oxigén 'oxygen', klarinét 'clarinet' + nak / nek
back vowel + e+ i / í hokedli 'kitchen stool', agresszív 'aggressive' + nak / nek
15
However, there are exceptions (based on data collected from the Hungarian National
Corpus):
(15)
back vowel + i + e manifeszt 'manifest' + *nak / nek
back vowel + é + e katéter 'catheter' + *nak / nek
o Stems can vacillate when a harmonic vowel is between two neutral vowels
(hidrogén + nak / nek ’hydrogen-dat’) (Csernák 2006) However, krizantém
’chrysantemum’ only takes front suffixes (based on the Hungarian National
Corpus).
o If the harmonic stem vowel is followed by more than one e vowels, the e will
always behave as a harmonic vowel and take front suffixes (e.g. november +
*nak / nek ’november-dat’). (Siptár & Törkenczy 2000)
o Less frequent words are more likely to take back suffixes. (Hayes & Londe
2005: 15)
1. 2. 4. 1 The Vowel e
As mentioned earlier, e seems to be the odd-one-out among front unrounded vowels.
There is no stem in which a back harmonic vowel is followed by an e and will always
take a back suffix. (Kontra & Ringen 1986: 4), (Nádasdy & Siptár 1994: 108) There are
no non-alternating suffixes containing e. (Kontra & Ringen 1986: 4), (Nádasdy & Siptár
1994: 108) There are no anti-harmonically suffixed stems containing e (Kontra &
Ringen 1986: 4), (Nádasdy & Siptár 1994: 108), unlike i (lik + as ’with holes’), í (híd +
nak ’bridge-dat’) and é (cél + hoz ’aim-all’). In the case of the csekkol-type stems it is
the otherwise alternating suffixes that are incapable of alternation.4
Several ideas have been proposed about the underlying representation of the e
vowel: (i) we can assume that there are two types of e in the mental lexicon: one of
them is harmonic, the other is neutral (transparent) (Nádasdy & Siptár 1994: 96), or (ii)
the entire word (i.e. the vacillating stem) has two different underlying forms (e.g. fotel
’armchair’– containing a neutral e or a harmonic e) (Siptár & Törkenczy 2000: 169); 4 For detailed analysis of this phenomenon see (Kertész 2003)
16
(iii) e can also be regarded simply as a harmonic vowel (Ringen & Vago 1995: 311,
Rebrus & Törkenczy 2005: 279).
Although these assumptions make the analysis of these phenomena easier, they
are not totally acceptable. If we assume that there are two underlying forms of a
segment, the two underlying forms will have only one surface representation that will
sometimes behave as harmonic, sometimes as neutral. The idea that there are two
underlying representations of a word is not plausible, either, as duplication in the
lexicon should be avoided. Furthermore, if e is harmonic, how would we explain the
fact that only stems containing e (but not other harmonic vowels) vacillate? This would
be a great coincidence.5
5 On vacillating stems see (Benus (to appear)), (Deme: 1994), (Kontra & Ringen: 1996), (Siptár: 1993)
17
2 Research and Data Analysis
This paper reports data collected from various sources. The inflected forms of
vacillating and anti-harmonic stems were collected from the Hungarian National
Corpus6 (henceforth HNC) and the Magyar Webkorpusz, on the Szószablya project
website7 (Wanlass 2007). Inflected forms of nouns were tested in the HNC in order to
measure the relative frequency of suffixes. The third source of data is a survey carried
out in the autumn of 2005 in the Padányi Secondary School (Csernák 2006).
It has been proven by research that vacillating stems do not show uniformity
within one paradigm in the way harmonic and anti-harmonic stems do. The aim of this
study is to investigate whether the behaviour of these stems is totally unpredictable or
there is a certain analogy behind it. It has also been mentioned that inflectional
paradigms are different from derivational paradigms as far as paradigmatic uniformity is
concerned (McCarthy 2005). In inflectional paradigms each member has an inpact on
the other members and each member can become the base for an assimilation process,
ensuring paradigmatic uniformity. Therefore there is no need for one particular base for
paradigmatic uniformity (as opposed to derivational paradigms (Benua 1997). The other
aim of our analysis was to find out whether there is no need for one base (i.e. a form to
which other members of the paradigm tend to conform), if it is possible to talk about
paradigmatic uniformity in the case of vacillating stems.
6 http://corpus.nytud.hu/mnsz/index_hun.html The corpus was started started in 1998 at the Department of Corpus Linguistics of the Research Institute for Linguistics of the Hungarian Academy of Sciences (HAS) under the supervision of Tamás Váradi. The objective was to create a 100-million-word balanced reference corpus of present-day Hungarian. From 2002 began a new effort to extend the area of data collection to the Hungarian language use of the whole Carpathian Basin in Hungarian Language Corpus of the Carpathian Basin project. Aim was to create a 15-million-word corpus of Hungarian language beyond the borders of Hungary. The truly national Hungarian National Corpus, containing language variants form Slovakia, Subcarpathia, Transylvania and Vojvodina also, was introduced in November 2005. The first Hungarian corpus covering language variants from also beyond the border of Hungary have been completed as the result of joint work of the Hungarian Language Offices and the Department of Corpus Linguistics. 7 http://www.szoszablya.hu
18
2. 1 Vacillating Stems – Data from Online Corpora
In the original study (Wanlass 2007) the following fourteen common alternating
suffixes were measured:
(16)
Case Front Back Gloss
Instrumental -vel -val -with
Translative -vé -vá -(turn) into
Illative -be -ba -into
Inessive -ben -ban -in
Superessive -en/ön -on -on
Elative -ből -ból -out of
Dative -nek -nak -to/for
Adessive -nél -nál -at
Sublative -re -ra onto
Allative -hez/höz -hoz to
Delative -ről -ról of/about/from top of
Ablative -től -tól from
Comparative -ebb -abb more (ex: bigger)
Plural -ek -ak
(Wanlass 2007)
As the Accusative -t is a very common suffix but was not included in the above table,
we completed the list with it. Although -ak is listed above as the only back plural suffix,
the default form of it is -ok if the given word is not a lowering stem.8 As every stem was
suffixed as if it was a lowering stem, we collected these data again by adding the suffix
–ok to words that are not lowering stems. The additional inflected forms with the
aforementioned suffixes (that were not included in the previous study) were collected
from the Hungarian National Corpus.
Among the words included in the study (APPENDIX I) were stems classified as
vacillating (e.g. hotel ’hotel’, agresszív ’aggressive’, konkrét ’concrete’ etc), neutral
(e.g. papír ’paper’, dózis ’dose’, kávé ’coffee’) and disharmonic (oxigén ’oxygen’,
varieté ’cabaret’, november ’november’) by Siptár & Törkenczy (2000). We added
8 On lowering stems see (Szentgyörgyi: 1998)
19
several other words to the list that can also be interesting, for example férfi ’man’ and
izé <a filler>. Both contain only front unrounded (i.e. neutral vowels), however, both
words exhibit vacillation. Férfi is a stem containing only neutral vowels, which,
interestingly enough, takes back suffixes in the vast majority of the cases. There is one
suffixed form of this stem that never takes front suffixes (férfies is regarded as
ungrammatical, only férfias ’manly’ is used by native speakers of Hungarian.
Although izé <a filler> is typically used in speech, it might be found in
colloquially written texts, therefore it is worth searching in online corpora as well. It is a
neutral stem and would normally take front suffixes. However, it is often heard with a
back suffix in colloquial speech. (For example: Mondtam az izénak…’I told it to
the…what he’s called…’) Data from online corpora indicate that this variation only
occurs in speech, but not in writing.
Several other words were added, too. We tried to choose words of different
forms that could be vacillating:
o Stems containing a back vowel followed by a front neutral vowel (e.g.
fotel ’armchair’, farmer ’jeans’ or ’farmer’
o Stems containing a back vowel followed by two front neutral vowels
(e.g. porfirin ’porphyrin’, ametiszt ’ametyst’, katéter ’catheter’)
o Stems containing two back harmonic vowels followed by a front neutral
vowel (e.g. produktív ’productive’, pantomim ’pantomime’)
o Stems that contain a front neutral vowel followed by a back vowel and a
front neutral vowel (e.g. krizantém ’chrysanthemum’)
o Stems that contain two or three back vowels followed by one or two front
vowels (e.g. harakiri ’harakiri’, szalamander ’salamander’, paralízis
’paralysis’)
o Least common types of stems that might vacillate (e.g. hipochonder
’hypochondriac’ (two back vowels between two neutral vowels),
vegetatív ’vegetative’ (two neutral vowels followed by a back and a
neutral vowel), kvalitatív ’qualitative’ (a back vowel is followed by a
neutral, a back and a neutral vowel)
The additional more than 200 stems were collected from the Hungarian Dictionary and
will be tested in the Hungarian National Corpus and other Internet sources / corpora.
The analysis of these additional data is still in the process.
20
As a result of the aforementioned changes and additions, the data reported in this paper
do not correspond precisely to those in the original study (Wanlass, 2007).
It is also interesting to note that certain words that were identified as vacillating
stems exhibit an amount of vacillation (e.g. analízis ’analysis’, klarinét ’clarinet’) that is
smaller than haver ’pal’ that is considered to be neutral (APPENDIX I).9
The data collected show that it is hard to make generalisations about the
behaviour of different types of vacillating stems based on their form, that is, not all
stems of the same form display the same extent of vacillation. However, there are
general tendencies to be observed. In the following, we attempt to describe these
tendencies about the behaviour of different types of vacillating stems:
BN (back-neutral) stems: Stems containing a back vowel followed by a neutral
vowel (e.g.: fotel ’armchair’, hotel ’hotel’, haver ’pal’, farmer ’jeans’ or ’farmer’):
These stems take mostly front suffixes, especially those that contain e. Haver ’pal’
seems to be an exception because it takes back suffixes in the majority of the cases. The
words of this type that take mostly back suffixes tend to be the ones that have i, í or é as
their last stem vowel. That might be due to that fact that e is the most harmonic and i,í
are the least harmonic among neutral vowels. This is reflected by the tendencies to take
front or back suffixes: Stems containing e are the most likely to take front suffixes and
stems having an é in the stem are more likely to take front suffixes than stems
containing an í as the last stem vowel but are more likely to take back suffixes than
stems with a vowel e (based on data collected from the Hungarian National Corpus).
BNN (back-neutral-neutral) stems: Stems that contain a back vowel followed by
two neutral vowels (e.g. porfirin ’porphyrin’, ametiszt ’ametyst’, katéter ’catheter’):
These stems are more likely to take front suffixes than back suffixes, except for matiné
’matinee’ and klarinét ’clarinet’. The reason for matiné taking mostly back suffixes and
other stems of this type taking front suffixes might be explained by the fact that certain
polysyllabic words having similar endings to common Hungarian or foreign free or
bound morphemes tend to take suffixes as if they were compounds. (For the detailed
description of this phenomenon see chapter 2.3.)
9 For the detailed analysis of the discrepancies between dividing words into groups (neutral, disharmonic, vacillating) see (Wanlass 2007)
21
BBN (back-back-neutral) stems: Stems that contain two back vowels followed by a
neutral vowel (e.g. produktív ’productive’, pantomim ’pantomime’):
These stems take mostly back suffixes. Words of this type can also take back suffixes
that seem to be treated as compounds. (See chapter 2.3.)
NBN (neutral-back-neutral) stems: Stems that contain a neutral vowel followed by
a back vowel and a neutral vowel (e.g. krizantém ’chrysanthemum’, destruktív
’destructive’):
Some of these stems tend to take mostly front suffixes, whereas others take back
suffixes in the majority of cases. Stems, in which the back vowel is between i and e/é
(e.g. bitumen ’bitumen’, hidrogén ’hydrogen’) take mostly front suffixes, except for the
word krizantém ’chrysanthemum’. Stems like leander ’oleander’, that contain a back
vowel sandwitched between two e-s, take almost exclusively front suffixes. Words, in
which the back vowel is between an e and an i/í (e.g. destruktív ’destructive’, deduktív
’deductive’), take mostly back suffixes. Words like illusztris ’illustrious’, in which the
back vowel is between two i-s, also take back suffixes in the vast majority of the cases.
These tendencies also show that e might be more harmonic than é and i/í and é is more
harmonic than i and í.
BBNN (back-back-neutral-neutral) and BBBN (back-back-back-neutral) stems:
Stems that contain two or three back vowels followed by one or two front vowels
(e.g. harakiri ’harakiri’, szalamander ’salamander’, paralízis ’paralysis’):
These stems take front suffixes in the vast majority of the cases. However, there is no
sufficient information about these words due to the sparse amount of words of these
type (that are likely to vacillate).
The same thing has to be mentioned in connection with the least common types
of stems that might vacillate such as hipochonder ’hypochondriac’ (two back vowels
between two neutral vowels), vegetatív ’vegetative’ (two neutral vowels followed by a
back and a neutral vowel), kvalitatív ’qualitative’ (a back vowel is followed by a
neutral, a back and a neutral vowel). Hipochonder takes mostly front suffixes, whereas
vegetatív and kvalitatív take back suffixes.
22
2. 2 Relative Frequency of Suffixes – Data from the Hungarian National Corpus
We selected 199 inflected nouns to measure the relative frequency of suffixes. The
words were selected in the following way: we tried to include all the possible types of
nouns (e.g. harmonic, neutral, disharmonic, lowering stems). Vacillating stems were not
included as it would have made the collection of data complicated as vacillating stems
can take two or three forms of the same suffix, whereas other stems only take one form.
The reasons for selecting nouns10 are the following: Nouns can take all the above
suffixes. Adjectives can only take the above suffixes if they behave as nouns. Moreover,
there are certain suffixes that can be misleading if they are attached to nouns. For
example, the superessive -on/-en/-ön alternating suffix has a different meaning if it is
taken by an adjective – in this case, the adjective becomes a modal adverb (e.g.
kedvesen ’nicely’). For similar reasons, the comparative -abb/-ebb was not included
because nouns do not normally take this suffix. It requires further resarch to get a larger
picture of the relative frequency of suffixes, including suffixes frequently attached to
adjectives. The suffixes in question in the order of relative frequency are the following
(based on data collected from the Hungarian National Corpus):
(17)
Suffix Frequency
Ak 316066
At 193892
bAn 125225
On 83236
rA 65282
nAk 60580
vAl 52192
bA 38926
bÓl 32635
hOz 29843
tÓl 20914
rÓl 18472
nÁl 10218
vÁ 5780
10 On the paradigms of nouns see (A. Jászó: 1994), (Keszler: 2000), (Kiefer: 2000)
23
The table shows that the most common suffix attached to nouns is the plural form,
whereas the translative suffix seems to be the least frequent one.
2. 3 Vacillating Stems and the Role of Paradigmatic Uniformity
It has often been mentioned that inflected stems within one paradigm seem to behave in
a uniform way, and they usually take the features of the most frequent suffix (Csernák
2006). For example, if the most frequent suffix has a + back feature, other suffixes in
the same paradigm will also be + back. This is clearly seen in the case of non-
vacillating stems:
(18) Front Frequency Back Frequency Total
fillérek 3080 fillérok 0 3080
fillérrel 1700 fillérral 0 1700
fillérre 401 fillérra 0 401
fillérbe 186 fillérba 0 186
fillérnek 58 fillérnak 0 58
fillérben 48 fillérban 0 48
filléren 40 filléron 0 40
fillérnél 19 fillérnál 0 19
fillérhez 11 fillérhoz 0 11
fillérből 0 fillérból 0 0
fillértől 0 fillértól 0 0
fillérről 0 fillérról 0 0
fillérré 0 fillérrá 0 0
TOTAL 5543 TOTAL 0
In this case, the relatively most frequent suffix happens to be the most common within
the paradigm as well. As the most frequent suffix is - back and the stem does not
exhibit vacillation, all the other stems will be - back. However, as we might expect,
the order of frequency is not exactly the same within the paradigm as in the order of
relative frequency (see table (17)): all paradigms have their own order of frequency. For
instance, in table (18) the second most frequent suffix (accusative) is not even included
since it does not contain a suffix vowel. At the same time, the least frequent suffix in
24
that paradigm is the translative vá/-vé, which happens to be the least common suffix in
the order of relative frequency as well.
What happens in the case of vacillating stems?
(19) FOTEL
Front Frequency Back Frequency Total
fotelben 1984 fotelban 1760 3644
fotelek 1107 fotelok 47 1154
fotelbe 531 fotelba 465 996
fotellel 204 fotellal 78 282
fotelből 44 fotelból 222 266
fotelre 56 fotelra 34 90 fotelen 43 fotelon 13 56
fotelnek 45 fotelnak 8 53
fotelről 0 fotelról 24 24
fotelhez 9 fotelhoz 12 23
fotellé 18 fotellá 0 18
fotelnél 1 fotelnál 1 2
foteltől 0 foteltól 2 2 TOTAL 4042 TOTAL 2666
The most frequent suffix in this paradigm differs from the highest ranked one in the
order of relative frequency (see tables (17) and (19)). The lowest ranked suffix in the
paradigm is not the same as the one in the relative order of frequency, either (see table
(19)). There are several other examples (APPENDIX I) that illustrate that the relative
order of frequency does not necessarily coincide with the order of frequency within a
particular paradigm.
The other thing to be observed is that this stem (fotel ’armchair’) shows an
extent of vacillation, however, most of the suffixes take the backness feature of the most
frequent suffix, i.e. as it is front, most of the suffixes will be front. (Certainly, for it is a
vacillating stem, there are some other suffixes that are used more often with their back
variants.)
Although the analysed words differ in the extent of vacillation, all behave in a
similar way: if the most common suffix in the paradigm (not in the relative order of
frequency) is +/- back, the majority of other suffixes will take the same feature
25
(APPENDIX I). There is only one exception to this generalisation – the word agresszív
’aggressive’11:
(20) AGRESSZÍV
Front Frequency Back Frequency Total
agresszívebb 1382 agresszívabb 1658 3040
agresszíven 1332 agresszívan 121 1453
agresszívek 569 agresszívak 0 569
agresszívvé 238 agresszívvá 269 507 agresszívnek 224 agresszívnak 13 237
agresszívvel 15 agresszívval 3 18
agresszívre 14 agresszívra 0 14
agresszívben 3 agresszívban 2 5
agresszívbe 2 agresszívba 2 4
agresszívnél 0 agresszívnál 3 3
agresszívet 1 agresszívat 0 1
agresszívből 0 agresszívból 1 1 agresszívtől 0 agresszívtól 1 1
agresszívhez 0 agresszívhoz 0 0
agresszívről 0 agresszívról 0 0 TOTAL 3780 TOTAL 2073
The most frequent suffix of agresszív is the comparative -abb/-ebb. As is shown in the
table, the word will usually take the back form of the suffix. However, if we consider
the majority of suffixes, the word will take front suffixes in most cases. Therefore, the
backness of the majority of suffixes is not identical with the backness of the most
frequent suffix.
25 stems out of the measured 282 words (including vacillating and anti-
harmonic stems) could possibly be treated as compounds by native speakers of
Hungarian (in our case: users of the internet). After having collected the relevant data,
we arrived at the following conclusion: 15 out of the measured 25 words have proven to
be treated as compounds by the majority of speakers / writers and, as a result of this,
take mostly front suffixes. These stems included words ending in –gén, -fil and –itisz
and others having an ending identical with common Hungarian words (Olivér, amulett,
ametiszt, karamell, Noémi, november, varieté). Words with an ending that is only
11 It is the only exception to the rule among the collected data (see APPENDIX I), which does not necessarily mean that there are no other exceptions.
26
similar to but not fully identical with a common Hungarian word are generally not
treated as compounds (mokasszin, aldehid) and take back suffixes. However, there are
some words having endings identical with Hungarian words, yet, are not treated as
compounds in the majority of cases (bajadér, koiné, malachit, matiné, parádé, szalámi).
Bajadér, malachit and szalámi contain two back harmonic stem vowels, which might be
the reason for their behaviour, whereas the behaviour of koiné cannot be generalised as
the amount of data found in online corpora is not sufficient for our analysis (there are
only two suffixed forms listed). No suffixed forms of the words kolompér and durábel
are found in the online corpora in question.
The seemingly exceptional behaviour of the word agresszív can be explained by
the fact that it is sometimes treated as a compound, sometimes not.
(21) AGRESSZÍV
Front FrequencyBack Frequency Total
agresszívebb 1382 agresszívabb 1658 3040
agresszíven 1332 agresszívan 121 1453
agresszívek 569 agresszívak 0 569
agresszívvé 238 agresszívvá 269 507 agresszívnek 224 agresszívnak 13 237
agresszívvel 15 agresszívval 3 18
agresszívre 14 agresszívra 0 14
agresszívben 3 agresszívban 2 5
agresszívbe 2 agresszívba 2 4
agresszívnél 0 agresszívnál 3 3
agresszívet 1 agresszívat 0 1 agresszívből 0 agresszívból 1 1
agresszívtől 0 agresszívtól 1 1
agresszívhez 0 agresszívhoz 0 0
agresszívről 0 agresszívról 0 0 TOTAL 3780 TOTAL 2073
As is clearly seen in table (21), this stem usually takes a a front suffix whenever it can
be analysed as a compound (ending in a common Hungarian noun, szív ’heart’) but it
takes a front suffix when treated as a single word, at least, as far as the first few most
common suffixes of the given paradigm are concerned. Being a noun, the word szív has
no comparative form, therefore agresszívebb is not treated as a compound in most of the
cases and can take a back suffix vowel more often than a front suffix vowel. Its
translative form is hardly ever used, either, which excludes the possibility of
27
agresszívebb being analysed as a compound and enables it to take back suffix vowels
more often than front ones. Since the inlative, adessive, elative and the ablative forms of
the the word szív are possible, agresszívbe, agresszívnél, agresszívből and agresszívtől,
respectively, can be analysed as compounds. However, this is not unequivocally so:
agresszívba/be exhibits an equal distribution of front and back suffixes according to the
Hungarian National Corpus (two founds in both cases). Agresszívnál, i.e. the form with
a back suffix vowel, is the only adessive form listed in the HNS (three founds). The
same is to be observed in the case of agresszívból/ből and agresszívtól/től, too (one
found in each case). This tendency, however, cannot be generalised in the case of the
least common suffixes in the paradigm, considering the exiguous amount of data found
in online corpora.
2. 4 A Brief Comparison of Various Data
In the following part we shall compare the results of the research of Wanlass (2007) and
Csernák (2006) to see what can be concluded from the two different collections of data.
Both papers arrive at the conclusion that in the case of vacillating stems the
choice of back and front forms differs from suffix to suffix: some words take the front
vowel version of a certain suffix while others take the back variant. The basic difference
between the two studies is that the survey carried out in the Padányi Secondary School
(Csernák 2006) queries the students’ choices of suffix, that is, students have to decide
consciously which suffix they prefer to use. As opposed to them, users of the Internet
can choose suffixes intuitively, without thinking and being observed. A further
difference is that Csernák (2006) allowed more than one option: students were free to
choose both the front and the back variant of the same suffix. Therefore, the comparison
of the results of the two studies is slightly problematic.
It is mentioned (Csernák 2006) that the suffixed forms of vacillating stems also
tend to show some kind of uniformity within the same paradigm. It is assumed that the
vowels of suffixed forms in a given paradigm are determined by the dative suffix –nAk
in a way that its backness feature will be prevalent in the same paradigm. However, this
assumption is not plausible. As we could see previously, the backness feature of the
vowel in the majority of suffixes is determined by the backness feature of the most
frequent suffixed form of the given word, which changes from paradigm to paradigm.
28
The positive aspect of the study (Csernák: 2006) is that we receive information about
those suffixed forms of the word that are not found in online corpora, for example:
(22) Wanlass (2007) Front Frequency Back Frequency Total
hidrogénre 0 hidrogénra 0 0
(23) Csernák (2006)
Hidrogén BACK FRONT BOTH I. BACK I. FRONT NOT BACK
rA 27,04 67,46 2,39 0,96 2,15 72,96
An exhaustive comparison of the results of the two studies is out of the scope of this
paper.
29
3 Optimality Theory and Vacillating Stems
3.1 The Basic Principles of Optimality Theory
Optimality Theory (henceforth “OT”) is an alternative model proposed by Alan Prince
and Paul Smolensky in 1993. It breaks away with the traditions of rule-based
phonological theories, which explained phonological phenomena with the help of
rewrite rules, i.e. forms stored in the lexicon are mapped onto surface forms in a series
of steps (in a definite order), each defined by a rule. One rule can effect the other either
by inducing (“feeding”) or blocking (“bleeding”) it. Interaction between the rules is
regulated by rule ordering. (Roca & Johnson 1999: 585)
In the OT grammar, there are only two levels of representation: the underlying
level and the surface level. The mapping from the underlying (lexical) level to the
surface level is guided and restrained by universal constraints. There are two
mechanisms playing a role in selecting the output forms: one is the generator that
generates all the possible surface forms from the underlying (lexical) representation, the
other is the evaluator that applies the constraints on the possible candidates (surface
representations), compares the candidates and selects the only optimal candidate.
(24) Generator: input => {candidate, candidate2, candidate3...}
Evaluator: {candidate, candidate2, candidate3...} => output
Evaluation is characterised by parallelism, that is, all the constraints participate in the
evaluation. This process takes place at the same time, therefore there is no order of
application, but there is a hierarchical order of constraints. The optimal surface form is
the candidate that violates the least possible constraints (with respect to the hierarchy of
constraints).
(25)
/input/ constraint 1 constraint 2
candidate (a) *
candidate (b) *!
The optimal candidate is indicated by the pointing finger. The asterisk indicates that a
constraint is violated. If the violation is fatal, and therefore the candidate cannot be
30
optimal, the asterisk is followed by an exclamation mark. Candidate (b) is not optimal
because it violates the highest ranked constraint. Here the concept of dominancy has to
be introduced: the most dominant constraint is on top of the hierarchy. The violation of
the dominant constraint is more serious than the violation of other (lower ranked)
constraints. It is important in the evaluation process which constraint is violated, but the
seriousness of the violation should also be taken into account.
(26)
/input/ constraint1 constraint2
candidate(a) * *
candidate(b) **!
(27)
/input/ constraint1 constraint2
candidate(a) *
candidate(b) * *!
In tableau (26) the highest ranked constraint is violated by both candidates, but it is
violated twice by candidate (b). Although candidate (a) violates the next constraint,
whereas (b) does not, the winner is candidate (a), since the violation of the highest
ranked constraint is the most fatal. In tableau (27) both candidates violate the dominant
constraint, but candidate (b) also violates the next constraint. Since (a) does not violate
it, it is the optimal candidate.
There are two conflicting types of constraint: (i) faithfulness constraints and
(ii) markedness constraints. Faithfulness constraints state that surface forms must
replicate their corresponding underlying forms. Markedness constraints express the
universal tendencies exhibited in the construction of segments.12
An example of faithfulness constraint is:
Ident IO(voice): The value of the feature [voice] of an input segment must be preserved
in its output correspondent.
12 There are marked and unmarked forms / structures in a language. Unmarked structures (e.g. CV syllable structure) are found in any language and are always more general and require less articulatory effort than marked forms (e.g. VC syllable structure), which are not present in all languages and are usually avoided and will only be utilised to express contrast (Kager 1999: 2)
31
An example of markedness constraint is:
*complex coda: coda must not be complex
(28)
/ke:p/ Ident IO(voice)
(a) ke:p (b) ke:b *!
(29)
/εmbεr/ *complex coda
(a) εm . bεr
(b) εmb . εr *!
In tableau (28) candidate (b) violates the faithfulness constraint, because it has a voiced
coda, unlike the underlying form. Candidate (a) is faithful to the underlying form (i.e. it
remains voiceless in the surface representation), therefore it is the winner.
In tableau (29) candidate (b) violates the markedness constraint, because it has a
complex coda, whereas candidate (a) has a simple coda, which makes it the optimal
candidate. Note that all constraints only apply to the possible candidates as output
forms; input forms cannot be regulated by constraints. This principle is called Richness
of the Base.
32
3.2 An Optimality Theoretic Analysis of Vacillating Stems
3.2.1 Partially Ranked Constraints
In this part of the chapter we are not going to give a full explanation for the behaviour
of vacillating stems. Our analysis is based on the following assumption: if there are
certain constraints that do not have a fixed place in the hierarchy, the front as well as the
back suffixes of the stem can be optimal (Ringen & Heinämäki: 1999). It is necessary to
show that these constraints are not ranked in relation to each other because otherwise
only one candidate could be judged as optimal by the evaluator. Constraints that do not
have a fixed place in the hierarchy will only be ranked in the moment of speaking.
The following constraints are from Ringen & Vago (1998):
* : Short unrounded back vowels are not allowed.
Align-R: No vowel intervenes between the right edge of back and the right edge of
the prosodic word.
SPECIFY: Segments should be specified for features.
No-Gap: Gapped configurations are prohibited. A gapped configuration is illustrated
below:
(where B is a possible anchor for F)
We introduce the following constraint:
SPECIFYroot: Segments of the root should be specified for features.
In the following optimality theoretic analysis we assume that vacillating stems are
marked in the lexicon. This explains that, for instance, aszpirin ’aspirine’ and április
’April’ have a similar form (both contain a back harmonic vowel followed by two front
unrounded (neutral) vowels) but behave in a different way: április always takes back
suffixes, whereas aszpirin can take back as well as front suffixes, i.e. it is a vacillating
stem.
33
Due to the markedness of vacillating stems the two constraints Align-R and
SPECIFYroot are not ranked with respect to each other (indicated by the dotted line) in
the following tableau:
(30)
vac/ + vA/ * SPECIFY root
Align-R SPECIFY No-Gap
(a)
+b -b +b
****
(b) A +b -b
****
*
(c)
+b -b
***
(d) I I
+b
**
**
**
(e) I
+b
*!
*
*
*
The highest ranked constraint is * because there are no short unrounded back vowels
in the Hungarian vowel inventory, therefore a word containing such a vowel will never
be optimal. SPECIFYroot and Align-R can be placed in a hierarchical order depending
on whether the word takes a back or a front suffix.
34
(31)
vac/ + vA/ * SPECIFY root
Align-R SPECIFY No-Gap
(a)
+b -b +b
****!
(b) A +b -b
****!
*
(c) +b -b
***
(c) I I
+b
**!
**
**
(e) I +b
*!
*
*
*
If SPECIFYroot is ranked higher than Align-R (31), the optimal candidate will be (c),
a stem with a front suffix. Candidate (e) cannot be optimal because it has a short
unrounded back vowel, which does not exist in Hungarian. Candidate (c) and (d) violate
SPECIFYroot because they contain underspecified vowels in the stem (indicated by
capital letters). Candidates (a), (b) and (c) violate Align-R, therefore the amount of
violation should also be taken into account: (a) and (b) have four violations, whereas (c)
has only three, therefore it is the winner.
(32)
vac/ + vA/ * Align-R SPECIFY root
SPECIFY No-Gap
(d)
+b -b +b
****!
(e) A +b -b
****!
*
(f) +b -b
***!
(d) I I +b
**
**
**
(e) I +b
*!
*
*
*
35
In tableau (32), we can see the ranking of constraints if the optimal candidate happens to
be a stem with a back suffix. In this case, the hierarchy of constraints is changed: Align-
R is ranked higher than SPECIFYroot. Candidate (e) violates the highest ranked
constraint. The next highest ranked constraint is violated by all the other candidates but
(d), therefore it is the optimal form.
The lower ranked constraints SPECIFY and No-Gap play only a minor role in
selecting the optimal candidate.
As mentioned earlier, the hierarchy of the two constraints Align-R and
SPECIFYroot are interchangeable only in the case of vacillating stems due to their
markedness. Other stems that do not exhibit vacillation are unmarked with respect to
this hierarchy. In their case, these two constraints are in a strict hierarchical order:
Align-R is always ranked higher than SPECIFYroot, as shown in tableau (33):
(33)
/ + A/ Align-R SPECIFY root
SPECIFY No-Gap
(a)
+b -b +b
***!
(b) A
+b -b
***!
*
(c)
+b -b
**!
(d) E
+b
*
*
*
Research based on data collected from online corpora (APPENDIX I) has proven that
not only non-vacillating but vacillating stems also show some kind of uniformity within
a particular paradigm. However, this uniformity is certainly not extended to all the
suffixed forms in the paradigm but it concerns the majority of them. The most common
suffixed form in the paradigm seems to trigger uniformity: most of the suffixed forms
tend to share its backness feature. (The only exception of the rule among the words
under scrutiny is agresszív ’aggressive’.)
36
’To account for phenomena like this, paradigm has become an important concept in expressing the phonological relationship between words. It is assumed that the surface shape of a word w , in a given
paradigm x can influence / determine the surface shape of another word w2, even if w2 is not derived from w in a cyclic sense. The nonderivational
paradigmatic relationships received formalisation as Output-Output correspondences in Optimality Theory.’
(Rebrus & Törkenczy 2005: 264)
We propose the following constraint:
Ident-OO(back): The backness of the suffix vowel after the stem has to be identical to
the backness of the suffix vowel of the most common suffixed form of the stem.13
To illustrate this with an example, let us see how a word (norvég ’Norwegian’) behaves
that is a vacillating stem, takes back suffixes more often than front suffixes and its most
common suffixed form is the plural form, that is, norvégok.
(34) NORVÉG
Front Frequency Back Frequency Total
norvégek 134 norvégok 367 501
norvégre 5 norvégra 20 25
norvéget 1 norvégot 21 23 norvégnek 9 norvégnak 7 16
norvégben 2 norvégban 12 14
norvégből 0 norvégból 14 14
norvéggel 1 norvéggal 8 9
norvéghez 1 norvéghoz 5 6
norvégtől 0 norvégtól 2 2
norvégről 0 norvégról 2 2 norvéggé 1 norvéggá 0 1
norvégen 0 norvégon 0 0
norvégbe 0 norvégba 0 0
norvégnél 0 norvégnál 0 0 TOTAL 154 TOTAL 458
13 On Output-Output constraints see (Benua: 1997), (Rebrus & Törkenczy 2005)
37
The above described tendency is clearly shown in table (34). Blue indicates the number
of suffixed forms that tend to share the backness feature of the most common suffixed
form of the stem. Red indicates those forms that veer away from this tendency.
The following tableau contains the stem norvég ’Norwegian’ with partially ranked
constraints:
(35)
vac/ + O / Ident-OO (back)
Align-R SPECIFY root
SPECIFY No-Gap
(a)
+b -b +b
***
(b) O
+b -b
*
***
*
(c)
+b -b
*
**
(d) E
+b
*
**
*
In tableau (35), the constraints Ident-OO(back), Align-R and SPECIFY(root) are not
ranked with respect to each other. The ranking in the moment of speaking will decide
whether the optimal form will be the back or the front variant of the same suffixed form.
The stem norvég is shown in the following tableau with a less common suffix to
account for the general tendency towards uniformity:
(36)
vac/ + O / Ident-OO (back)
Align-R SPECIFY root
SPECIFY No-Gap
(a)
+b -b +b
***!
(b) O
+b -b
*!
***
*
(c)
+b -b
*!
**
(d) E
+b
*
**
*
38
Candidate (b) and (c) violate Ident-OO(back) because (b) contains an underspecified
vowel and (c) has a front suffix vowel unlike the most common suffixed form in the
paradigm (i.e. norvégok). Candidates (a), (b) and (c) violate Align-R – (a) and (b) have
three violations and (c) has only two. Candidate (d) does not violate the first two highest
ranked constraints, therefore it is the winner. The result is a stem with a back suffix
vowel that contains an underspecified vowel in the stem.
(37)
vac/ + O / Align-R Ident-OO (back)
SPECIFY root
SPECIFY No-Gap
(a)
+b -b +b
***!
(b) O
+b -b
***!
*
*
(c)
+b -b
**!
*
(d) E
+b
*
**
*
It is obvious from tableau (37) that even if Align-R is ranked higher than Ident-
OO(back), the optimal candidate is (d), that is, a form with a back suffix vowel.
The front variant will be the winner only when SPECIFY(root) is ranked higher than
Align-R and Ident-OO(back). It is shown in tableau (38):
39
(38)
vac/ + O / SPECIFY root
Align-R Ident-OO (back)
SPECIFY No-Gap
(a)
+b -b +b
***!
(b) O
+b -b
***!
*
*
(c)
+b -b
**
*
(d) E
+b
*!
**
*
Candidate (d), which was the optimal candidate in the aforementioned cases, violates
SPECIFY(root) because it has an unspecified vowel in the stem, therefore it is ruled
out. All of the competing candidates violate Align-R but candidate (c) (a form with a
front suffix vowel) has the fewest violations (two instead of three), thus it is the winner.
The next example is sláger ’hit single’, the most common suffixed form of which is the
plural form with a front suffix vowel, that is, slágerek:
(39) SLÁGER
Front Frequency Back Frequency Total
slágerek 7497 slágerok 0 7497
slágerrel 1420 slágerral 26 1446 slágerré 433 slágerrá 4 437
slágernek 411 slágernak 3 414
slágerben 262 slágerban 2 264
slágerre 157 slágerra 0 157
slágerbe 40 slágerba 0 40
slágerhez 34 slágerhoz 0 34
slágeren 33 slágeron 0 33
slágernél 12 slágernál 0 12 slágerből 0 slágerból 1 1
slágertől 0 slágertól 0 0
slágerről 0 slágerról 0 0 TOTAL 10299 TOTAL 36
40
Table (39) is the paradigm of sláger ’hit single’. It is clearly seen that it takes front
suffixes in the vast majority of cases, just like the most common suffix in the paradigm.
There is only one case when the suffixed forms do not conform to the backness feature
of the most frequent suffixed form in the paradigm. (It is indicated by the colour red.)
However, it is not statistically significant, since the Hungarian National Corpus contains
only one example of this suffixed form.
Tableau (40) contains the word sláger with partially ranked constraints (the hierarchy is
not yet decided):
(40)
vac/ + A/ Ident-OO (back)
Align-R SPECIFY root
SPECIFY No-Gap
(a) +b -b +b
*
***
(b) A +b -b
*
***
*
(c) +b -b
**
(d) E +b
*
*
*
*
In the following tableau, the hierarchy is decided. The winner is a suffixed stem with a
front suffix vowel.
(41)
vac/ + A/ Ident-OO (back)
Align-R SPECIFY root
SPECIFY No-Gap
(a) +b -b +b
*!
***
(b) A +b -b
*!
***
*
(c) +b -b
**
(d) E +b
*!
*
*
*
41
As is seen in tableau (41), if Ident-OO(back) is ranked the highest in the hierarchy, the
optimal candidate can only be (c) – a front variant. Candidates (a) and (d) contain a
back suffix vowel, therefore they violate paradigmatic uniformity. So does candidate
(b), which has an unspecified vowel in the suffix. Thus the only possible winner is (c).
In this case, the hierarchy of the following two constraints, Align-R and
SPECIFY(root) does not count. Therefore, the hierarchy shown in tableau (42) will
also result in a winning candidate (c) (i.e. a front variant):
(42)
vac/ + A/ Ident-OO (back)
SPECIFY root
Align-R SPECIFY No-Gap
(a) +b -b +b
*!
***
(b) A +b -b
*!
***
*
(c) +b -b
**
(d) E +b
*!
*
*
*
In the above tableau, the order of Align-R and SPECIFY(root) is reversed:
SPECIFY(root) is ranked higher than Align-R. However, the winning candidate
remains (c), since Ident-OO(back), which is the highest ranked constraint, plays the
main role in selecting the optimal form.
42
The optimal candidate is also a front variant, if SPECIFY(root) is ranked the highest. It
is shown in tableau (43):
(43)
vac/ + A/ SPECIFY root
Ident-OO (back)
Align-R SPECIFY No-Gap
(a) +b -b +b
*!
***
(b) A +b -b
*!
***
*
(c) +b -b
**
(d) E +b
*!
*
*
*
In tableau (43), only candidate (d) violates SPECIFY(root) because it has an
unspecified vowel in the stem. The second highest ranked constraint, Ident-OO(back)
is violated by candidates (a), (b) and (d), since (a) and (d) contain a back suffix vowel
and (b) has an unspecified one. As (c) has a front suffix vowel, it shares the backness
feature of the most common suffixed form in the paradigm, therefore it does not violate
Ident-OO(back). Candidate (c) is also the only form that has no violations yet, thus it is
the optimal one.
There is another possible ranking of these three constraints, namely, when Align-R is
ranked the highest, as is shown in tableau (44):
(44)
vac/ + A/ Align-R Ident-OO (back)
SPECIFY root
SPECIFY No-Gap
(a) +b -b +b
***!
*
(b) A +b -b
***!
*
*
(c) +b -b
**!
(d) E +b
*
*
*
*
43
All candidates but (d) violate the highest ranked constraint, Align-R. Therefore the
hierarchical order of the next two highest ranked constraints, Ident-OO(back) and
SPECIFY(root) does not matter. The winner is candidate (c), that is, the back variant
of the suffixed form.
If we change the order of Ident-OO(back) and SPECIFY(root), the winner will be the
same candidate, provided that Align-R remains the highest ranked constraint, as is seen
in the following tableau:
(45)
vac/ + A/ Align-R SPECIFY root
Ident-OO (back)
SPECIFY No-Gap
(a) +b -b +b
***!
*
(b) A +b -b
***!
*
*
(c) +b -b
**!
(d) E +b
*
*
*
*
In tableau (45), only candidate (d), the back variant of the suffixed form, violates
SPECIFY(root) because it has an underspecified vowel in the stem. However, this is
not fatal and candidate (d) is the winner regardless of this violation, which was decided
by the highest ranked constraint, Align-R.
The hierarchical order of the three highest ranked constraints, Align-R, Ident-
OO(back) and SPECIFY(root) does not matter in the case of neutral and harmonic
stems. In any case, always the same candidate will be the winner. Let us illustrate this
with the following two tableaux:
44
(46)
/ + A/ Align-R SPECIFY root
Ident-OO (back)
SPECIFY No-Gap
(a) -b +b
**
*
(b) A -b
**
*
*
*
(c) -b
*
(d) -b
*
(47)
/ + vA/ Align-R SPECIFY root
Ident-OO (back)
SPECIFY No-Gap
(a) +b
*
(b) A l +b
**
*
*
*
(c) l +b -b
**
*
(d) l +b
*
Tableau (46) with the word fillér <one hundredth of a Hungarian forint> is an example
of neutral stems that take front suffixes, whereas tableau (47) contains a word (motor
’motorbyke’) that is a harmonic stem, containing back harmonic vowels and taking back
suffixes only. In both cases, the winner is always the same candidate regardless of the
hierarchical order of Align-R, SPECIFY(root) and Ident-OO(back). The optimal
candidate is (c) in the case of neutral and front harmonic stems (the front variant of the
suffixed form) and (d) in the case of back harmonic stems (the back variant of the
suffixed form), since these candidates do not violate any of the constraints Align-R,
SPECIFY(root) and Ident-OO(back).
45
3.2.2 Unranked Constraints
In the following, we are going to give a different account for the behaviour of
vacillating stems. This type of analysis has already been applied to Finnish-language
phenomena (Ringen & Heinämäkki 1999). It is based on the following assumption: If
we have a group of at least four unranked constraints, with their different permutations
we can get not only different candidates but also different proportions of the distribution
of front and back suffixes. In this case, we have six unranked constraints. The following
two constraints were taken from other articles:
Ringen and Vago: Hungarian Vowel Harmony and Optimality Theory (1998)
Align-R: No vowel intervenes between the right edge of back and the right edge of
the prosodic word.
Hayes and Londe: Stochastic Phonological Knowledge: The Case of Hungarian Vowel
Harmony
*[-b][-b][+b]: Two front vowels cannot be followed by a back vowel.
We introduce the following four constraints:
Ident-OO(back): The backness feature of the suffix vowel after the stem has to be
identical to the backness of the suffix vowel of the most common suffixed form of the
stem.
AGREE(str): The backness feature of the suffix vowel has to agree with the backness
feature of the vowel of the stressed syllable.
AGREE(last): The backness feature of the suffix vowel has to agree with the backness
feature of the the vowel of the stem.
AGREE(low): The backness feature of the suffix vowel has to agree with the backness
feature of the lowest vowel of the stem.
Since there are six unranked constraints, there are 720 possible hierarchies, as 6!
= 720. Depending on the 720 possibilites of the ranking of these contraints, the optimal
candidate can either be a back or a front variant of the suffixed form. These constraints
are unranked with respect to each other in the case of non-vacillating stems. The word
fillér <one hundreth of a Hungarian forint> is a neutral stem and always takes front
suffixes. The word motor ’motorbyke’ is a harmonic stem containing only back vowels
and thus takes only back suffixes. These words with unranked constraints are seen in the
following two tableaux (that is, tableaux (48) and (49), respectively):
46
(48)
/ / AGREE str
Align-R Ident-OO back
AGREE low
*-h -h+h
AGREE last
(a) -b +b
*
**
*
*
*
(b) A -b
*
**
*
*
*
(c) -b
(d) -b
*
*
*
(49)
/ / AGREE str
Align-R Ident-OO back
AGREE low
*-h -h+h
AGREE last
(a) +b
(b) A l +b
*
*
*
*
*
(c) l +b -b
*
**
*
*
*
(d) l +b
As is clearly seen in the above tableaux, the hierarchy of the constraints does not matter
in the case of neutral and harmonic stems. Regardless of the ranking of constraints, the
winner is candidate (c) in both cases. In tableau (37) it is the front variant of the suffixed
form (since the stem contains only front vowels), whereas it is the variant with a back
suffix vowel (as the stem consists of back harmonic vowels only). As the winner is
always the same candidate in all of the 720 cases, the distribution of front and back
suffixes is the following: 100% front and 0% back in the case of neutral and front
harmonic stems and 100% back and 0% front in the case of back harmonic stems.
In the case of vacillating stems, the distribution of front and back suffixes
resulting from the 720 different rankings of constraints is not as simple as it is in the
47
case of neutral and harmonic stems. Let us illustrate this with some of the most common
types of vacillating stems.
BN(back-neutral)-type stems: A back vowel is followed by a neutral vowel in the
stem (e.g. hotel ’hotel’, fotel ’armchair’). The word fotel ’armchair’ is seen in tableau
(50) with the six unranked constraints:
(50)
// AGREE str
Align-R Ident-OO back
AGREE low
*-h -h+h
AGREE last
(a) +b -b +b
***
*
*
*
(b) A +b -b
*
**
*
*
*
(c) +b -b
*
**
(d) E +b
*
*
*
These stems take front suffixes in the following cases:
o if Ident-OO(back) is the highest ranked constraint
o if AGREE(low) is the highest ranked constraint
o if AGREE(last) is the highest ranked constraint
o if *-b -b +b is the highest ranked constraint, followed by any of the above
constraints in the hierarchy
If Ident-OO(back) is ranked the highest, a front variant will be the optimal candidate,
since the most common suffixed form in the paradigm of the word fotel ’armchair’ (and
in the paradigm of other stems of this type) is the one with a front suffix vowel.
If AGREE(low) has the highest place in the hierarchy, the optimal candidate is
also the one with a front suffix vowel. The stem fotel contains a mid-vowel (o) and a
low vowel (e), therefore the variant with the suffix vowel e, that is, the front variant will
be that candidate that does not violate this constraint.
If AGREE(last) is the highest ranked constraint, the winner is the suffixed form
with the front suffix vowel. As e is the last vowel in the stem fotel ’armchair’, the
suffixed form will get a front vowel.
48
The constraint *-b -b+b has no significance in the analysis of BN stems (like fotel
’armchair’), as the suffixed forms of these words cannot contain two front neutral
vowels followed by a back harmonic vowel. If this constraint is ranked the highest, it is
not violated by any of the candidates. If it is followed by either Ident-OO(back),
AGREE(low) or AGREE(last), the second highest ranked constraint will be the first to
be violated by some of the candidates. As mentioned above, these constraints are not
violated by suffixed forms containing a front vowel (at least, in this case), therefore
these types of hierarchies will result in a winning candidate with a front suffix vowel.
In all the other cases, the winner is the back variant of the suffixed form. The
percentages can be calculated this way: 3 x 5! + 3 x 4! = 360 + 72 = 432. 432 is 60% of
720, therefore hotel-type stems will take front suffixes in 60% of the cases and back
suffixes in 40% of the cases. Approximately this is the distribution of front and back
suffixes in the case of fotel but not that of hotel (which takes more back than front
suffixes) according to the Hungarian National Corpus. However, according to other data
(Csernák 2006) the proportion of the distribution of front and back suffixes is almost the
same in the case of these words.
BNN (back-neutral-neutral)-type stems: A back vowel is followed by two neutral
vowels in the stem (e.g. agresszív ’aggressive’, aszpirin ’aspirine’, polimer ’polymer’).
The tableau of agresszív ’aggressive’ is the following.
(51)
/ / AGREE str
Align-R Ident-OO back
AGREE low
*-h -h+h
AGREE last
(a) +b -b +b
****
*
*
*
(b) A +b -b
*
****
*
*
*
*
(c) +b -b
*
***
*
(d) E +b
*
*
*
The word in the above tableau is an example of stems that contain an a followed by an e
and an í.
49
These stems take front suffixes in the following cases:
o if AGREE(last) is the highest ranked constraint in the hierarchy
o if *-b -b +b is the highest ranked constraint in the hierarchy
o if AGREE(low) is the highest ranked constraint in the hierarchy, followed by
any of the above mentioned constraints.
If AGREE(last) has the highest priority in the ranking of constraints, the optimal
candidate is also a front variant. BNN stems end in front vowels, therefore the suffix
vowel, if it agrees with the last stem vowel, will also be front.
If *-b -b+b is on top of the hierarchy, the winner will always be a front
variant of the suffixed form. In all the other cases, the winner is form with a back suffix
vowel.
If AGREE(low) is the highest ranked constraint, both the font and back variant
can be optimal, as a and e are both low vowels. In the case of such stems, a front variant
can be the winner if this constraint is followed by either *-b -b+b or AGREE(last).
In the case of agresszív-type words (stems containing an a or á, followed by e
(one or two) and / or i/í): 2 x 5! + 2 x 4! = 288. 288 is 40% of 720, therefore the
distribution of front and back suffixes is the following: 40% front and 60% back
suffixes. This is slightly problematic because the distribution of front and back suffixes
is just the opposite based on data collected from the Hungarian National Corpus: it is
40% back and 60% front. As mentioned, earlier, this word has a peculiar behaviour: it is
the exception to the rule of paradigmatic uniformity. We also made an assumption about
their exceptionality: it might be analysed as a compound in most of the cases (when the
suffixes attatched to the word agresszív ’aggressive’ could also be attatched to the noun
szív ’heart’ – e.g. agresszívhez ’aggressive-all’ and szívhez ’heart-all’ are both well-
formed and grammatical, but *szívebb ’heart-comp’ is not possible. Therefore, the
behaviour of this word might not be fully accounted for in this analysis. Furthermore,
there are no sufficient data about stems of the same form, given that this is not a very
common form of vacillating stems.
50
(52) is the tableau of aszprin ’aspirine’:
(52) / / AGREE
str Align-R Ident-OO
back AGREE
low *-h
-h+h AGREE
last (a) +b -b +b
****
*
*
*
*
(b) A +b -b
*
****
*
*
*
*
(c) +b -b
*
***
(d) II +b
*
*
*
*
This is an example of stems containing an a followed by two i-s.
These stems take front suffixes in the following cases:
o if AGREE(last) is the highest ranked constraint in the hierarchy
o if *-b -b +b is the highest ranked constraint in the hierarchy
o if Ident-OO(back) is the highest ranked constraint in the hierarchy
If AGREE(last) has the highest priority in the ranking of constraints, the optimal
candidate is also a front variant. Vacillating stems end in front vowels, therefore the
suffix vowel, if it agrees with the last stem vowel, will also be front.
If *-b -b+b is on top of the hierarchy, the winner will always be a front
variant of the suffixed form. In all the other cases, the winner is form with a back suffix
vowel.
If Ident-OO(back) is ranked the highest in the hierarchy, the optimal candidate
will be a suffixed form with either a front or a back suffix vowel, depending on the
backness feature of the most common suffixed form in the paradigm of the given word.
It tends to be a front variant in the vast majority of the cases.
In the case of aszpirin-type words (stems containing an a followed by two i-s):
3 x 5! = 360, therefore the distribution of front and back suffixes is 50-50%. However,
these words are analysed as compounds in the majority of cases, therefore almost
always take front suffixes. There is no sufficient data available about stems that have
this form and are not analysed as compounds.
51
(53)
/ / AGREE str
Align-R Ident-OO back
AGREE low
*-h -h+h
AGREE last
(a) +b -b +b
****
*
*
*
*
(b) A +b -b
*
****
*
*
*
*
(c) +b -b
*
***
(d) IE +b
*
*
*
*
The above tableau is is an example of stems which contain an o followed by an i and an
e.
These stems can take front suffixes in the following cases:
o if AGREE(low) is the highest ranked constraint
o if AGREE(last) is the highest ranked constraint in the hierarchy
o if *-b -b +b is the highest ranked constraint in the hierarchy
o if Ident-OO(back) is the highest ranked constraint in the hierarchy
If AGREE(low) has the most important place in the hierarchy of constraints, the
optimal candidate will be a suffixed form with a front vowel, since the stem contains an
o, an i and an e. O is a mid vowel, i is high and e is low, therefore the backness of the
suffix vowel has to agree with e (i.e. the lowest vowel of the stem).
If AGREE(last) has the highest priority in the ranking of constraints, the
optimal candidate is also a front variant. Vacillating stems end in front vowels, therefore
the suffix vowel, if it agrees with the last stem vowel, will also be front.
If *-b -b+b is on top of the hierarchy, the winner will always be a front
variant of the suffixed form. In all the other cases, the winner is form with a back suffix
vowel.
If Ident-OO(back) is ranked the highest in the hierarchy, the optimal candidate
will be a suffixed form with either a front or a back suffix vowel, depending on the
backness feature of the most common suffixed form in the paradigm of the given word.
It tends to be a front variant in the vast majority of the cases.
52
In the case of polimer-type stems (containing an o followed by an i and an e) the
distribution of percentages can be calculated in the following way: 4 x 5! = 480. Since
480 is approximately 70% of 720, the proportions of front and back suffixes are 70%
and 30%, respectively.
BBN (back-back-neutral)-type stems: Two back vowels are followed by a neutral
vowel. The word produktív ’productive’ is an example for these stems. It is shown in
tableau (54):
(54)
/ / AGREE str
Align-R Ident-OO back
AGREE low
*-h -h+h
AGREE last
(a)
+h -h +h
***
*
(b) A +h -h
*
***
*
*
*
(c) +h -h
*
**
*
*
(d) I +h
*
These stems take front suffixes in the following cases:
o if AGREE(last) is the highest ranked constraint
o if *-b -b +b on top of the hierarchy is followed by AGREE(last)
If the highest ranked constraint in the hierarchy is AGREE(last), the optimal candidate
will be the front variant of the suffixed form. As the last vowel of such stems is always
a front vowel, these stems will take front suffixes, if the aforementioned constraint has
the highest priority in the ranking.
Since the constraint *-b -b +b does not play a role in the analysis of these
stems (there are no front vowels followed by a back vowel), no candidates can violate it.
If it is ranked the highest in the hierarchy, the second highest ranked constraint will
decide which form is the optimal. If it is followed by AGREE(last), the winner will be
the front variant of the suffixed stem.
53
In all the other cases, the suffixed stem with a back suffix vowel will be the
winner. Thus, the distribution of front and back suffixes can be calculated in the
following way: 5! x 4! = 120 + 24 = 144, which is 20% of 720. Therefore these words
will take front suffixes in 20% and back suffixes in 80% of the cases.
54
Conclusion
In the first section, we provided a description of the Hungarian vowel inventory, vowel
harmony and the peculiar behaviour of vacillating stems. In the second section, we
reported the results of different research projects, including a survey carried out in a
secondary school and other data collections from various online corpora. After having
collected data from various sources, we analysed them, focussing on the paradigm of
each vacillating stem. We arrived at the conclusion that vacillating stems, altough they
do not behave as consistently as neutral and harmonic stems, do show uniformity in a
paradigm to a certain extent. Furthermore, there always seems to be a base for
paradigmatic uniformity, which tends to be the most common suffixed form in the
paradigm of the given vacillating stem. It can be stated that paradigmatic uniformity in
the case of these stems is in connection with the frequency of suffixes (or suffixed
forms). However, it never seems to depend on the relative frequency of suffixes in
general – what counts is the frequency within one particular paradigm, that is the
paradigm of each vacillating stem. We also learned that there is only one exception to
the rule (at least, among the words collected in the Hungarian National Corpus): this is
the word agresszív ’aggressive’.
Based on the behaviour of certain stems, it is also likely that many of them seem
to be treated as hypothetical compounds by the majority of native speakers. These are
stems that look as if they ended in common Hungarian or foreign free or bound
morphemes or having endings that closely resemble existing Hungarian words. In these
cases, the words in question almost exclusively take front suffixes (since all vacillating
stems end in neutral (front unrounded) vowels. The only exceptions are the stems
ending in –né (e.g. matiné, praliné), which behave like names containing back harmonic
vowels with <the suffix indicating the marital status of women> attached to them (e.g.
Kovácsné ’Mrs Kovács (Smith)’): they almost always take back suffixes. This theory
might give an explanation for the exceptional behaviour of the word agresszív
’aggressive’. This is a stem that could be treated as a pseudo-compound by native
speakers. Interestingly enough, there are only two suffixes among the ones tested that
cannot be attached the word szív (the word that looks exactly like the last syllable of
agresszív) and one of them is the most common suffix in the paradigm of this stem.
In the third section, we gave a brief description of the vasic principles of
Optimality Theory, then we attempted to give an optimality theoretic account for the
unusual behaviour of vacillating stems first with the help of partially ranked constraints
55
and second, with a group of unranked constraints. This part of the paper requires further
study. In the first optimality theoretic analysis (with partially ranked constraints), we
assumed that vacillating stems are marked in the lexicon. However, in the second part of
our OT analysis, we attempted to model the behaviour of vacillating and non-vacillating
stems with unranked constraints. In this analysis, we can account for the behaviour of
vacillating stems without declaring that they are marked in the lexicon. The difference
between vacillating and non-vacillating stems lies in the results of constraint ranking. In
this case, we had six constraints that were unranked with respect to each other. From the
different rankings of these six constraints we could get 720 possible hierarchies that
resulted in different (either front or back) optimal candidates in the case of vacillating
stems. However, the ranking of constraints did not make a difference for non-vacillating
stems: in all of the 720 cases, the same candidate was the winner (the front variant for
neutral and front harmonic stems and the back variant for back harmonic stems).
Nevertheless, there are still problems with the optimality theoretic analysis of
vacillating stems. Although we could account for the behaviour of these stems and
could approximately get the proportions of the distribution of front and back suffixes for
the most typical types of vacillating stems, these percentages are not yet precise enough
in certain cases. Furthermore, we have not yet provided a convincing analysis for the
exceptional behaviour of the word agresszív ’aggressive’. It also makes our analysis
more complicated and less efficient that there is only an exiguous amount of data
available on the Hungarian National Corpus about certain rare vacillating stems and
most vacillating stems do not count as the most common words. Most BNN (back-
neutral-neutral) and BBN (back-back-neutral) stems seem to be treated as compounds in
the majority of cases, therefore it is difficult to account for their behaviour the way we
tried to do in our second optimality theoretic analysis.
As it is easier to collect data with the help of Internet search programs than in
the Hungarian National Corpus and other corpora, it would be worth collecting words
this way. Data collection from the Internet and online corpora might be more useful
than carrying out a survey because Internet texts are examples of spontaneous discourse:
they use the language the way they instinctively do, without being surveilled and
consciously choosing forms that are socially more prestigeous or perceived as
gramatically more ’correct’. Having said that, it would also be worth carrying out a
survey in different parts of the counry in order to account for the regional differences as
well.
56
Bibliography
A. Jászó, Anna (ed.) (1994): A magyar nyelv könyve. Budapest: Trezor.
Benua, Laura (1997): Transderivational Identity: Phonological Relations Between
Words. PhD thesis, University of Massachusetts, Amherst.
http://roa.rutgers.edu/view.php3?roa=259
Benus, Stefan – Gafos, Adamantios – Goldstein, Louis (to appear): Phonetics and
Phonology of Transparent Vowels in Hungarian.
homepages.nyu.edu/~ag63/VowelHarmonyProject/BLS-03.pdf
Bolla, Kálmán (ed.) (1982): Fejezetek a magyar leíró hangtanból. Budapest: Akadémiai
Kiadó.
Bolla, Kálmán (1995): Magyar fonetikai atlasz. Budapest: Nemzeti Tankönyvkiadó.
Csernák, Anna (2006): Elölségi harmónia és az ingadozó tövek. TDK Conference pape,
University of Pannonia.
Deme, László (1994): A hangtan területéről. In: Deme László: Nyelvünkről,
használatáról, használóiról.
Dienes, Péter (1997): Hungarian Neutral Vowels. In: The Odd Yearbook, 151-180.
Gósy, Mária (2004): Fonetika, a beszéd tudománya. Budapest: Osiris Kiadó.
Hayes, Bruce – Cziráky Londe, Zsuzsa (2005): Stochastic Phonological Knowledge:
The Case of Hungarian Vowel Harmony.
www.ai.mit.edu/projects/dm/featgeom/hayes-hungarian-stoch.pdf
Kager, René (1999): Optimality Theory. Cambridge: Cambridge University Press
Kassai, Ilona (1998): Fonetika. Budapest: Nemzeti Tankönyvkiadó.
57
Kenesei, István (ed.) (2000): A nyelv és a nyelvek. Corvina.
Keszler, Borbála (ed.) (2000): Magyar grammatika. Budapest: Nemzeti
Tankönyvkiadó.
Kertész, Zsuzsa (2003): Vowel Harmony and the Stratified Lexicon of Hungarian.
www.ai.mit.edu/projects/dm/featgeom/kertesz-stratified.pdf
Kiefer, Ferenc (2000): A ragozás. In: Kiefer Ferenc (ed.): Strukturális Magyar Nyelvtan
3. Morfológia. Budapest: Akadémiai Kiadó, 569-618.
Kontra, Miklós – Ringen, Catherine O. (1986): Hungarian Vowel Harmony: The
Evidence from Loanwords. In: Ural-Altaische Jahrbücher 58, 1-14.
McCarthy, John J. (2005): Optimal Paradigms. In: Paradigms in Phonological Theory.
New York: Oxford University Press Inc., 170-210.
Nádasdy, Ádám – Siptár, Péter (1994): A magánhangzók. In: Kiefer Ferenc (ed.):
Strukturális Magyar Nyelvtan 2. Fonológia. Budapest: Akadémiai Kiadó, 42-181.
Oostendorp, Marc van (2004): Aspects of Vowel Harmony
www.vanoostendorp.nl/fonologie/rhodos2004.pdf
Rebrus, Péter (2000): Morfofonológiai jelenségek. In: Kiefer Ferenc (ed.): Strukturális
Magyar Nyelvtan 3. Morfológia. Budapest: Akadémiai Kiadó, 763-948.
Rebrus, Péter (2001): Optimalitáselmélet. In: Siptár Péter (ed.): Szabálytalan fonológia.
Budapest: Tinta Könyvkiadó, 77-116.
Rebrus, Péter – Törkenczy, Miklós (2005): Uniformity and Contrast in the Hungarian
Verbal Paradigm. In: Paradigms in Phonological Theory. Oxford: Oxford University
Press, 263-295.
Reiss, Charles (2003): Deriving the Feature-Filling / Feature-Changing Contrast: An
Application to Hungarian Vowel Harmony. In: Linguistic Inquiry 34. 199-224.
58
Ringen, Catherine O. – Vago, Robert M. (1995): A Constraint-Based Analysis of
Hungarian Vowel Harmony. In: István Kenesei (ed.): Approaches to Hungarian, Vol. 5:
Levels and Structures. Szeged: JATE, 309-319.
Ringen, Catherine O. – Vago, Robert M. (1998): Hungarian Vowel Harmony in
Optimality Theory. Phonology 15. 393-416.
Ringen, Catherine O. – Heinämäkki, Orvokki (1999): Variation in Finnish Vowel
Harmony. In: Natural Language and Linguistic Theory 17. 303-37.
Ringen, Catherine O. – Szentgyörgyi, Szilárd (2000): Constraint Reranking in the
Szeged Dialect of Hungarian. In: Gábor Alberti – István Kenesei (ed.): Approaches to
Hungarian, Vol. 7: Papers from the Pécs Conference. Szeged: JATE Press, 3-22.
Roca, Iggy – Johnson, Wyn (1999): A Course in Phonology. Oxford &. Malden, Mass.:
Blackwell Publishers
Siptár, Péter (ed.) (1993): Modern fonológiai szöveggyűjtemény II. Szeged: JATE Press.
Siptár, Péter – Törkenczy, Miklós (2000): The Phonology of Hungarian. Oxford:
Oxford University Press.
Szende, Tamás (1976): A beszédfolyamat alaptényezői. Budapest: Akadémiai Kiadó.
Szentgyörgyi, Szilárd (1998): Lowering: The Interaction of Phonology and Morphology
in Hungarian. PhD thesis. József Attila Tudományegyetem, Szeged
Wanlass, Zara (2007): Evidence from Online Corpora for Vacillation in Hungarian
Vowel Harmony.
http://www.nytud.hu/icsh7/absz/Evidence from online corpora.doc