Simone Faluschi and Maria Gabriella Di Benedetto
INFOCOM Department, University of Rome 'La
Sapienza'
Via Eudossiana, 18, 00184 Rome, Italy
E-mail: simone.fal@tiscalinet.it
The aim of this project was to carry out a study based on the analysis of gemination of affricate consonants in Italian. Disyllabic words such as VCV (vowel-consonant-vowel) or VCCV (vowel-consonant-consonant-vowel) containing the four affricate Italian consonants [ʧ, ʤ, ʦ, ʣ], in their singleton and geminate version co-articulated with the three Italian vowels [a, i, u], were the subject of this study. The parameters that were considered are those time-related, frequency-related and energy-related ones. Next, a statistical analysis on the values obtained was carried out in order to establish the meaningfulness of the average values obtained. Those measures that were considered were taken at different sampling points of the utterance, so that it was possible to observe how they changed during the utterance sequence. Results show that pre-consonant vowel and consonant durations significantly change when passing from singleton to geminate utterance. Frequency and energy parameters do not show great differences like the durations of vocal and consonant although they present significant differences between singleton and geminate utterances. Results in time domain are in agreement with previous studies about Italian consonants (stops, liquids, fricatives and nasals) but it is not so for frequency and energy measurements.
In Italian, there are minimal pairs formed by
words which can be guessed only by the gemination of a consonant. Words
belonging to these minimal pairs are graphically distinguished by writing twice
the consonant that doubles (for example: micia (pussy-cat) and miccia (fuse)).
A problem still unsolved is about which vocal signal parameters change where
moving from singleton to geminate utterance and how they can affect perception.
To give an answer to this problem, concerning Italian language, several studies
- belonging to the GEMMA project in progress at the INFOCOM Department at
University 'La Sapienza' in Rome - have been performed on other consonant, such
as stops, liquids, fricatives and nasals. Results on stop consonants were
reported in Rossetti (1993, 1994) and Esposito and Di Benedetto (1999). Liquids
were analyzed in Argiolas et al. (1995) while the analysis of fricatives was
reported in Giovanardi (1998) and Giovanardi and Di Benedetto (1998). Finally
results on nasal consonants were reported in Mattei (1999) and Mattei and Di
Benedetto (2000). Main results confirm those obtained in previous studies about
the gemination of Italian consonants. They are a significant increase of
consonant duration and an as much reduction of the pre-consonant vowel, that
are confirmed by previous studies about perception of stop (Esposito and Di
Benedetto, 1999) and nasal (Mattei and Di Benedetto, 2000) consonants.
As already pointed out in (Mattei
and Di Benedetto, 2000) gemination has been analysed in other languages as
well: (Shrotriya, 1995), (Blumstein et
al., 1998), (Rochet and Rochet, 1995), (Cohn et al., 1999), (Abramson, 1999) (Local
and Simpson, 1999), (Arvaniti, 1999), (Louali and
Maddieson, 1999). This studies, by the way, do not concentrate
particularly on affricate consonants. The only one analysing this class of consonants
was (Abramson, 1999) although in a pre-test perceptive analysis results
presented a too high percentage of mistakes so that he was compelled to discard
that class.
In Italian, several disyllabic words form minimal pairs which can be distinguished on the sole basis of gemination of one consonant. Native speakers exhibit a natural attitude in producing disyllabic words of minimal pairs identified by the presence or absence of gemination of one consonant. The above consideration led to the creation of the GEMMA project database formed by a set of vowel-consonant-vowel disyllabic words (the singleton case) and vowel-consonant-consonant-vowel disyllabic words (the geminate case) which would serve as the basis for studying gemination in all possible geminated consonantal forms of Italian. The words in the database included the entire set of those Italian consonants which appear in singleton and geminated forms. Affricate consonants were studied when co-articulated with the three Italian vowels that are at the extreme positions in the phonetical diagram [a, i, u] and represent a subset of Italian vowels [a, e, ɛ, i, o, ɔ, u]. Words were pronunced by six pronunciation defectless and dialectal inflexionless native speakers. In order to keep under total control all the parameters used, words were pronounced alone instead of in whole sentences, so that those elements, such as intonation or internal stresses, that could have influenced the utterances were eliminated. Every word was pronounced three times by each speaker. Our procedure in building up the databese of utterances was motivated by the strong need of having data with well-controlled parameters, as a starting point for future more realistic settings. The analyzed words in the present study were therefore 3 for each affricate consonant (which are [ʧ, ʤ, ʦ, ʣ] and their geminate version) and 6 for each speaker in three repetition, leading to a total of 3x4x2x6x3=432 utterances (216 singletons and 216 geminates).
All the utterances were produced and recorded at
the Speech Laboratory of the INFOCOM Department at the University of Rome 'La
Sapienza' (Italy). Equipement used for this experiment is of a superior quality
and the recording took place into a sound-treated room. The operator who look
after this recording procedure was an acoustic trained subject. When there was
an evident pronunciation mistake or if it was judged unnatural, the speaker was
compelled to repeat it again. Three were the recording sessions that were done,
each corresponding to one of the three repetitions.
The set of words analysed in the present study
is reported in Table I. Geminate utterances are indicated by a double grapheme
of the first consonant letter.
|
ʧ |
ʤ |
ʦ |
ʣ |
||||
a |
aʧa |
atʧa |
aʤa |
adʤa |
aʦa |
atʦa |
aʣa |
adʣa |
i |
iʧi |
itʧi |
iʤi |
idʤi |
iʦi |
itʦi |
iʣi |
idʣi |
u |
uʧu |
utʧu |
uʤu |
udʤu |
uʦu |
utʦu |
uʣu |
udʣu |
TABLE I The complete set of words analyzed |
Once recorded, the utterances were digitized
using UNICE software by VECSYS (Vecsys, 1989) through which it has been
possible to elaborate the signal. It was filtered at 5 kHz (enough to study the
main spectral properties of the vocal signal), sampled at 10 kHz while every sample
was represented by using 16 bits. The resulting signal was stored into PC to
procede into next analysis performed by UNICE, such as spectrograms, DFT
(Discrete Fourier Transform) and LPC (Linear Predictive Coding) spectra. For
these elaborations a pre-emphasizing filter (with α=0.95) on the signal
and a Hamming window of 256 samples were used.
The GEMMA project analyses the measurement of a set group of standard parameters (Giovanardi and Di Benedetto, 1998) and (Mattei and Di Benedetto, 1999). Being the affricate consonants very particular in having an occlusive and a fricative phase it has been decided to modify the standard parameters in order to study more correctly the utterances. The following are the parameters used for this study:
1. duration of the pre-consonant vowel, indicated as V1d. The vowel onset was identified by the appearance of a glottal pulse followed by other regular glottal pulses. In those cases in which a glottal excitation was visible before regular vowel voicing, the vowel onset was taken as the beginning of regular vowel voicing, and the initial glottal excitation was discarded. Vowel offset was identified, by examination of both the waveform and the spectrogram
2. duration of the occlusive part of consonant, indicated below as C1d
3. duration of the fricative part of consonant, indicated below as C2d
4. duration of all consonant, indicated below as Cd (= C1d + C2d)
5. duration of the vowel following the consonant, indicated as V2d. The V2 onset was identified, by visual inspection of both the waveform and the spectrogram. The V2 offset was identified as the temporal sampling point where the glottal pulse disappeared
6. duration of the whole utterance, indicated as Utd
7. total energy of V1. Xi is the sample i, t1 and t2 are the temporal sampling points of vowel onset and vowel offset, respectively
8.
average power of
V1
9.
total energy of
occlusive phase of C, indicated below as EtotC1 and computed as for V1, with t1 and
t2 that corresponding to V1 offset (C1 onset) and C2 onset (C1
offset), respectively
10.
average power of
occlusive phase of C, indicated below as PmC1 and computed as for the average power of V1
11.
total energy of
fricative phase of C, indicated below as EtotC2 and computed as for V1, with t1 and
t2 that corresponding to C1 offset (C2 onset) and C2 offset,
respectively
12.
average power of
fricative phase of C, indicated below as PmC2 and computed as for the average power of V1
13.
total energy of
the whole consonant, indicated below as EtotC and computed like
usual
14.
average power of
the whole consonant, indicated below as PmC
15.
instantaneous
energy at V1 center, indicated as EiV1, computed in a window of 256 samples
centered on V1
16.
instantaneous
energy at V1-C1 transition, indicated as EiV1-C1: The window of 256 samples is centered on V1
offset
17.
instantaneous
energy at C1 center, indicated as EiC1 and computed as EiV1
18.
instantaneous
energy at C1-C2 transition, indicated as EiC1-C2: The window of 256 samples is centered on C1
offset (C2 onset)
19.
instantaneous
energy at C2 center, indicated as EiC2 and computed as EiV1
20.
instantaneous
energy at C2 offset, indicated as EiC2off, computed right before the first 256 samples of V2 onset
21.
F0, A0, F1, A1,
F2, A2, F3, A3, at V1 center (where F1,F2 and F3 are the fornants and A1, A2
and A3 their amplitudes)
22.
F0, A0, F1, A1,
F2, A2, F3, A3, at V1 offset
23.
F0, A0, F1, A1,
F2, A2, F3, A3, at the transition from V1 to C
24.
F0 and A0 at the
onset of voiced consonants
25.
F0 and A0 at the
centre of occlusive phase of voiced consonants
26.
F0 and A0 at the
centre of fricative phase of voiced consonants
27.
F0 and A0 at the
offset of voiced consonants
28.
F0, A0, F1, A1,
F2, A2, F3, A3, at V2 onset
29.
F0, A0, F1, A1,
F2, A2, F3, A3, at V2 centre
|
|||
FIGURE 1 Sampling points selected for the computation
of the acoustic parameters and indication of the frames where frequency
parameters were estimated. |
In this paragraph there are time, frequency and
energy domains results of the study. To understand wether existing differences
between values obtained for singleton and geminate utterances are statistically
meaningfulness, these statistic tests were used: Mono and Multivariate Anova,
the maximum a-posteriori classification test, the maximum likelihood
classification test and the Spearman Rank Correlation Coefficient rs. We separately show the results in time and
frequency domain.
2.1 Results in the time domain
The durations of the phonemes are showed in
Table II. It contains the average values of V1d, C1d, C2d, V2d and Utd for each
of the four consonants co-articulated with each of the three vowels and the
corresponding standard deviations. As we can see in Table II those parameters which
present relevant differences between singleton and geminate utterances are V1d,
C1d and C2d (remember that C1 and C2 together form the whole consonant). We
notice a general tendency to shorten the first vowel duration and to lenghten
the consonant duration (both in occlusive and fricative phase); on the
contrary, there are no relevant variations in the second vowel duration. These
statements are supported by a statistic analysis based on these data, showed on
Table III. Those values that modify significantly (p<0.05) from singleton to
geminate utterance, are highlighted through bold characters. They indicate that
a strong significancy is found for V1d, C1d and C2d, except for [ʦ]
articulated with [i]. Utd too, grows significantly in rather all the geminate
utterances, although not so strongly as those presented above. Moreover, the
second consonant duration does not vary significantly between singletons and
geminates, except for [ʤ] articulated with [a].
V1d |
C1d |
C2d |
V2d |
Utd |
|
V1d |
C1d |
C2d |
V2d |
Utd
|
|
V1d |
C1d |
C2d |
V2d |
Utd |
|
aʧa |
160.0 |
73.1 |
100.9 |
112.3 |
446.3 |
iʧi |
137.4 |
64.0 |
122.4 |
104.6 |
428.4 |
uʧu |
163.6 |
66.0 |
103.7 |
131.7 |
465.0 |
(StD) |
27.6 |
34.7 |
20.5 |
19.6 |
43.8 |
(StD) |
20.8 |
29.2 |
16.2 |
17.9 |
29.8 |
(StD) |
27.4 |
37.9 |
24.0 |
23.7 |
32.0 |
atʧa |
113.2 |
137.8 |
128.7 |
107.5 |
487.2 |
itʧi |
99.3 |
122.8 |
158.4 |
110.7 |
491.3 |
utʧu |
110.9 |
151.1 |
123.0 |
125.0 |
509.9 |
(StD) |
19.2 |
13.9 |
28.1 |
12.2 |
29.3 |
(StD) |
17.9 |
20.4 |
26.1 |
21.0 |
37.5 |
(StD) |
25.4 |
39.4 |
24.7 |
22.4 |
51.7 |
aʤa |
169.0 |
92.0 |
49.1 |
142.3 |
452.3 |
iʤi |
166.7 |
95.9 |
52.6 |
141.6 |
456.8 |
uʤu |
173.5 |
85.7 |
44.1 |
146.1 |
449.5 |
(StD) |
20.6 |
18.9 |
13.6 |
26.1 |
47.4 |
(StD) |
28.3 |
17.5 |
15.7 |
30.6 |
53.4 |
(StD) |
32.1 |
21.1 |
16.5 |
26.5 |
45.0 |
adʤa |
127.3 |
156.1 |
61.5 |
125.9 |
470.9 |
idʤi |
111.7 |
162.1 |
74.1 |
129.4 |
477.3 |
udʤu |
120.2 |
154.0 |
61.3 |
137.3 |
472.8 |
(StD) |
16.0 |
17.7 |
11.0 |
15.9 |
42.2 |
(StD) |
21.3 |
28.2 |
25.5 |
30.6 |
56.6 |
(StD) |
21.6 |
21.3 |
20.8 |
29.9 |
67.7 |
aʦa |
121.3 |
89.6 |
129.8 |
109.9 |
450.6 |
iʦi |
106.7 |
84.4 |
149.6 |
109.7 |
450.4 |
uʦu |
133.2 |
73.3 |
140.7 |
115.3 |
462.5 |
(StD) |
23.3 |
11.0 |
34.0 |
23.1 |
37.0 |
(StD) |
25.9 |
20.2 |
31.3 |
18.1 |
32.2 |
(StD) |
30.6 |
26.9 |
22.4 |
16.3 |
41.1 |
atʦa |
106.0 |
112.2 |
167.0 |
117.4 |
502.6 |
itʦi |
94.5 |
114.0 |
171.0 |
123.2 |
502.7 |
utʦu |
103.8 |
96.3 |
178.8 |
115.1 |
493.9 |
(StD) |
18.7 |
18.8 |
22.0 |
20.6 |
43.5 |
(StD) |
17.9 |
31.4 |
34.7 |
22.8 |
48.0 |
(StD) |
21.9 |
20.4 |
19.4 |
15.8 |
40.4 |
aʣa |
163.4 |
89.9 |
78.6 |
139.7 |
471.7 |
iʣi |
148.4 |
85.9 |
90.9 |
148.1 |
473.4 |
uʣu |
150.8 |
81.6 |
80.9 |
139.7 |
453.0 |
(StD) |
24.7 |
13.5 |
19.3 |
18.9 |
42.9 |
(StD) |
37.5 |
16.5 |
21.6 |
20.7 |
35.7 |
(StD) |
23.7 |
18.8 |
18.1 |
23.8 |
44.8 |
adʣa |
127.8 |
139.8 |
102.3 |
136.3 |
506.2 |
idʣi |
104.7 |
136.5 |
120.2 |
139.7 |
501.1 |
udʣu |
117.7 |
116.8 |
112.3 |
136.4 |
483.1 |
(StD) |
24.5 |
35.3 |
23.0 |
29.0 |
57.4 |
(StD) |
23.9 |
36.4 |
38.1 |
19.0 |
53.0 |
(StD) |
17.1 |
26.9 |
29.4 |
20.0 |
43.1 |
TABLE II Average values (and standard deviations) of V1d, C1d, C2d, V2d and Utd,
over all repetitions and speakers. All values are in msecs. |
|
|
A |
I |
U |
||||||||||||
|
|
V1d |
C1d |
C2d |
V2d |
Utd |
V1d |
C1d |
C2d |
V2d |
Utd |
V1d |
C1d |
C2d |
V2d |
Utd |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
ʧ |
F ratio |
34.89 |
53.90 |
53.90 |
0.78 |
10.83 |
34.69 |
49.22 |
24.66 |
0.88 |
31.05 |
35.75 |
43.56 |
5.62 |
0.78 |
9.82 |
p value |
0.0000 |
0.0000 |
0.0018 |
0.3842 |
0.0023 |
0.0000 |
0.0000 |
0.0000 |
0.3554 |
0.0000 |
0.0000 |
0.0000 |
0.0235 |
0.3845 |
0.0036 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
ʤ |
F ratio |
46.04 |
110.50 |
9.15 |
5.14 |
1.54 |
43.36 |
71.55 |
9.23 |
1.43 |
1.25 |
34.23 |
93.40 |
7.55 |
0.88 |
1.48 |
p value |
0.0000 |
0.0000 |
0.0047 |
0.0298 |
0.2231 |
0.0000 |
0.0000 |
0.0046 |
0.2394 |
0.2722 |
0.0000 |
0.0000 |
0.0096 |
0.3546 |
0.2328 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
ʦ |
F ratio |
4.73 |
19.22 |
15.15 |
1.06 |
14.91 |
2.69 |
11.29 |
3.80 |
3.86 |
14.75 |
10.96 |
8.32 |
29.77 |
0.00 |
5.36 |
p value |
0.0367 |
0.0001 |
0.0004 |
0.3094 |
0.0005 |
0.1102 |
0.0019 |
0.0595 |
0.0576 |
0.0005 |
0.0022 |
0.0068 |
0.0000 |
0.9679 |
0.0268 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
ʣ |
F ratio |
18.84 |
31.39 |
11.15 |
0.17 |
4.19 |
17.39 |
28.90 |
8.04 |
1.62 |
3.39 |
23.24 |
20.71 |
14.83 |
0.20 |
4.22 |
p value |
0.0001 |
0.0000 |
0.0020 |
0.6785 |
0.0485 |
0.0002 |
0.0000 |
0.0077 |
0.2120 |
0.0745 |
0.0000 |
0.0001 |
0.0005 |
0.6587 |
0.0477 |
|
TABLE
III
Results of the Anova test performed on V1d, C1d, C2d, V2d and Utd. F-values
are listed. The null hypothesis can be rejected at the p level of
significance indicated on the table. Bold characters indicate significantly
different values. |
A Spearman Rank correlation test was carried out
in order to verify if the relation between phonemes durations, in particular
those between V1d, C1d and C2d, was based on gemination. These were the
results:
·
when the two
singleton and geminate groups are put together we obtain these results for rs:
V1d vs C1d rs= - 0.471; V1d vs C2d rs = - 0.474
·
when the two
utterance groups are kept separately, then these are the value obtained for rs:
V1d vs C1d: rs = - 0.271 for singletons and rs not
significative for geminates; V1d vs C2d: rs = - 0.432 for singletons
and rs = - 0.321 for geminates.
Therefore, the correlation between V1d and C1d
can be attributed to the presence of gemination. The same can be said for the
correlation between V1d and C2d, although not with the same strength.
On the basis of the results achieved studying
time-domain quantities, it was decided to use the parameters V1d, C1d, C2d, Cd,
Cd/Utd, Cd/V1d, C1d/V1d and C2d/V1d to try an utterance classification based on
time parameters using the Maximum Likelihood Criterion (Dillon W.R. and
Goldstein M., 1984). All this, with
connected error percentages, can be seen in Table IV. Applying the MLC
criterion on all the utterances, we can find these error percentages: on Cd/Utd
we have 19.9%, on Cd/V1d 16.7%, on C1d/V1d 13.9%, on C2d/V1d 30.3%, on V1d
23.4%, on C1d 16.9%, on C2d 36.3%, on Cd 17.6%. The best result is obtained
using parameter C1d/V1d but, if we consider specific consonant classes, the result can be even much better (for example,
with parameter C1d/V1d there are no mistake for [ʤ] and with parameter Cd there is only a mistake
percentage of 4.63% for [ʧ]).
|
Cd/Utd |
Cd/V1d |
C1d/V1d |
C2d/V1d |
||||||||
|
EPP |
Errors |
Err. % |
EPP |
Errors |
Err. % |
EPP |
Errors |
Err. % |
EPP |
Errors |
Err. % |
Overall
|
0.43 |
86 |
19.91 |
1.6 |
72 |
16.67 |
0.76 |
60 |
13.89 |
0.78 |
131 |
30.32 |
Male |
0.42 |
40 |
18.52 |
1.68 |
30 |
13.89 |
0.69 |
21 |
9.72 |
0.78 |
66 |
30.56 |
Female |
0.43 |
45 |
20.83 |
1.55 |
40 |
18.52 |
0.85 |
36 |
16.67 |
0.78 |
65 |
30.09 |
[ʧ] |
0.44 |
15 |
13.89 |
1.6 |
9 |
8.33 |
0.65 |
9 |
8.33 |
0.87 |
20 |
18.52 |
[ʤ] |
0.37 |
6 |
5.56 |
1.26 |
3 |
2.78 |
0.91 |
0 |
0.00 |
0.36 |
17 |
15.74 |
[ʦ] |
0.49 |
31 |
28.70 |
1.9 |
26 |
24.07 |
0.77 |
29 |
26.85 |
1.41 |
28 |
25.93 |
[ʣ] |
0.42 |
15 |
13.89 |
1.45 |
14 |
12.96 |
0.76 |
13 |
12.04 |
0.75 |
18 |
16.67 |
[a] |
0.43 |
23 |
15.97 |
1.29 |
19 |
13.19 |
0.74 |
15 |
10.42 |
0.78 |
38 |
26.39 |
[i] |
0.48 |
32 |
22.22 |
1.75 |
25 |
17.36 |
0.92 |
22 |
15.28 |
1.13 |
41 |
28.47 |
[u] |
0.44 |
27 |
18.75 |
1.57 |
21 |
14.58 |
0.68 |
18 |
12.50 |
0.87 |
41 |
28.47 |
|
V1d |
C1d |
C2d |
Cd |
||||||||
|
EPP |
Errors |
Err. % |
EPP |
Errors |
Err. % |
EPP |
Errors |
Err. % |
EPP |
Errors |
Err. % |
Overall |
136 |
101 |
23.38 |
102 |
73 |
16.90 |
129 |
157 |
36.34 |
208 |
76 |
17.59 |
Male |
135 |
47 |
21.76 |
93 |
26 |
12.04 |
146 |
82 |
37.96 |
215 |
36 |
16.67 |
Female |
135 |
54 |
25.00 |
112 |
41 |
18.98 |
129 |
67 |
31.02 |
208 |
37 |
17.13 |
[ʧ] |
137 |
20 |
18.52 |
95 |
10 |
9.26 |
139 |
30 |
27.78 |
224 |
5 |
4.63 |
[ʤ] |
136 |
15 |
13.89 |
125 |
5 |
4.63 |
67 |
34 |
31.48 |
166 |
8 |
7.41 |
[ʦ] |
124 |
34 |
31.48 |
91 |
27 |
25.00 |
157 |
28 |
25.93 |
244 |
19 |
17.59 |
[ʣ] |
153 |
28 |
25.93 |
100 |
17 |
15.74 |
108 |
28 |
25.93 |
201 |
11 |
10.19 |
[a] |
145 |
32 |
22.22 |
104 |
17 |
11.81 |
107 |
53 |
36.81 |
198 |
23 |
15.97 |
[i] |
122 |
33 |
22.92 |
102 |
24 |
16.67 |
139 |
48 |
33.33 |
235 |
28 |
19.44 |
[u] |
135 |
27 |
18.75 |
94 |
24 |
16.67 |
129 |
54 |
37.50 |
207 |
24 |
16.67 |
TABLE IV Results of the
unidimensional Maximum Likelihood Criterion used for classifying singleton vs. geminate. |
2.2 Results in the frequency domain
In Table V there are the average frequency
parameters values and their standard deviations obtained by working out the
average of all utterances. All measured values and other interesting average
results can be found in (Faluschi, 2000). On the basis of a statistical
analysis about a multivariated ANOVA, there are not statistically meaningful
differences between taken measurements except for slight changes in the
fundamental frequency (F0) when it is measured in very specific frames. F0 is
about 14 Hz and 12 Hz higher in the geminate form (+9% and +8%) in the V1
offset and V1 to C transition frames respectively. No significant variations
were observed in F1, F2 and F3. Formant amplitudes A1, A2 and A3 are
significantly higher in their geminate form (1-3 dB) in V1 center, V1 offset,
V1 to C transition and V2 onset frames. Performing a Maximum Likelihood Criterion
based on the statistically significative frequency parameters, we are led to
about 64% of correct classifications, an average too low.
2.3 Results in the energy domain
Table VI reports the data about the energy-based
parameters. An Anova mulivariate test was performed to determine if
statistically significative differences between averages exist. Bold characters
indicate significantly different values. A general tendency is to pronounce geminate
words with more emphasis than singleton ones. This is confirmed also by the
previous analysis were there is a higher amplitude of formants in geminate
utterances. Performing a Maximum Likelihood Criterion based on the energy
parameters, leads to a poor average 61.7% of correct classification. In this
case too, the average percentage of mistakes is too high.
|
V1 CENTER |
|||||||||||||||||
|
F0 |
A0 |
F1 |
A1 |
F2 |
A2 |
F3 |
A3 |
||||||||||
Singleton |
160 |
13 |
550 |
34 |
1581 |
32 |
2895 |
26 |
||||||||||
(StD) |
44 |
6 |
310 |
9 |
777 |
8 |
466 |
11 |
||||||||||
Geminate |
169 |
13 |
552 |
36 |
1588 |
34 |
2870 |
26 |
||||||||||
(StD) |
43 |
7 |
303 |
8 |
769 |
8 |
548 |
11 |
||||||||||
|
V1 OFFSET |
|||||||||||||||||
|
F0 |
A0 |
F1 |
A1 |
F2 |
A2 |
F3 |
A3 |
||||||||||
Singleton |
151 |
13 |
487 |
29 |
1684 |
27 |
2877 |
23 |
||||||||||
(StD) |
44 |
5 |
242 |
8 |
679 |
9 |
469 |
9 |
||||||||||
Geminate |
165 |
14 |
506 |
32 |
1694 |
30 |
2792 |
25 |
||||||||||
(StD) |
43 |
7 |
256 |
7 |
683 |
8 |
656 |
10 |
||||||||||
|
V1 TO C TRANSITION |
|||||||||||||||||
|
F0 |
A0 |
F1 |
A1 |
F2 |
A2 |
F3 |
A3 |
||||||||||
Singleton |
148 |
12 |
448 |
26 |
1716 |
22 |
2864 |
21 |
||||||||||
(StD) |
43 |
5 |
201 |
8 |
638 |
8 |
469 |
7 |
||||||||||
Geminate |
160 |
13 |
466 |
27 |
1728 |
25 |
2778 |
22 |
||||||||||
(StD) |
44 |
5 |
220 |
7 |
651 |
8 |
648 |
9 |
||||||||||
|
C1 ONSET / C1 CENTER / C2 CENTER / C2
OFFSET |
|||||||||||||||||
|
F0 |
A0 |
F0 |
A0 |
F0 |
A0 |
F0 |
A0 |
||||||||||
Singleton |
139 |
11 |
126 |
9 |
112 |
9 |
120 |
10 |
||||||||||
(StD) |
36 |
5 |
32 |
5 |
23 |
4 |
27 |
5 |
||||||||||
Geminate |
70 |
5 |
62 |
4 |
40 |
3 |
48 |
4 |
||||||||||
(StD) |
78 |
7 |
69 |
6 |
58 |
6 |
64 |
7 |
||||||||||
|
V2 ONSET |
|||||||||||||||||
|
F0 |
A0 |
F1 |
A1 |
F2 |
A2 |
F3 |
A3 |
||||||||||
Singleton |
134 |
10 |
413 |
28 |
1724 |
25 |
2758 |
22 |
||||||||||
(StD) |
32 |
5 |
167 |
6 |
508 |
7 |
409 |
7 |
||||||||||
Geminate |
138 |
11 |
415 |
29 |
1743 |
27 |
2689 |
23 |
||||||||||
(StD) |
33 |
5 |
162 |
7 |
488 |
7 |
614 |
8 |
||||||||||
|
V2 CENTER |
|||||||||||||||||
|
F0 |
A0 |
F1 |
A1 |
F2 |
A2 |
F3 |
A3 |
||||||||||
Singleton |
130 |
10 |
476 |
30 |
1617 |
27 |
2798 |
21 |
||||||||||
(StD) |
31 |
5 |
258 |
6 |
653 |
7 |
421 |
8 |
||||||||||
Geminate |
133 |
10 |
480 |
31 |
1626 |
28 |
2724 |
21 |
||||||||||
(StD) |
32 |
5 |
250 |
7 |
644 |
7 |
627 |
9 |
||||||||||
TABLE V Frequency-based parameters. Mean values and Standard Deviation with respect of all the repetitions, speakers, vowels and consonants. Frequencies are in Hz, amplitudes in dB. |
||||||||||||||||||
|
|
EtotV1 |
PmV1 |
EtotC1 |
PmC1 |
EtotC2 |
PmC2 |
EtotC |
PmC |
|
||||||||
|
Singleton |
93.7 |
62.1 |
75.9 |
47.1 |
78.4 |
49.1 |
81.4 |
59.1 |
|
||||||||
|
(StD) |
6.0 |
5.5 |
5.9 |
6.0 |
4.8 |
5.3 |
4.1 |
4.5 |
|
||||||||
|
Geminate |
94.1 |
63.7 |
76.9 |
45.8 |
80.9 |
50.5 |
83.4 |
59.4 |
|
||||||||
|
(StD) |
5.7 |
5.2 |
6.0 |
5.9 |
5.1 |
5.5 |
4.5 |
4.7 |
|
||||||||
|
|
|
EiV1cent |
EiV1-C1 |
EiC1cent |
EiC1-C2 |
EiC2cent |
EiC2offs |
|
|
||||||||
|
Singleton |
|
86.8 |
79.4 |
67.3 |
68.6 |
73.1 |
72.5 |
|
|
||||||||
|
(StD) |
|
5.9 |
5.1 |
9.0 |
6.6 |
6.2 |
5.1 |
|
|
||||||||
|
Geminate |
|
88.8 |
80.9 |
63.2 |
68.2 |
74.3 |
74.5 |
|
|
||||||||
|
(StD) |
|
5.3 |
4.7 |
9.5 |
6.2 |
6.6 |
5.5 |
|
|
||||||||
|
TABLE VI Energy-based parameters. Mean
values and Standard Deviation with respect of all the repetions, speakers,
vowels and consonants. All values are in dB. Bold characters indicate
significantly different values. |
|
||||||||||||||||
First of all, we are obliged to
introduce a foreword. Affricate consonants have peculiar and very distinctive
characteristics that made necessary a split of the consonant itself into two
parts: the first is named C1 and it indicates the stop phase while the second,
named C2, indicates the fricative phase. This distinction had not been
necessary in the previous studies about other Italian consonants and this is why
it will not always be possible to compare the results of this study with those
of the previous ones.
In time domain there is evidence for a strong
connection between gemination and phonemes duration. In particular, as it was
for previous studies about gemination of Italian consonants, it was evident,
moving from singleton to geminate utterances, an increase of consonant duration
(both in the C1 stop phase and C2 fricative phase) and a decrease of
pre-consonant vowel duration, while there are not significant variations in
second vowel (V2) duration. Even the total duration of the utterance (Utd)
moves up in geminate form, although not in such a significant way such as in
V1, C1 and C2. This leads to suppose that a compensation effect (even if
incomplete) takes place between V1 and C durations. This compensation effect is
the element that tends not to make the phonetic rhythm change too much.
In the frequency domain, F0
increases passing from singleton to geminate only in two frames, an average
quantity of about 13 Hz that is perceptively relevant. To vary are also the
amplitudes of the three measured formants, A1, A2 and A3 in every analysis
frame, except for V2 center. These three are higher of 2 dB average in geminate
utterances compared to singleton ones.
As far as the energy and power of pronunciations are concerned, we see the tendency to give more emphasis to geminates instead of singletons.
For affricates it was found that the average
difference between singles and geminates in terms of V1d is 38 ms (»-25% for
geminates), in C1d is 51 ms (»+62%for geminates), in C2d is 27 ms (»+28% for geminates)
while considering the whole consonant Cd=C1d+C2d the difference is 78ms (»+44% for
geminates). For fricatives (Giovanardi M., 1998) the average V1d difference was 49 ms (»-28% for geminates)
and in Cd 98 ms (»+73% for geminates). As regards stops (Esposito A., Di
Benedetto M.G., 1999), V1d difference was 43 ms (»-26% for geminates) and 92 ms for the stop
closure duration (+101% for geminates). Finally, for nasals (Mattei M., Di
Benedetto M.G., 1999) the average V1d difference was 59 ms (»-32% for geminates)
and in Cd 121 ms (»+134% for geminates).
The straightforward comparisons that can be done
with previous studies are between the C1 occlusive phase of the affricate
consonants and the stop consonants (Esposito A., Di Benedetto M.G., 1999) and
also between the C2 fricative phase of the affricate consonants and the
fricative ones (Giovanardi M., 1998).
There is evidence to assert that differences
between singleton and geminate durations are less sharp and clear than in other
Italian consonants.
The singleton/geminate classification based on
the Maximum Likelihood Criterion using the Cd and Cd/V1d parameters leads to
the following results, belonging to previous studies:
1) 0.47% of errors in nasals for both parameters
2) 12 % of errors in fricatives for both
parameters
3) 4% and 8% of errors in stops for Cd and
Cd/V1d, respectively
In Table IV, it is possible to
see that mistake percentages for affricates are worse compared to those
obtained for other classes of consonants.
Finally, an additional similarity
between nasals, fricatives, and stops was found; the Spearman Rank correlation
coefficient between Cd and V1d was for all these consonants equal to values from
-0.71 to -0.78, while for affricates this value is lower.
As a general conclusion, the most relevant
outcomes of the present work can be summarized as follows.
The general tendency of shortening the
pre-consonant vowel and of lengthening the consonant in geminate utterance,
observed on stops, fricatives and nasals in previous studies, is confirmed for
affricates.
In previous studies, significant variations in
the energies of utterances have not been observed, except for the total energy
of the consonant in nasals that is stronger in the geminates and it is due, as
far as the author is concerned, to a longer duration of the consonant while the
tendency for the affricates is to emphasise the geminate utterance (even if
differences are limited to very few dB).
In
conclusion, we want to point out that there are studies in progress concerning
synthetised utterances of affricate consonants obtained using the articulatory
synthetiser HLsyn. Moreover, the project will pilot an experiment of perceptive
analysis in order to support the idea that, as there is evidence in this study,
the duration of phonemes is the most important factor to distinguish a
singleton utterance from a geminate one. In order to illustrate this theory we
insert below two link-buttons: the first connects to the singleton synthetised
utterance of the [ʧ]
articulated with the [a] vowel, while the second connects to the corresponding
geminate synthetised utterance.
References
Abramson A. S. (1999)
"Fundamental frequency as a cue to word-initial consonant length: Pattani
Malay" ICPhS99 San Francisco pp 591-594.
Argiolas F., Macrì F., Di Benedetto M.G. "Acoustic analysis of Italian [r] and [l]," Journal of the Acoustical Society of America
97, no. 5, pt.2, pp.3418, 1995.
Arvaniti A. (1999)
"Effects of speaking rate on the timing of single and geminate
sonorants" ICPhS99 San Francisco pp 599-602.
Blumstein S.E., Pickett
E., Burton M. (1998). “Effects
of speaking rate on Singleton/Geminate consonant contrast in Italian”,
unpublished manuscript.
Canepari L. (1979)
"Introduzione alla fonetica" Piccola Biblioteca Einaudi.
Cohn A.C., Ham
W.H., Podesva
R.
J.(1999)
"The phonetic realization of singleton-geminate contrasts in three
languages of Indonesia" ICPhS99 San Francisco pp 587-590.
Esposito A., Di Benedetto M.G (1999),
"Acoustic and Perceptual Study of Gemination in Italian Stops",
Journal of the Acoustical Society of America.
Faluschi S. (2000) "Analisi
Acustica e Sintesi delle consonanti affricate singole e geminate in
Italiano" Degree Thesis in Electronics Engineering, Rome University
"La Sapienza".
Giovanardi M. (1998) "Analisi
Acustica e Sintesi delle consonanti fricative singole e geminate in
Italiano" Degree Thesis in Electronics Engineering, Rome University
"La Sapienza".
Giovanardi M. and Di Benedetto M.G.
(1998)." Acoustic analysis of singleton and geminate fricatives in Italian" European student journal of language
and speech.
Local J.and Simpson A.P.
(1999) "Phonetic implementation of geminates in Malayalam nouns" ICPhS99 San
Francisco pp 592-595.
Louali N.and Maddieson
I. (1999) "Phonological contrast and phonetic realization: the case
of Berber stops" ICPhS99 San Francisco pp 603-606.
Mattei M. and
Di Benedetto M.G. (2000) "Acoustic analysis of singleton and geminate
nasals in Italian" European
student journal of language and speech.
Mattei M. (1999) "Analisi
Acustica delle consonanti nasali singole e geminate in Italiano" Degree
Thesis in Electronics Engineering, Rome University "La Sapienza".
Muljacic Z. (1972). "Fonologia
della lingua Italiana", (Il Mulino, Bologna).
Rochet,L.B., and
Rochet,A.P. (1995) “The perception of the single-geminate consonant contrast by
native speakers of Italian and Anglophones” in proceedings of ICPhS95, edited
by K.Elenius and P.Branderud,Vol 3 (Arne Strombergs, Stockholm ) pp. 616-619.
Rossetti R. (1994)
"Gemination of Italian stops", Journal of the Acoustical Society of
America, 95, 2pSP25, pp.2874.
Vecsys (1989). The Unice User Manual
(Vecsys - Chemin du Chene rond - 91570 Bièvres, France).
Shrotriya N., Siva Sarma
A.S., Verma R., Agrawal S.S. (1995). “Acoustic and perceptual characteristics
of geminate Hindi stop consonants”, in Proceedings of ICPhS95, edited by
K.Elenius and P.Branderud, 4, (Arne Strombergs Grafiska Stockolm), pp.132-135.
Vecsys (1989). The Unice User Manual
(Vecsys - Chemin du Chene rond - 91570 Bièvres, France).