Publications‎ > ‎

Journal Articles



Shaw, J., Carignan, C., Agostini, T., Mailhammer, R., Harvey, M., & Derrick, D. (submitted). "Lenition and Timing: the case of Iwaidja."

Carignan, C. (under review). "A network-modeling approach to investigating individual differences in articulatory-to-acoustic relationship strategies."

Carignan, C. (in press). "Using naïve listener imitations of native speaker productions to investigate mechanisms of listener-based sound change." Laboratory Phonology.
Abstract:
This study was designed to test whether listener-based sound change—listener misperception (Ohala, 1981, 1993) and perceptual cue re-weighting (Beddor, 2009, 2012)—can be observed synchronically in a laboratory setting. Co-registered articulatory data (degree of nasalization, tongue height, breathiness) and acoustic data (F1 frequency) related to the productions of phonemic oral and nasal vowels of Southern French were first collected from four native speakers, and the acoustic recordings were subsequently presented to nine Australian English naïve listeners, who were instructed to imitate the native productions. During these imitations, similar articulatory and acoustic data were collected in order to compare the articulatory strategies used by the two groups. The results suggest that the imitators successfully reproduced the acoustic distinctions made by the native speakers, but that they did so using different articulatory strategies. The articulatory strategies for the vowel pair /ɑ̃/-/a/ suggest that listeners (at least partially) misperceived F1-lowering due to nasalization and breathiness as being due to tongue height. Additional evidence supports perceptual cue re-weighting, in that the naïve imitators used nasalance less, and tongue height more, in order to obtain the same F1 nasal-oral distinctions that the native speakers had originally produced.

Derrick, D., Carignan, C., Chen, W.-R., Shujau, M., & Best, C. (2018). "Three-dimensional printable ultrasound transducer stabilization system." Journal of the Acoustic Society of America, 144(5), EL392–EL398, DOI: 10.1121/1.5066350.
Publisher link
Abstract:
When using ultrasound imaging of the tongue for speech recording/research, submental transducer stabilization is required to prevent the ultrasound transducer from translating or rotating in relation to the tongue. An iterative prototype of a lightweight three-dimensional-printable wearable ultrasound transducer stabilization system that allows flexible jaw motion and free head movement is presented. The system is completely non-metallic, eliminating interference with co- recorded signals, thus permitting co-collection and co-registration with articulometry systems. A motion study of the final version demonstrates that transducer rotation is limited to 1.25° and translation to 2.5 mm—well within accepted tolerances.

Carignan, C. (2018). "Using ultrasound and nasalance to separate oral and nasal contributions to formant frequencies of nasalized vowels." Journal of the Acoustical Society of America, 143(5), 2588–2601, DOI: 10.1121/1.5034760.
Publisher link
Abstract:
The experimental method described in this manuscript offers a possible means to address a well known issue in research on the independent effects of nasalization on vowel acoustics: given that the separate transfer functions associated with the oral and nasal cavities are merged in the acoustic signal, the task of teasing apart the respective effects of the two cavities seems to be an intractable problem. The proposed method uses ultrasound and nasalance to predict the effect of lingual configuration on formant frequencies of nasalized vowels, thus accounting for acoustic variation due to changing lingual posture and excluding its contribution to the acoustic signal. The results reveal that the independent effect of nasalization on the acoustic vowel quadrilateral resembles a counter-clockwise chain shift of nasal compared to non-nasal vowels. The results from the productions of 11 vowels by six speakers of different language backgrounds are compared to predictions presented in previous modeling studies, as well as discussed in the light of sound change of nasal vowel systems.

Blackwood Ximenes, A., Shaw, J., & Carignan, C. (2017). "A comparison of acoustic and articulatory methods for analyzing vowel variation across American and Australian dialects of English." Journal of the Acoustical Society of America, 142(1), 363–377, DOI: 10.1121/1.4991346.
Abstract:
In studies of dialect variation, the articulatory nature of vowels is sometimes inferred from formant values using the following heuristic: F1 is inversely correlated with tongue height and F2 is inversely correlated with tongue backness. This study compared vowel formants and corresponding lingual articulation in two dialects of English, standard North American English, and Australian English. Five speakers of North American English and four speakers of Australian English were recorded producing multiple repetitions of ten monophthongs embedded in the /sVd/ context. Simultaneous articulatory data were collected using electromagnetic articulography. Results show that there are significant correlations between tongue position and formants in the direction predicted by the heuristic but also that the relations implied by the heuristic break down under specific conditions. Articulatory vowel spaces, based on tongue dorsum position, and acoustic vowel spaces, based on formants, show systematic misalignment due in part to the influence of other articulatory factors, including lip rounding and tongue curvature on formant values. Incorporating these dimensions into dialect comparison yields a richer description and a more robust understanding of how vowel formant patterns are reproduced within and across dialects.

Carignan, C. (2017). "Covariation of nasalization, tongue height, and breathiness in the realization of F1 of Southern French nasal vowels." Journal of Phonetics, 63, 87–105, DOI: 10.1016/j.wocn.2017.04.005.
Abstract:
In a variety of languages, changes in tongue height and breathiness have been observed to covary with nasalization in both phonetic and phonemic vowel nasality. It has been argued that this covariation stems from speakers using multiple articulations to enhance F1 modulation and/or from listeners misperceiving the articulatory basis for F1 modification. This study includes results from synchronous nasalance, ultrasound, EGG, and F1 data related to the realizations of the oral–nasal vowel pairs /ɛ/-/ɛ̃/, /a/-/ɑ̃/, and /o/-/ɔ̃/ of Southern French (SF) as produced by four male speakers in a laboratory setting. The aim of the study is to determine to what extent tongue height and breathiness covary with nasalization, as well as how these articulations affect the realization of F1. The following evidence is observed: (1) that nasalization, breathiness, and tongue height are used in idiosyncratic ways to distinguish F1 for each vowel pair; (2) that increased nasalization and breathiness significantly predict F1-lowering for all three nasal vowels; (3) that nasalization increases throughout the duration of the nasal vowels, supporting previous claims about the temporal nature of nasality in SF nasal vowels, but contradicting claims that SF nasal vowels comprise distinct oral and nasal elements; (4) that breathiness increases in a gradient manner as nasalization increases; and (5) that the acoustic and articulatory data provide limited support for claims of the existence of an excrescent nasal coda in SF nasal vowels. These results are discussed in the light of claims that the multiple articulatory components observed in the production of vowel nasalization may have arisen due to misperception-based sound change and/or to phonetic enhancement.

Kalashnikova, M., Carignan, C., & Burnham, D. (2017). "The origins of babytalk: Smiling, teaching, or social convergence?" Royal Society Open Science, 4(8), DOI: 10.1098/rsos.170306.
Abstract:
When addressing their young infants, parents systematically modify their speech. Such infant-directed speech (IDS) contains exaggerated vowel formants, which have been proposed to foster language development via articulation of more distinct speech sounds. Here, this assumption is rigorously tested using both acoustic and, for the first time, fine-grained articulatory measures. Mothers were recorded speaking to their infant and to another adult, and measures were taken of their acoustic vowel space, their tongue and lip movements and the length of their vocal tract. Results showed that infant- but not adult-directed speech contains acoustically exaggerated vowels, and these are not the product of adjustments to tongue or to lip movements. Rather, they are the product of a shortened vocal tract due to a raised larynx, which can be ascribed to speakers' unconscious effort to appear smaller and more non-threatening to the young infant. This adjustment in IDS may be a vestige of early mother–infant interactions, which had as its primary purpose the transmission of non-aggressiveness and/or a primitive manifestation of pre-linguistic vocal social convergence of the mother to her infant. With the advent of human language, this vestige then acquired a secondary purpose—facilitating language acquisition via the serendipitously exaggerated vowels.

Mielke, J., Carignan, C., & Thomas, E. R. (2017). "The articulatory dynamics of pre-velar and pre-nasal /æ/-raising in English: an ultrasound study." Journal of the Acoustical Society of America, 142(1), 332–349, DOI: 10.1121/1.4991348.
Abstract:
Most dialects of North American English exhibit /æ/-raising in some phonological contexts. Both the conditioning environments and the temporal dynamics of the raising vary from region to region. To explore the articulatory basis of /æ/-raising across North American English dialects, acoustic and articulatory data were collected from a regionally diverse group of 24 English speakers from the United States, Canada, and the United Kingdom. A method for examining the temporal dynamics of speech directly from ultrasound video using EigenTongues decomposition [Hueber, Aversano, Chollet, Denby, Dreyfus, Oussar, Roussel, and Stone (2007). in IEEE International Conference on Acoustics, Speech and Signal Processing (Cascadilla, Honolulu, HI)] was applied to extract principal components of filtered images and linear regression to relate articulatory variation to its acoustic consequences. This technique was used to investigate the tongue movements involved in /æ/ production, in order to compare the tongue gestures involved in the various /æ/-raising patterns, and to relate them to their apparent phonetic motivations (nasalization, voicing, and tongue position).

Carignan, C., Shosted, R., Fu, M., Liang, Z.-P., & Sutton, B. (2015). "A real-time MRI investigation of the role of lingual and pharyngeal articulation in the production of the nasal vowel system of French." Journal of Phonetics, 50, 34–51, DOI: 10.1016/j.wocn.2015.01.001.
Abstract:
It is well known that, for nasal vowels, traditional estimation of the shape of the vocal tract via inference from acoustic characteristics is complicated by the acoustic effects of velopharyngeal coupling (i.e. nasalization). Given this complexity, measuring the shape of the vocal tract directly is, perhaps, a more desirable method of assessing oro-pharyngeal configuration. Real-time MRI (rt-MRI) allows us to explore the shape of the entire vocal tract during the production of nasal vowels. This permits us to better assess the contribution of the oro-pharyngeal acoustic transfer function to the acoustic signal, which is otherwise obscured by the conflation of the independent oro-pharyngeal and nasal acoustic transfer functions. The oro-pharyngeal shape associated with nasal vowels has implications for both synchronic and diachronic phonology, particularly in French, where descriptions of nasal vowels have long suggested that differences in oral articulation, in addition to velopharyngeal coupling, serve to distinguish oral and nasal vowels. In this study, we use single-slice rt-MRI (midsagittal slice) and multi-slice rt-MRI (oral, velopharyngeal, mediopharyngeal, and lower pharyngeal slices) to examine three nasal vowels /ɛ̃, ɑ̃, ɔ̃/ and their traditional oral counterparts /ɛ, a, o/ as produced by three female speakers of Northern Metropolitan French (NMF). We find evidence of lingual and pharyngeal articulatory configurations which may, in some cases, enhance formant-frequency-related acoustic effects associated with nasalization, viz., modulation of F1 and F2. Given these findings, we speculate that the synchronic oral articulation of NMF nasal vowels may have arisen—at least in part—due to misperception of the articulatory source of changes in F1 and F2, rather than to mere chance, as has been argued.

Fu, M., Zhao, B., Carignan, C., Shosted, R., Perry, J., Kuehn, D., Liang, Z.-P., & Sutton, B. (2015). "High-resolution dynamic speech imaging with joint low-rank and sparsity constraints." Magnetic Resonance in Medicine, 74(5), 1820–1832, DOI: 10.1002/mrm.25302.
Abstract:
PURPOSE:
To enable dynamic speech imaging with high spatiotemporal resolution and full-vocal-tract spatial coverage, leveraging recent advances in sparse sampling.

METHODS:
An imaging method is developed to enable high-speed dynamic speech imaging exploiting low-rank and sparsity of the dynamic images of articulatory motion during speech. The proposed method includes: (a) a novel data acquisition strategy that collects spiral navigators with high temporal frame rate and (b) an image reconstruction method that derives temporal subspaces from navigators and reconstructs high-resolution images from sparsely sampled data with joint low-rank and sparsity constraints.

RESULTS:
The proposed method has been systematically evaluated and validated through several dynamic speech experiments. A nominal imaging speed of 102 frames per second (fps) was achieved for a single-slice imaging protocol with a spatial resolution of 2.2 × 2.2 × 6.5 mm(3) . An eight-slice imaging protocol covering the entire vocal tract achieved a nominal imaging speed of 12.8 fps with the identical spatial resolution. The effectiveness of the proposed method and its practical utility was also demonstrated in a phonetic investigation.

CONCLUSION:
High spatiotemporal resolution with full-vocal-tract spatial coverage can be achieved for dynamic speech imaging experiments with low-rank and sparsity constraints.

Carignan, C. (2014). "An acoustic and articulatory examination of the 'oral' in 'nasal': The oral articulations of French nasal vowels are not arbitrary." Journal of Phonetics, 46, 23–33, DOI: 10.1016/j.wocn.2014.05.001.
Abstract:
This study includes results of an articulatory (electromagnetic articulography, i.e. EMA) and acoustic study of the realizations of three oral–nasal vowel pairs  /ɛ/-/ɛ̃/, /a/-/ɑ̃/, and /o/-/ɔ̃/ recorded from 12 Northern Metropolitan French (NMF) female speakers in laboratory settings. By studying the position of the tongue and the lips during the production of target oral and nasal vowels and simultaneously recording the acoustic signal, the predicted effects of velo-pharyngeal (VP) coupling on the acoustic output of the vocal tract can be separated from those due to oral articulatory configuration in a qualitative manner. Based on the previous research, all nasal vowels were expected to be produced with at least some change in lingual and labial articulatory configurations compared to their oral vowel counterparts. Evidence is observed which suggests that many of the oral articulatory configurations of NMF nasal vowels enhance the acoustic effect of VP coupling on F1 and F2 frequencies. Moreover, evidence is observed that the oral articulatory strategies used to produce the oral/nasal vowel distinction are idiosyncratic, but that, nevertheless, speakers produce a similar acoustic output. These results are discussed in the light of motor equivalence as well as the view that the goal of speech acts is acoustic, not articulatory.

Shosted, R., Carignan, C., & Rong, P. (2012). "Managing the distinctiveness of phonemic nasal vowels: Articulatory evidence from Hindi." Journal of the Acoustical Society of America, 131(1), 455–465, DOI: 10.1121/1.3665998.
Abstract:
There is increasing evidence that fine articulatory adjustments are made by speakers to reinforce and sometimes counteract the acoustic consequences of nasality. However, it is difficult to attribute the acoustic changes in nasal vowel spectra to either oral cavity configuration or to velopharyngeal opening (VPO). This paper takes the position that it is possible to disambiguate the effects of VPO and oropharyngeal configuration on the acoustic output of the vocal tract by studying the position and movement of the tongue and lips during the production of oral and nasal vowels. This paper uses simultaneously collected articulatory, acoustic, and nasal airflow data during the production of all oral and phonemically nasal vowels in Hindi (four speakers) to understand the consequences of the movements of oral articulators on the spectra of nasal vowels. For Hindi nasal vowels, the tongue body is generally lowered for back vowels, fronted for low vowels, and raised for front vowels (with respect to their oral congeners). These movements are generally supported by accompanying changes in the vowel spectra. In Hindi, the lowering of back nasal vowels may have originally served to enhance the acoustic salience of nasality, but has since engendered a nasal vowel chain shift.

Carignan, C., Shosted, R., Shih, C., & Rong, P. (2011). "Compensatory articulation in American English nasalized vowels." Journal of Phonetics, 39, 668–682, DOI: 10.1016/j.wocn.2011.07.005.
Abstract:
In acoustic studies of vowel nasalization, it is sometimes assumed that the primary articulatory difference between an oral vowel and a nasal vowel is the coupling of the nasal cavity to the rest of the vocal tract. Acoustic modulations observed in nasal vowels are customarily attributed to the presence of additional poles affiliated with the naso-pharyngeal tract and zeros affiliated with the nasal cavity. We test the hypothesis that oral configuration may also change during nasalized vowels, either enhancing or compensating for the acoustic modulations associated with nasality. We analyze tongue position, nasal airflow, and acoustic data to determine whether American English /i/ and /a/ manifest different oral configurations when they are nasalized, i.e. when they are followed by nasal consonants. We find that tongue position is higher during nasalized [ĩ] than it is during oral [i] but do not find any effect for nasalized [ã]. We argue that speakers of American English raise the tongue body during nasalized [ĩ] in order to counteract the perceived F1-raising (centralization) associated with high vowel nasalization.