<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<!--<?xml-stylesheet type="text/xsl" href="article.xsl"?>-->
<article article-type="research-article" dtd-version="1.2" xml:lang="en" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id journal-id-type="issn">1868-6354</journal-id>
<journal-title-group>
<journal-title>Laboratory Phonology: Journal of the Association for Laboratory Phonology</journal-title>
</journal-title-group>
<issn pub-type="epub">1868-6354</issn>
<publisher>
<publisher-name>Ubiquity Press</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5334/labphon.6461</article-id>
<article-categories>
<subj-group>
<subject>Journal article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Production and perception across three Hong Kong Cantonese consonant mergers: Community- and individual-level perspectives</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Cheng</surname>
<given-names>Lauretta S. P.</given-names>
</name>
<email>lspcheng@umich.edu</email>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Babel</surname>
<given-names>Molly</given-names>
</name>
<email>molly.babel@ubc.ca</email>
<xref ref-type="aff" rid="aff-2">2</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Yao</surname>
<given-names>Yao</given-names>
</name>
<email>y.yao@polyu.edu.hk</email>
<xref ref-type="aff" rid="aff-3">3</xref>
</contrib>
</contrib-group>
<aff id="aff-1"><label>1</label>Department of Linguistics, University of Michigan, Ann Arbor, Michigan, United States of America</aff>
<aff id="aff-2"><label>2</label>Department of Linguistics, University of British Columbia, Vancouver, British Columbia, Canada</aff>
<aff id="aff-3"><label>3</label>Department of Chinese and Bilingual Studies, The Hong Kong Polytechnic University, HKSAR, China</aff>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2022-06-23">
<day>23</day>
<month>06</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<issue>1</issue>
<elocation-id>14</elocation-id>
<permissions>
<copyright-statement>Copyright: &#x00A9; 2022 The Author(s)</copyright-statement>
<copyright-year>2022</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See <uri xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</uri>.</license-p>
</license>
</permissions>
<self-uri xlink:href="http://www.journal-labphon.org/articles/10.5334/labphon.6461/"/>
<abstract>
<p>Individual variation is key to understanding phenomena in phonetic variation and change, including the production-perception link. To test the generalizability of this relationship, this study compares community- and individual-level variation across three long-standing consonant mergers in Hong Kong Cantonese speakers: [n]&#8594;[l], [&#331;&#809;]&#8594;[m&#809;], and [&#331;]&#8596;&#216;. Concurrently, we document these understudied mergers in a community that has undergone rapid social change in recent decades. Younger (college-aged) and older (middle-aged) Hong Kongers completed a reading production task followed by a forced-choice lexical identification perception task. Group-level results suggest mismatching production and perception: While the community overall distinguished merger pairs in production, younger listeners are more perceptually categorical than older listeners. However, aggregate results obscure the fact that individuals vary substantially in the extent of merging in both perception and production, including many who exhibit complete merger, and that individual-level production-perception correlations were found for [n]&#8594;[l] and [&#331;&#809;]&#8594;[m&#809;], though not [&#331;]&#8596;&#216;. Results are discussed in the context of previous research. We find that (i) these mergers have diverged from predicted trajectories of completion, and (ii) overall, prior findings on the production-perception link are generalizable to these consonant mergers.</p>
</abstract>
</article-meta>
</front>
<body>
<sec>
<title>1. Introduction</title>
<p>While variability within speech communities has long been acknowledged, phonetic variation and change has historically been studied at the level of the community or macro-social demographic groups. Recently, a more concentrated focus has been placed on the characteristics of individual speaker-listeners and their role in innovating and driving change (for overviews, see <xref ref-type="bibr" rid="B16">Coetzee, 2018</xref>; <xref ref-type="bibr" rid="B82">Yu &amp; Zellou, 2019</xref>), including both social (e.g., <xref ref-type="bibr" rid="B66">Sharma &amp; Sankaran, 2011</xref>; <xref ref-type="bibr" rid="B52">Lev-Ari, 2017</xref>; <xref ref-type="bibr" rid="B25">Garrett &amp; Johnson, 2013</xref>) and cognitive (e.g., <xref ref-type="bibr" rid="B80">Yu, 2010</xref>; <xref ref-type="bibr" rid="B81">Yu, Abrego-Collier, &amp; Sonderegger, 2013</xref>; <xref ref-type="bibr" rid="B18">Dimov, Katseff, &amp; Johnson, 2012</xref>; <xref ref-type="bibr" rid="B39">Kong &amp; Edwards, 2016</xref>) dimensions. As Yu and Zellou (<xref ref-type="bibr" rid="B82">2019</xref>) state, &#8220;the study of individual differences can shed light on the nature of the cognitive representations and mechanisms involved in phonological processing,&#8221; which in turn can &#8220;provide insight into long-standing issues in linguistic variation and change&#8221; (p. 131). In other words, bringing together community-level (macro) patterns with individual-level (micro) considerations helps build a fuller, more accurate picture of how speech is produced, perceived, and processed&#8212;feeding into our collective understanding of how sound systems vary and change over time (see also <xref ref-type="bibr" rid="B78">Yao &amp; Chang, 2016</xref>; <xref ref-type="bibr" rid="B23">Fridland &amp; Kendall, 2012</xref>; <xref ref-type="bibr" rid="B72">Voeten, 2020</xref>).</p>
<p>The example of focus here is how perception and production systems relate in the context of sound change. As this relationship holds implications for mechanisms behind the initiation and propagation of sound change, a more complete picture is crucial for understanding both the spread of sound change from individuals to entire communities and the progression of community-level change over time. Though many have probed this question, consistent results have been hard to come by both in regards to whether production-perception systems are linked in individuals (e.g., <xref ref-type="bibr" rid="B65">Schultz, Francis, &amp; Llanos, 2012</xref>; <xref ref-type="bibr" rid="B63">Schertz, Cho, Lotto, &amp; Warner, 2015</xref>; <xref ref-type="bibr" rid="B8">Beddor, Coetzee, Styler, McGowan, &amp; Boland, 2018</xref>) and which domain precedes the other during a change-in-progress (e.g., <xref ref-type="bibr" rid="B30">Harrington, Kleber, &amp; Reubold, 2008</xref>; <xref ref-type="bibr" rid="B41">Kuang &amp; Cui, 2018</xref>; <xref ref-type="bibr" rid="B60">Pinget, Kager, &amp; Van de Velde, 2020</xref>). Insofar as the nature of the production-perception relationship has yet to be fully uncovered, attempts to answer this question strongly benefit from an approach comparing patterns at the group and individual levels.</p>
<p>In line with this perspective, the present study examines&#8212;at community and individual levels&#8212;production, perception, and production-perception alignment in an understudied case of phonetic variation and change. We conduct an apparent-time investigation of three consonant mergers in Hong Kong Cantonese with distinct profiles and trajectories, namely [n]&#8594;[l], [&#331;&#809;]&#8594;[m&#809;], and [&#331;]&#8596;&#216;. While documenting the recent state of these mergers, we test hypotheses of the production-perception relationship, seeking to extend previous findings to a novel set of sound changes of a type&#8212;consonant mergers&#8212;that has thus far lacked study (with the notable exception of <xref ref-type="bibr" rid="B60">Pinget et al., 2020</xref>). In our focus on these particular consonant mergers in Hong Kong Cantonese, we find ourselves in a complex situation involving long-standing changes-in-progress, style-shifting, and sociolinguistic ideologies. Although previous research has largely presented the variants as a subset of mergers-in-progress within a larger set of merging changes (e.g., <xref ref-type="bibr" rid="B4">Bauer, 1986</xref>; <xref ref-type="bibr" rid="B83">Zee, 1999a</xref>; <xref ref-type="bibr" rid="B71">To, Mcleod, &amp; Cheung, 2015</xref>), scholarship also indicates the presence of formality-based stylistic variation (e.g., <xref ref-type="bibr" rid="B2">Bauer, 1982a</xref>; <xref ref-type="bibr" rid="B11">Bourgerie, 1990</xref>) and stigmatization (e.g., <xref ref-type="bibr" rid="B14">Chen, 2018</xref>). Nevertheless, regardless of whether these mergers have stabilized, completed, or continue to change, they remain situated in the context of diachronic change. Accordingly, we primarily approach these mergers as historical sound change and compare them to the sound change literature while also discussing the socially-structured variation and ideology in individual experiences.</p>
<sec>
<title>1.1. Production and Perception: Is there a link?</title>
<p>Theories of sound change have long assumed that an individual&#8217;s perception and production systems must be connected in some way for an individual to perceive a change within their speech community and implement it in their own production repertoire. This is a necessary condition if we posit that changes propagate from individual to individual, though the extent to which this is true (or, under which circumstances) had not been directly explored until recently (see <xref ref-type="bibr" rid="B7">Beddor, 2015</xref>). In establishing groundwork for theories about phonetic and sound change, Beddor et al. (<xref ref-type="bibr" rid="B8">2018</xref>) examine the time course of speaker-listeners&#8217; coarticulatory vowel nasalization in production (from aerodynamic measures) and perceptual patterns (with an eye-tracking paradigm) in American English. They demonstrate within-individual correlations across production and perception for this stable variation not only in the <italic>use</italic> of a cue but in the <italic>time-course of use</italic>: Those who produce early nasal flow in a vowel use temporally early nasalization to identify words.</p>
<p>Although there is a strong theoretical appetite for perceptual and productive repertoires to map neatly onto one another, only some investigations have found individual-level evidence for this connection (e.g., <xref ref-type="bibr" rid="B85">Zellou, 2017</xref>; <xref ref-type="bibr" rid="B17">Coetzee, Beddor, Shedden, Styler, &amp; Wissing, 2018</xref>; <xref ref-type="bibr" rid="B8">Beddor et al., 2018</xref>; <xref ref-type="bibr" rid="B23">Fridland &amp; Kendall, 2012</xref>; <xref ref-type="bibr" rid="B37">Kim &amp; Clayards, 2019</xref>; <xref ref-type="bibr" rid="B60">Pinget et al., 2020</xref>; <xref ref-type="bibr" rid="B72">Voeten, 2020</xref>) while others have not (e.g., <xref ref-type="bibr" rid="B35">Kataoka, 2011</xref>; <xref ref-type="bibr" rid="B26">Grosvald &amp; Corina, 2012</xref>; <xref ref-type="bibr" rid="B63">Schertz et al., 2015</xref>; <xref ref-type="bibr" rid="B65">Schultz et al., 2012</xref>; <xref ref-type="bibr" rid="B42">Kwon, 2015</xref>). Thus, the field has yet to come to a satisfying answer regarding the relevant conditions for matching or mismatching production-perception repertoires (see <xref ref-type="bibr" rid="B82">Yu &amp; Zellou, 2019</xref>). Here, we briefly review some possible explanations for these cross-study discrepancies.</p>
<p>One common suggestion put forth to account for inconsistent findings is that the experimental tasks and measures used to assess a link between production and perception were not necessarily comparable, such that production and perception tasks may have in fact been measuring different constructs (see <xref ref-type="bibr" rid="B85">Zellou, 2017</xref>; <xref ref-type="bibr" rid="B37">Kim &amp; Clayards, 2019</xref>; <xref ref-type="bibr" rid="B26">Grosvald &amp; Corina, 2012</xref>). A related concern is that previous measures may not have been sensitive enough to capture similarities across domains (e.g., static versus dynamic measures, <xref ref-type="bibr" rid="B8">Beddor et al., 2018</xref>). These concerns point to a need for careful selection of tasks and variables for comparison. They also link to methodological concerns about validity and reliability in terms of how phonetic variables are assessed (e.g., acoustic measurements versus auditory coding; trained versus naive listeners; speakers versus non-speakers; <xref ref-type="bibr" rid="B21">Evans, Munson, &amp; Edwards, 2018</xref>).</p>
<p>From a theoretical perspective, some posit that the context of variation, such as the stability of variation patterns in the community, has consequences for production-perception alignment. Beddor et al. (<xref ref-type="bibr" rid="B8">2018</xref>), for example, suggest that an individual&#8217;s link between perception and production is likely at its weakest during change when variants are in flux and socially-stratified, in comparison to stable, non-socially-indexed variation (see also <xref ref-type="bibr" rid="B29">Harrington, 2012</xref>). However, current empirical evidence does not obviously support this position. In terms of stable variation, while Beddor and colleagues do find a link between production and perception at the individual level for vowel nasalization (as does <xref ref-type="bibr" rid="B85">Zellou, 2017</xref>), a number of scholars investigating different types of coarticulatory variation failed to do so (e.g., <xref ref-type="bibr" rid="B65">Schultz et al., 2012</xref> on VOT-f0 cue weighting in English; <xref ref-type="bibr" rid="B63">Schertz et al., 2015</xref> on Korean L2 English VOT versus f0; <xref ref-type="bibr" rid="B26">Grosvald &amp; Corina, 2012</xref> on vowel-to-vowel coarticulation), though these direct comparisons may be inconclusive due to the aforementioned methodological differences.</p>
<p>Importantly, many recent studies of sound change do find evidence of individual-level links between domains, despite community-level mismatch (<xref ref-type="bibr" rid="B17">Coetzee et al., 2018</xref>; <xref ref-type="bibr" rid="B41">Kuang &amp; Cui, 2018</xref>; <xref ref-type="bibr" rid="B60">Pinget et al., 2020</xref>; <xref ref-type="bibr" rid="B72">Voeten, 2020</xref>). On the surface, these findings run counter to the above prediction as individual-level production-perception relationships can indeed be found during changes-in-progress. Yet, it remains possible that the size of correlation is smaller or that a larger number of individuals demonstrate a lack of relationship, both compatible with an interpretation that the relationship is weaker during change than during stability. Overall, this highlights the need for more systematic, controlled comparison of individual production and perception patterns for variation in different contexts, crucially using comparable measures.</p>
</sec>
<sec>
<title>1.2. Production-perception alignment: What matters?</title>
<p>Beyond the simple existence of a production-perception connection, a co-occurring question pertains to the <italic>nature</italic> or <italic>direction</italic> of alignment. In studies of community change, alignment direction has been demonstrated to vary such that, at times, perception appears to lead in sound change while at other times, production appears more advanced. Some have called for attention to the trajectory and stage of change when examining production and perception, arguing that alignment patterns may depend on whether the change is in early or late stages. A slew of recent findings converge to support the notion that direction of misalignment may be yoked to the stage of the change.</p>
<p>Kuang and Cui (<xref ref-type="bibr" rid="B41">2018</xref>) documented two vowel changes in Southern Yi where phonation cues are shifting to formant cues. For the incipient /u/ change, all speaker groups were generally aligned: Phonation was the most important cue in both domains, but F1 was increasingly relied upon in perception. For the ongoing /e/ change, however, older speakers were misaligned, using phonation as the primary cue in production but F1 as the novel primary cue in perception. In contrast, younger speakers were categorized as &#8216;realigned,&#8217; using the novel F1 cue in both domains. In sum, misalignment occurred during intermediate stages of change and perception led production for both early- and mid-stage change.</p>
<p>Pinget et al. (<xref ref-type="bibr" rid="B60">2020</xref>) show comparable results for regional bilabial stop and labiodental fricative devoicing mergers in Dutch, where perception tends to precede production for change in initial stages. Interestingly, their results indicate a reversal during later stages of change such that perception remained comparatively less merged for individuals whose change is near completion in production. In an investigation of a VOT-f0 cue weighting shift in Afrikaans, Coetzee et al. (<xref ref-type="bibr" rid="B17">2018</xref>) similarly found that younger speakers were generally &#8216;realigned&#8217; at the individual level, but if they were misaligned, production was more advanced. This late-stage perception lag is consistent with research on vowel mergers showing that, based on expectations of the talker, listeners can utilize cues in perception that they do not distinguish in production (e.g., <xref ref-type="bibr" rid="B32">Hay, Warren, &amp; Drager, 2006</xref>; <xref ref-type="bibr" rid="B40">Koops, Gentry, &amp; Pantos, 2008</xref>).</p>
<p>Factors relating to the source of sound change may separately influence the manifestation of production-perception misalignment. Voeten (<xref ref-type="bibr" rid="B72">2020</xref>), for example, finds that sociolinguistic migrants from Belgium who moved to the Netherlands were variable in their approximation of Netherlandic vowels; in general, although production and perception were linked in this case, production seemed to lead perception. This suggests that differences may arise in production-perception mismatch between cases of internally-driven versus contact-driven changes. In other words, studies of second dialect acquisition (e.g., <xref ref-type="bibr" rid="B72">Voeten, 2020</xref>; <xref ref-type="bibr" rid="B20">Evans &amp; Iverson, 2007</xref>) find that production appears to lead perception, at least in early stages. In other cases of presumably internal changes occurring across generations, perception leads at first (e.g., <xref ref-type="bibr" rid="B41">Kuang &amp; Cui, 2018</xref>; <xref ref-type="bibr" rid="B60">Pinget et al., 2020</xref>) until the change reaches late stages.</p>
<p>Finally, the type of change may matter in the coordination of production and perception. Most prior studies investigated cue shifting (e.g., <xref ref-type="bibr" rid="B17">Coetzee et al., 2018</xref>; <xref ref-type="bibr" rid="B41">Kuang &amp; Cui, 2018</xref>) or boundary shifting (e.g., <xref ref-type="bibr" rid="B30">Harrington et al., 2008</xref>; <xref ref-type="bibr" rid="B38">Kleber, Harrington, &amp; Reubold, 2012</xref>) changes, with a focus on vowels. To quote Kuang and Cui (<xref ref-type="bibr" rid="B41">2018</xref>), does &#8220;production and perception have the same relationship across different languages, different types of sound change (e.g., merger, cue shifting, or boundary shifting), and different types of coarticulation?&#8221; (p. 196).</p>
<p>In the current study, we highlight mergers as an interesting case of change, unique because they represent a total loss of phonological contrast rather than simply a shifting of cues that maintain contrast. This comparatively drastic phonological change could lead to differences in behaviour surrounding production and perception. To the best of our knowledge, only one study of production and perception has been published on segmental mergers (<xref ref-type="bibr" rid="B60">Pinget et al., 2020</xref>), for which reported results agree with those of other studies. Prior studies on tone mergers in Hong Kong Cantonese also report a link between reduced distinction in production and slower responses in perception (<xref ref-type="bibr" rid="B54">Mok, Zuo, &amp; Wong, 2013</xref>; <xref ref-type="bibr" rid="B58">Ou &amp; Law, 2017</xref>). Here, we present an investigation of a set of consonant mergers in Hong Kong Cantonese, each with its own socio-historical profiles, to test the robustness of the perception-production link across different types of sound changes.</p>
</sec>
<sec>
<title>1.3. Consonant mergers in Hong Kong Cantonese</title>
<p>Hong Kong Cantonese (HKC) provides an interesting opportunity to study mergers, as a large set of consonantal (e.g., <xref ref-type="bibr" rid="B77">Wong, 1941</xref>; <xref ref-type="bibr" rid="B83">Zee, 1999a</xref>; <xref ref-type="bibr" rid="B71">To et al., 2015</xref>) and tonal mergers (e.g., <xref ref-type="bibr" rid="B6">Bauer, Cheung, &amp; Cheung, 2003</xref>; <xref ref-type="bibr" rid="B54">Mok et al., 2013</xref>; <xref ref-type="bibr" rid="B86">Zhang, 2019</xref>) have emerged over the course of the last century. Moreover, the complex and ever-changing socio-political landscape surrounding Hong Kong has introduced a multitude of external influences on HKC, with consequences for the trajectory of these mergers. The last few decades, in particular, have seen rapid changes due to the transition of Hong Kong&#8217;s political status from a British colony to a Special Administrative Region under Chinese rule in 1997 (&#8216;the handover&#8217;). While already a highly multilingual environment, one notable change is the increasing presence and influence of Mandarin in the city, alongside Cantonese and English, especially for younger generations (e.g., in the education system; <xref ref-type="bibr" rid="B50">Lee &amp; Leung, 2012</xref>; <xref ref-type="bibr" rid="B74">Wang &amp; Kirkpatrick, 2015</xref>; <xref ref-type="bibr" rid="B76">Wong, A., 2019</xref>). Whether contact with Mandarin has ultimately affected the HKC sound changes is unclear, though both convergence, such as reversing changes that are dissimilar to Mandarin (<xref ref-type="bibr" rid="B5">Bauer &amp; Benedict, 1997</xref>), and divergence, for example maintaining HKC-specific changes that are linked to local identity (<xref ref-type="bibr" rid="B75">Whelpton, 1999</xref>), are conceivable. This latter possibility is particularly plausible given that many locals continue to identify as &#8216;Hong Kongers&#8217; (as opposed to &#8216;Chinese&#8217; or &#8216;Hong Kong-Chinese&#8217;) and specifically view Cantonese as central to this identity (<xref ref-type="bibr" rid="B47">Lai, 2001</xref>; <xref ref-type="bibr" rid="B48">Lai, 2011</xref>).</p>
<p>Another layer of complexity is that linguistic ideologies (e.g., the public media campaigns and school curriculum changes spearheaded by Professor Richard Ho Man-Wui, a prominent public scholar and professor of Chinese literature) have led to stigmatization of the innovative variants. The use of mergers is termed &#25078;&#38899; <italic>laan5 jam1</italic>, which translates roughly to &#8216;lazy pronunciation&#8217; (also variably denoted as &#8216;lazy accent,&#8217; &#8216;lazy articulation,&#8217; and &#8216;lazy sounds&#8217;). This notion is recognized by Hong Kongers in the present day (<xref ref-type="bibr" rid="B14">Chen, 2018</xref>), and is perhaps particularly salient to younger generations due to the introduction of Cantonese pronunciation in a standardized exam in 2007. This prompted official promotion of &#8216;proper Cantonese pronunciation&#8217; on TV and radio, as well as in school activities (<xref ref-type="bibr" rid="B50">Lee &amp; Leung, 2012</xref>). As a result of the prevailing prescriptivist approach linking conservative variants with standardness, clarity, and formality, a certain level of explicit awareness and attitudes about the various mergers is available to individuals; this may in turn legitimize hypercorrection to the &#8216;proper&#8217; variant.</p>
<p>In Hong Kong, these external factors could have plausibly come together to stall or reverse the progression of the mergers, leading to stable variation or preservation of the conservative form in formal registers (e.g., <xref ref-type="bibr" rid="B44">Labov, 1994</xref>; <xref ref-type="bibr" rid="B33">Hickey, 2012</xref>). Though HKC consonant mergers have been the focus of a sizable number of sociolinguistic and phonetic production studies in the past (e.g., <xref ref-type="bibr" rid="B2">Bauer, 1982a</xref>, <xref ref-type="bibr" rid="B83">Zee, 1999a</xref>; <xref ref-type="bibr" rid="B71">To et al., 2015</xref>), little to no research appears to have revisited these consonant mergers in recent years. The most recent comprehensive study on the status of HKC consonant mergers is To et al. (<xref ref-type="bibr" rid="B71">2015</xref>), which reports on data collected in the early 2000s when the earliest post-handover generation were still children. While large in its sample of speakers, this study was limited in scope and too early to assess post-handover outcomes.</p>
<p>Against this background, the present investigation focuses on three consonant mergers in HKC: onset [n]&#8594;[l], syllabic [&#331;&#809;]&#8594;[m&#809;], and onset [&#331;]&#8596;&#216;. <xref ref-type="table" rid="T1">Table 1</xref> provides examples. Despite surface similarities, they present a mix of phonological and socio-historical profiles, including differences in phonetic biases, functional load, cross-language correspondences, and metalinguistic awareness (for a detailed review of the historical trajectories, see <xref ref-type="bibr" rid="B71">To et al., 2015</xref>). This variation provides an opportunity to extend previous research on the production-perception link to a set of contrasting mergers in a single speech community, testing the extent to which predictions of production and perception hold across different types of sound change in the same individuals while using the same tasks and measures.</p>
<table-wrap id="T1">
<label>Table 1</label>
<caption>
<p>Examples of Cantonese words involved in each merger, and the direction of change.</p>
</caption>
<table>
<thead>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3"><bold>Merger</bold></td>
<td align="left" valign="top" rowspan="3"><bold>Chinese Character</bold></td>
<td align="left" valign="top" colspan="2"><bold>Jyutping Romanization</bold></td>
<td align="left" valign="top" rowspan="3"><bold>English Translation</bold></td>
</tr>
<tr>
<td colspan="2"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Historical</bold></td>
<td align="left" valign="top"><bold>Innovative</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">[n]&#8594;[l]</td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#34253;</td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;<italic>laam4</italic></td>
<td align="left" valign="top"></td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#8216;blue&#8217;</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#30007;</td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;<italic>naam4</italic></td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;<italic>laam4</italic></td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#8216;male&#8217;</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">[&#331;&#809;]&#8594;[m&#809;]</td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#21780;</td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;<italic>m4</italic></td>
<td align="left" valign="top"></td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#8216;not&#8217;</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#20116;</td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;<italic>ng5</italic></td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;<italic>m5</italic></td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#8216;five&#8217;</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">[&#331;]&#8596;&#216;</td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#22036;</td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;<italic>au2</italic></td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;<italic>ngau2</italic></td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#8216;vomit&#8217;</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#29275;</td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;<italic>ngau4</italic></td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;<italic>au4</italic></td>
<td align="left" valign="top">&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#8216;cow&#8217;</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
</tbody>
</table>
</table-wrap>
<sec>
<title>1.3.1. [n]&#8594;[l] merger</title>
<p>The [n]&#8594;[l] onset merger represents a prototypical case of contrast loss. These syllable-initial phonemes, described by Zee (<xref ref-type="bibr" rid="B84">1999b</xref>) as an apico-laminal denti-alveolar nasal and apical (denti-)lateral approximant, historically differed in both nasality and tongue blade involvement. There is also an asymmetry in frequency: [l]-onsets occur more often than [n] by both measures of type frequency (approximately double) and token frequency (approximately six times; <xref ref-type="bibr" rid="B51">Leung, Law, &amp; Fung, 2004</xref>).</p>
<p>Since the 1940s or earlier (<xref ref-type="bibr" rid="B77">Wong, 1941</xref> as cited by <xref ref-type="bibr" rid="B71">To et al., 2015</xref>; see also <xref ref-type="bibr" rid="B11">Bourgerie, 1990</xref>), [n] has been documented to be steadily merging towards [l] in progressively younger generations (<xref ref-type="bibr" rid="B31">Hashimoto, 1972</xref>; <xref ref-type="bibr" rid="B79">Yeung, 1980</xref>; <xref ref-type="bibr" rid="B11">Bourgerie, 1990</xref>; <xref ref-type="bibr" rid="B83">Zee, 1999a</xref>). Most recently, To et al. (<xref ref-type="bibr" rid="B71">2015</xref>) reported that nearly all of their participants, regardless of age group and gender, produced the innovative form [l] in place of the historical /n/ (94.2% of children versus 94.6% of adults). All available evidence thus points to (near-)completion of the merger by the start of the 21<sup>st</sup> century.</p>
<p>Because the innovative [l] has become so prevalent, Chen (<xref ref-type="bibr" rid="B14">2018</xref>) notes that this merger appears to be comparatively less stigmatized than other pairs such that &#8220;most users do not consider them wrong&#8221; (p. 7). Nevertheless, hypercorrection from historical [l] to [n] has been described&#8212;albeit with some debate&#8212;in the literature. In support, Pan (<xref ref-type="bibr" rid="B59">1981</xref>) reports [n] realizations of historical /l/ in a word list reading task. Zee (<xref ref-type="bibr" rid="B83">1999a</xref>) also notes that there are &#8220;hypercorrective individuals&#8221; who may use [n]. In contrast, Bourgerie (<xref ref-type="bibr" rid="B11">1990</xref>) impressionistically found little evidence of hypercorrection in sociolinguistic interviews, further claiming that no other studies had reported the [l] to [n] direction of change and that &#8220;most observers believe that this hypercorrection does not occur&#8221; (p. 46). Later studies only investigate historical /n/ items, leaving unknown the degree to which hypercorrection of [l] to [n] is present in HKC.</p>
<p>It has further been specifically noted that [n]&#8594;[l] varies by speech register such that [n] is associated with formality (<xref ref-type="bibr" rid="B59">Pan, 1981</xref>) and careful speech (<xref ref-type="bibr" rid="B83">Zee, 1999a</xref>). This is supported by production data, as Bourgerie (<xref ref-type="bibr" rid="B11">1990</xref>) found that impromptu, conversational speech included the most [l] realizations (80.5%), trailed by interview speech (58.2%) and public speech (38.6%). Thus, even while casual speech may mostly comprise innovative variants, HKC speakers also have access and exposure to speech registers that include the conservative [n] variant. The presence of stylistic variation complicates the interpretation of merger completion in this speech community across sporadic research reports with varying methodologies.</p>
</sec>
<sec>
<title>1.3.2. [&#331;&#809;]&#8594;[m&#809;] merger</title>
<p>The [&#331;&#809;]&#8594;[m&#809;] merger involves two historical phonemes, the syllabic velar and bilabial nasal consonants, that were largely non-contrastive. [&#331;&#809;] historically occurred in relatively few lexical items, limited to the three low tones. Of these, Bauer (<xref ref-type="bibr" rid="B2">1982a</xref>) contends that only four occur with any frequency in spoken HKC: two common words (&#20116; <italic>ng5</italic> &#8216;five,&#8217; &#21320; <italic>ng5</italic> &#8216;noon&#8217;) and two surnames (&#21555; <italic>ng4</italic> and &#20237; <italic>ng5</italic>). On the other hand, [m&#809;] historically occurred only as a single&#8212;albeit highly frequent&#8212;morpheme: the negation marker &#21780; <italic>m4</italic> &#8216;no, not.&#8217; Both categories therefore contained few types but still maintained a phonemic contrast. The single morpheme for /m&#809;/ combined with the rather limited inventory of /&#331;&#809;/ words suggests that contrastiveness has been largely uncompromised over the course of the merger.</p>
<p>Though this merger was first documented in the 1980s (<xref ref-type="bibr" rid="B2">Bauer, 1982a</xref>; <xref ref-type="bibr" rid="B3">1982b</xref>), evidence suggests that the [&#331;&#809;]&#8594;[m&#809;] change was well underway by the late 1970s and likely initiated around the 1940s from labial assimilation for the highly frequent word &#8216;five&#8217; (<xref ref-type="bibr" rid="B4">Bauer, 1986</xref>). Since then, To et al. (<xref ref-type="bibr" rid="B71">2015</xref>) reported that nearly all children (born 1993-1994) and adults (born 1960-1987) produce the innovative [m&#809;]. Specifically, 98.6% of children and 95.5% of adults used [m&#809;] for the word &#8216;five.&#8217; Given that this study collected data only for the most frequently occurring historical /&#331;&#809;/ word, the generalizability of these results to other words of this category is unknown. At face value, however, the [&#331;&#809;]&#8594;[m&#809;] merger appears to be near completion at the community level, and possibly complete for a great many individual speakers.</p>
<p>Some early sources demonstrate general or public awareness of the [&#331;&#809;]&#8594;[m&#809;] merger: Newspapers mentioned homophony of the negative morpheme <italic>m4</italic> and the surname <italic>ng4</italic> in 1980 and 1981, while a Chinese dictionary listed &#8216;five&#8217; with both velar and bilabial pronunciations (<xref ref-type="bibr" rid="B49">Lau, 1977</xref>, cited by <xref ref-type="bibr" rid="B2">Bauer, 1982a</xref>). As one of the mergers identified by proper pronunciation campaigns, it can be assumed that awareness has been maintained, though the degree of salience or stigmatization of this particular merger is uncertain. Finally, like [n]&#8594;[l], the conservative [&#331;&#809;] is associated with formality and can occur in style-shifting (e.g., [m&#809;] is produced more often during spontaneous speech than story or word list reading; <xref ref-type="bibr" rid="B2">Bauer, 1982a</xref>).</p>
</sec>
<sec>
<title>1.3.3. [&#331;]&#8596;&#216; merger</title>
<p>Unlike the former two mergers, the [&#331;]&#8596;&#216; merger&#8212;involving the syllable-initial velar nasal and its null-initial (&#216;) counterpart (also referred to as zero-initial, phonetically either a vowel or glottal stop onset; <xref ref-type="bibr" rid="B11">Bourgerie, 1990</xref>; <xref ref-type="bibr" rid="B14">Chen, 2018</xref>)&#8212;is described as historically allophonic, such that [&#331;] onset occurred with low tones while null-initial occurred with high tones. While this merger is not technically a merger of phonemic categories, it is a merger of lexical classes and is discussed as a merger within the Cantonese literature (e.g., <xref ref-type="bibr" rid="B71">To et al., 2015</xref>). To complicate matters, this merger has also undergone a reversal in direction over time, leading to a loss of the historical pattern of complementary distribution.</p>
<p>Since the start of the 20th century, both historical [&#331;] and &#216; have shown evidence of merging towards the other (note some exceptions of [&#331;] onset occurring with mid to high tones; <xref ref-type="bibr" rid="B1">Ball, 1907</xref> as cited in <xref ref-type="bibr" rid="B71">To et al., 2015</xref>). Specifically, reports suggest that &#216;&#8594;[&#331;] took place in the first half of the century (<xref ref-type="bibr" rid="B77">Wong, 1941</xref> as cited in <xref ref-type="bibr" rid="B71">To et al., 2015</xref>), which was subsequently supplanted by a [&#331;]&#8594;&#216; change in the second half (<xref ref-type="bibr" rid="B79">Yeung, 1980</xref>; <xref ref-type="bibr" rid="B11">Bourgerie, 1990</xref>). Along with the change in merger direction, younger speakers (roughly born since the 1960s&#8211;70s) used null-initial more often than not for <italic>both</italic> historical /&#331;/ and &#216; (Young, 1980; <xref ref-type="bibr" rid="B11">Bourgerie, 1990</xref>; <xref ref-type="bibr" rid="B83">Zee, 1999a</xref>). Likewise, the children in To et al. (<xref ref-type="bibr" rid="B71">2015</xref>) produced more null-initial variants for both historical /&#331;/ (65.0%) and historical &#216; (94.2%), in contrast to the adult proportions of &#216; production for historical /&#331;/ (37.5%) and historical &#216; (68.7%). However, both groups differentiated historical /&#331;/ and &#216; historical categories, using null-initial more for historically &#216; items. Thus, in spite of the bidirectional merger and variation in both historical classes, the two categories remain distinct to a degree from a community standpoint, although the original tone-based complementary distribution does appear to be lost.</p>
<p>A further complexity arises when we compare the results of To et al.&#8217;s (<xref ref-type="bibr" rid="B71">2015</xref>) survey to Bourgerie (<xref ref-type="bibr" rid="B11">1990</xref>). In To et al. (<xref ref-type="bibr" rid="B71">2015</xref>), children, as compared to adults, produced significantly more &#216; for /&#331;/ (65.0% versus 37.5%) and significantly less [&#331;] for &#216; (5.8% versus 31.3%). These group values are conspicuously similar to Bourgerie&#8217;s (<xref ref-type="bibr" rid="B11">1990</xref>) children and young adult group values of &#216; for [&#331;] (66.2% versus 36.7%) and [&#331;] for &#216; (0% versus 25.4%), despite the fact that Bourgerie&#8217;s child speakers (born 1970s&#8211;80s) correspond roughly to the adult cohort of To et al.&#8217;s study (born 1960&#8211;87). Since the methodologies of these two studies were quite different, direct comparisons may not be appropriate; however, taken at face value, there appears to have been no progression of the sound changes at the community level between the 1980s to the 2000s.<xref ref-type="fn" rid="n1">1</xref></p>
<p>One potential explanation is that stigmatization of the null-initial as &#8216;lazy&#8217; led to age-based patterning across the community, such that adult populations tend to produce the more &#8216;proper&#8217; and &#8216;prestigious&#8217; [&#331;] variant for both historical classes. Indeed, according to Chen (<xref ref-type="bibr" rid="B14">2018</xref>), this merger is more socially stigmatized compared to mergers like [n]&#8594;[l], perhaps due to its relative rarity. As with the other mergers, style-shifting is reported to occur for [&#331;]&#8594;&#216; (<xref ref-type="bibr" rid="B11">Bourgerie, 1990</xref>) such that impromptu speech featured the highest rate of null-initials for historical [&#331;] words (35.6%), followed by interview (21.8%) and public speech (8.7%). Further investigation, particularly of speakers born in the 1990s in more recent years, is necessary before coming to a conclusion about the status of the [&#331;]&#8596;&#216; variation.</p>
</sec>
</sec>
<sec>
<title>1.4. The present study</title>
<p>The present study investigates the production and perception of the [n]&#8594;[l], [&#331;&#809;]&#8594;[m&#809;], and [&#331;]&#8596;&#216; mergers across two generations in Hong Kong, examining both group-level and individual-level patterns. By studying phonetic variation and change in this context, we hope to contribute new insights to unresolved questions in the sound change literature, including those on the phonetic basis of sound change. Our research objectives are two-fold.</p>
<p>The first goal is to test how production-perception patterns generalize across sound changes with varying attributes. In doing so, we seek to clarify the factors that modulate the existence and nature of the production-perception link, for which evidence has been inconsistent. Characteristics that may be relevant include source of change (internal versus contact-induced), stage of change (e.g., early- versus late-stage), and type of change (shifting versus merging). The present study explores the comparison between three internally-driven consonant mergers within the same speaker-listeners using a consistent methodology. While [n]&#8594;[l] and [&#331;&#809;]&#8594;[m&#809;] appear to similarly be late-stage changes (compared to the mid-to-late stage [&#331;]&#8594;&#216;) according to recent report, these mergers are further differentiated by a host of other factors. We interpret these case studies in the context of previous results that have focused on cue-shifting and boundary-shifting vowel changes, with particular attention to the results of Pinget et al. (<xref ref-type="bibr" rid="B60">2020</xref>) who studied a pair of consonant voicing mergers.</p>
<p>The second objective is to add to documentation on the progression of these mergers specifically in the post-handover era of Hong Kong, revisiting several open threads in the HKC literature. Hong Kongers who grew up in the years since 1997 have had vastly different formative experiences from preceding generations, including increased exposure to Mandarin and reinforcement of &#8216;proper pronunciation&#8217; ideologies that brand the mergers as &#8216;lazy&#8217; (<xref ref-type="bibr" rid="B14">Chen, 2018</xref>). What can we glean about the status of the mergers across generations, taking into account these shifts in the sociolinguistic landscape? Because this aspect of the study is descriptive in nature, we seek to generally explore the patterning of variant use across demographic groups. Nonetheless, because of the historical context of these mergers as sound change, we place special interest in the extent to which age co-varies with usage of each phonetic variant and/or degree of lexical contrast between each pair. Specifically, based on their documentation of children&#8217;s production patterns in the early-2000s, To et al. (<xref ref-type="bibr" rid="B71">2015</xref>) determined that the [n]&#8594;[l] and [&#331;&#809;]&#8594;[m&#809;] mergers were near completion while &#216;-initial was gaining ground over [&#331;]. Do these patterns and their predicted trajectories hold out? Alongside providing a more recent picture of production norms, we contribute to this body of literature by adding a perceptual component through a word categorization task.</p>
</sec>
</sec>
<sec sec-type="methods">
<title>2. Methods</title>
<sec>
<title>2.1. Participants</title>
<p>As part of a larger project, data was collected from Cantonese speakers in both Hong Kong and Vancouver, Canada but only the Hong Kong data are discussed in this paper. This study received approval from the Human Subjects Ethics Sub-committee of the Hong Kong Polytechnic University (HSEARS20160829002), and participants gave written consent before taking part in the experiment. Data collection took place between September 2016 and February 2017. To examine the merger trajectories in apparent time, participants were recruited across two age groups: older middle-aged individuals and younger college-aged individuals.</p>
<p>Fifty-one participants were recruited from the Hong Kong Polytechnic University. The older generation consisted of 23 speakers (M = 54.09 years, <italic>SD</italic> = 5.20, range = 44&#8211;60), 12 of whom identified as female and 11 male. The younger generation consisted of 28 speakers (M = 19.42 years, <italic>SD</italic> = 2.28, range = 17&#8211;27), 13 of whom identified as female and 15 male. All reported Cantonese as a first language and all had lived in Hong Kong since birth, except one young man who had moved to Hong Kong before age three. Due to recording errors, production data from two participants were excluded; as such, only 49 participants were included in the production and production-perception analyses. Along with Cantonese, all participants reported some level of ability to speak and/or understand English and Mandarin, with variation in reported proficiency across domains. Summary demographics for the participants are provided in <xref ref-type="table" rid="T2">Tables 2</xref> and <xref ref-type="table" rid="T3">3</xref>.</p>
<table-wrap id="T2">
<label>Table 2</label>
<caption>
<p>Summary demographic information, including mean age (standard deviation), median Cantonese and English age of acquisition (AoA), and mean Cantonese-English language dominance score (standard deviation) as calculated by the BLP.</p>
</caption>
<table>
<thead>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Age Group</bold></td>
<td align="center" valign="top"><bold>Gender</bold></td>
<td align="center" valign="top"><bold>n</bold></td>
<td align="center" valign="top"><bold>Age</bold></td>
<td align="center" valign="top"><bold>Cantonese AoA</bold></td>
<td align="center" valign="top"><bold>English AoA</bold></td>
<td align="center" valign="top"><bold>Dominance Score</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">Older</td>
<td align="center" valign="top">F</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">55.25 (4.90)</td>
<td align="center" valign="top">0</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">&#8211;81 (39)</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="center" valign="top">M</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">52.82 (5.46)</td>
<td align="center" valign="top">0</td>
<td align="center" valign="top">4</td>
<td align="center" valign="top">&#8211;95 (39)</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">Younger</td>
<td align="center" valign="top">F</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">18.77 (1.74)</td>
<td align="center" valign="top">0</td>
<td align="center" valign="top">3</td>
<td align="center" valign="top">&#8211;91 (25)</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="center" valign="top">M</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">20.00 (2.59)</td>
<td align="center" valign="top">0</td>
<td align="center" valign="top">3</td>
<td align="center" valign="top">&#8211;84 (26)</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="T3">
<label>Table 3</label>
<caption>
<p>Mean ratings (on a scale from 0&#8211;6) of self-reported speaking and comprehension abilities for Cantonese, English, and Mandarin per demographic group.</p>
</caption>
<table>
<thead>
<tr>
<td colspan="9"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Age Group</bold></td>
<td align="center" valign="top"><bold>Gender</bold></td>
<td align="center" valign="top"><bold>n</bold></td>
<td align="center" valign="top" colspan="2"><bold>Cantonese</bold></td>
<td align="center" valign="top" colspan="2"><bold>English</bold></td>
<td align="center" valign="top" colspan="2"><bold>Mandarin</bold></td>
</tr>
<tr>
<td colspan="9"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"><bold>Speak</bold></td>
<td align="center" valign="top"><bold>Understand</bold></td>
<td align="center" valign="top"><bold>Speak</bold></td>
<td align="center" valign="top"><bold>Understand</bold></td>
<td align="center" valign="top"><bold>Speak</bold></td>
<td align="center" valign="top"><bold>Understand</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td colspan="9"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">Older</td>
<td align="center" valign="top">F</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">5.50</td>
<td align="center" valign="top">5.50</td>
<td align="center" valign="top">3.92</td>
<td align="center" valign="top">4.25</td>
<td align="center" valign="top">2.25</td>
<td align="center" valign="top">2.75</td>
</tr>
<tr>
<td colspan="8"><hr/></td>
</tr>
<tr>
<td align="center" valign="top">M</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">5.82</td>
<td align="center" valign="top">5.91</td>
<td align="center" valign="top">3.82</td>
<td align="center" valign="top">4.27</td>
<td align="center" valign="top">3.18</td>
<td align="center" valign="top">3.73</td>
</tr>
<tr>
<td colspan="9"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">Younger</td>
<td align="center" valign="top">F</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">5.46</td>
<td align="center" valign="top">5.38</td>
<td align="center" valign="top">3.77</td>
<td align="center" valign="top">3.69</td>
<td align="center" valign="top">3.62</td>
<td align="center" valign="top">4.08</td>
</tr>
<tr>
<td colspan="8"><hr/></td>
</tr>
<tr>
<td align="center" valign="top">M</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">5.07</td>
<td align="center" valign="top">5.13</td>
<td align="center" valign="top">3.43</td>
<td align="center" valign="top">3.73</td>
<td align="center" valign="top">2.93</td>
<td align="center" valign="top">3.87</td>
</tr>
<tr>
<td colspan="9"><hr/></td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>2.2. Materials</title>
<sec>
<title>2.2.1. Production stimuli</title>
<p>Production stimuli consisted of 64 Cantonese words presented visually in Chinese orthography<xref ref-type="fn" rid="n2">2</xref> accompanied by an English translation (full word list in Appendix). Each target historical onset (/l/, /n/, /&#331;/, &#216;) appeared in four words while each target historical syllabic nasal (/&#331;&#809;/, /m&#809;/) appeared in three, totalling 22 target words. The remaining 42 consisted of filler word pairs with non-target Cantonese contrasts, including onset stop aspiration (/ts/-/ts&#688;/, /t/-/t&#688;/, /k/-/k&#688;/) and other merging consonants (onset /k/-/k&#695;/; coda /n/-/&#331;/, /t/-/k/).</p>
</sec>
<sec>
<title>2.2.2. Perception stimuli</title>
<p>Perception stimuli consisted of three 13-step continua generated between Cantonese real-word minimal pairs, one per merger as listed in <xref ref-type="table" rid="T4">Table 4</xref>. To create these continua, the target minimal pairs were produced by a 33-year-old Cantonese speaker and trained linguist who grew up in Hong Kong. Stimuli were digitally recorded at a sample frequency of 44.1 kHz in a sound-attenuated booth using an AKG C520 headset mic with a USBPre 2 Pre-Amp. The speaker read from transcriptions of each target word, accompanied by the Chinese orthography, to elicit maximally contrastive pronunciations. She was asked to produce each pair naturally but as similarly as she could (aside from the target contrast) to facilitate more natural-sounding synthesis. Each target word was produced at least three times. The clearest and most similar recorded pairs were selected for synthesis. Decisions were based on duration, pitch, and intonation similarity assessed by the first author via auditory and spectral analysis in Praat (Version 5.4.08, <xref ref-type="bibr" rid="B10">Boersma &amp; Weenink, 2015</xref>).</p>
<table-wrap id="T4">
<label>Table 4</label>
<caption>
<p>Minimal word pairs for each synthesized continuum.</p>
</caption>
<table>
<thead>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Target Merger</bold></td>
<td align="center" valign="top"><bold>IPA</bold></td>
<td align="center" valign="top"><bold>Jyutping Romanization</bold></td>
<td align="center" valign="top"><bold>Chinese orthography</bold></td>
<td align="center" valign="top"><bold>English translation</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">[n]&#8594;[l]</td>
<td align="center" valign="top">[lou] - [nou]</td>
<td align="center" valign="top"><italic>lou5-nou5</italic></td>
<td align="center" valign="top">&#32769;-&#33126;</td>
<td align="center" valign="top">&#8216;old&#8217;- &#8216;brain&#8217;</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">[&#331;&#809;]&#8594;[m&#809;]</td>
<td align="center" valign="top">[m&#809;] - [&#331;&#809;]</td>
<td align="center" valign="top"><italic>m4-ng4</italic></td>
<td align="center" valign="top">&#21780;-&#21555;</td>
<td align="center" valign="top">&#8216;no, not&#8217;- &#8216;Ng (surname)&#8217;</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">[&#331;]&#8596;&#216;</td>
<td align="center" valign="top">[a&#720;k&#160;&#160;&#794;] - [&#331;a&#720;k&#160;&#160;&#794;]</td>
<td align="center" valign="top"><italic>aak1-ngaak1</italic></td>
<td align="center" valign="top">&#25569;-&#21571;</td>
<td align="center" valign="top">&#8216;shake [hands]&#8217; - &#8216;deceive&#8217;</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
</tbody>
</table>
</table-wrap>
<p>These six natural productions were then used as endpoints to create three word-pair continua using <sc>tandem-straight</sc> in Matlab (<xref ref-type="bibr" rid="B36">Kawahara et al., 2008</xref>). The entire word forms were used as the endpoints, which, given <sc>tandem-straight</sc>&#8217;s global synthesis methods involving acoustic decomposition and generation, allows for natural-sounding resynthesis that retains redundant co-variation. Twenty-five acoustically-equidistant steps were synthesized between the duration of the target words, with each pair time-aligned at acoustic landmarks (e.g., phone boundaries) to facilitate morphing. From these, every odd-numbered step was selected to result in thirteen equidistant steps. The resulting continuum steps gradually morph from one endpoint token to the other, thus including in the synthesis an array of phonetic differences across the tokens. The natural endpoints were carefully selected to be as similar as possible in terms of non-contrastive elements. As such, the main perceptible change across continua is between the target sounds in each pair. The continua are available at <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://osf.io/mk2v4/?view_only=51cfc7a974f345678e226944023dcb39">https://osf.io/mk2v4/?view_only=51cfc7a974f345678e226944023dcb39</ext-link>. <xref ref-type="fig" rid="F1">Figure 1</xref> illustrates the transition between the initial sound in the aak1-ngaak1 continuum (i.e., the beginning of the vowel [a&#720;] and the entire duration of [&#331;]) with waveforms and spectrograms of the midpoint and endpoint recordings. Judgments from the first author and two other Cantonese-speaking colleagues were used to select the most natural-sounding continuum per merger. In all, the 13 variants from each continuum totalled 39 tokens.</p>
<fig id="F1">
<label>Figure 1</label>
<caption>
<p>Waveforms (top) and spectrograms (bottom) of the endpoints and midpoint of the final 13-step <italic>aak1-ngaak1</italic> continuum. From left to right: null-initial [a&#720;k&#160;&#794;&#160;] (step 1), maximally ambiguous initial (step 7), velar nasal-initial [&#331;a&#720;k&#160;&#160;&#794;&#160;] (step 13).</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g1.png"/>
</fig>
</sec>
<sec>
<title>2.2.3. Language questionnaire</title>
<p>Participants completed the Bilingual Language Profile (BLP; <xref ref-type="bibr" rid="B9">Birdsong, Gertken &amp; Amengual, 2012</xref>), which was used to quantify Cantonese-English language dominance scores. A small number of modifications were made to better reflect the language experiences of the population (e.g., accounting for the education system in Hong Kong and the role of Mandarin in Hong Kong, alongside Cantonese and English). Additional questions about biographical information, language background, and language experiences were included to adjudicate participant inclusion criteria. Scores were calculated from the BLP portion of the questionnaire. The BLP is bound between &#8211;218 to +218. Zero indicates balanced dominance between the two languages while negative and positive scores represent relative Cantonese and English dominance, respectively. The range was negative (&#8211;165, &#8211;2), indicating Cantonese dominance for all speakers. Mean values and standard deviation for the four demographic groups are reported in <xref ref-type="table" rid="T2">Table 2</xref>. An analysis of variance with the BLP dominance score as the dependent measure and Age and Gender as independent variables revealed no significant effects. This confirms the uniform Cantonese dominance in the sample, and BLP scores are not considered further.</p>
</sec>
<sec>
<title>2.2.4. Post-task awareness interview</title>
<p>An exit interview was administered to examine metalinguistic awareness about the mergers in the participant&#8217;s own speech and experiences. The interview was conducted in English or Mandarin, supplemented by Cantonese when necessary to ensure comprehension.<xref ref-type="fn" rid="n3">3</xref> Participants were presented with a list of minimal or near-minimal word pairs designed to historically contrast the target sounds. For each pair, they were asked to say the words as they would in casual speech, explain any differences between their productions, and name the prescribed pronunciation if they were aware of any. At the end, participants were asked if they were familiar with the term &#25078;&#38899; <italic>laan5 jam1</italic> &#8216;lazy pronunciation,&#8217; what they thought it meant, and whether they believe (or have been told by others) that they use it in their own speech.</p>
</sec>
</sec>
<sec>
<title>2.3. Procedure</title>
<sec>
<title>2.3.1. Experiment</title>
<p>Participants were seated alone at a computer in a sound-attenuated booth where they were presented with the production task followed by the perception task. Instructions were provided visually in English on the screen. Upon completion, the experimenter returned to the room to answer any questions while participants filled out the language questionnaire, then conducted the exit interview. To reduce the chance that participants would change their speech behaviour due to realizing the purpose of the experiment (i.e., our interest in the target sounds), the production task, which included fillers, was ordered prior to the perception task, which involved only the target sounds.</p>
<p>In the self-paced production task, Chinese characters and the English translation were visually presented using E-Prime 2.0 software (<xref ref-type="bibr" rid="B64">Schneider, Eschman, &amp; Zuccolotto, 2007</xref>) in three randomized blocks, where each word appeared once per block (192 trials in total). Participants were asked to read the Chinese characters out loud. Productions were digitally recorded using an AKG C520 headset microphone connected to a UR22MKII interface. Participants pressed the zero key on the keyboard to begin a two-second recording for each trial. They were allowed to re-record as many times as they wished before moving on to the next word by pressing the spacebar. Participants spent approximately 20&#8211;30 minutes on this task.</p>
<p>Perception stimuli were auditorily presented in a two-alternative forced choice lexical identification task. To reduce influences of explicit knowledge about &#8216;proper&#8217; pronunciation, participants were instructed to respond as quickly as possible and not overthink their response. Using E-Prime 2.0, the synthesized Cantonese words were played over AKG K77 Perception headphones at a comfortable listening level, accompanied by a visual display of the appropriate minimal pair words in Cantonese orthography and the English translation. For each trial, participants heard the audio stimulus once and saw two word choices labelled with &#8216;1&#8217; or &#8216;5&#8217; (on the left or right side of the screen). They were asked to press the button on the button box (i.e., &#8216;1&#8217; or &#8216;5&#8217;) that corresponded to the word they heard. If no response was registered within three seconds, the next trial began automatically. Each token was repeated three times throughout the experiment, randomly presented over three blocks (117 trials in total).</p>
</sec>
<sec>
<title>2.3.2. Production coding</title>
<p>To assess production, two phonetically-trained Cantonese speakers who grew up in Hong Kong coded the onset of each item, or in the case of the syllabic nasals, the sole segment comprising the word. Items were blocked by talker, randomized, and presented to the coders blind, without knowledge of the intended lexical items. Coders categorized the onset from a closed set, presented orthographically as six options: &#8216;l,&#8217; &#8216;m,&#8217; &#8216;n,&#8217; &#8216;ng,&#8217; &#8216;a vowel,&#8217; or &#8216;other.&#8217; The 204 items identified by both transcribers as &#8216;other&#8217; were coded by the first author. These items either did not include a usable production (e.g., recording errors where the word was cut off or missing) or included a mispronounced production (e.g., <italic>saam1</italic> &#8216;clothing&#8217; instead of <italic>lau1</italic> &#8216;coat&#8217;).</p>
<p>The coders agreed on 4933 trials. Inter-rater reliability for the two coders was calculated in R (<xref ref-type="bibr" rid="B61">R Core Team, 2020</xref>) using the Kappa statistic for two raters unweighted using the kappa2() method from irr() (<xref ref-type="bibr" rid="B24">Gamer, Lemon, Fellows, &amp; Singh, 2019</xref>) from the perspective of whether there was agreement on the transcription of each item&#8217;s coded onset. These data from the two coders are reported in <xref ref-type="table" rid="T5">Table 5</xref>. Kappa scores ranged from values that are in the range described as near perfect agreement for historical /l/ and /n/-initial words to moderate agreement scores for the historical syllabic nasals. To resolve disagreement, all items (n = 725) on which the two coders disagreed were presented in a similar format to the first author: blinded as to their lexical identity and organized by talker. Sixteen of these disagreement items were removed due to recording errors or mispronunciations. The ultimate coding for each item was then determined as whichever category two out of the three coders agreed.</p>
<table-wrap id="T5">
<label>Table 5</label>
<caption>
<p>Inter-rater reliability measures for the auditory coding of critical items from the perspective of the historical phoneme.</p>
</caption>
<table>
<thead>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Historical</bold><break/><bold>Phoneme</bold></td>
<td align="center" valign="top"><bold>Number of</bold><break/><bold>Items Coded</bold></td>
<td align="center" valign="top"><bold>Kappa</bold><break/><bold>Statistic</bold></td>
<td align="center" valign="top"><bold><italic>p</italic>-value</bold></td>
<td align="center" valign="top"><bold>Percentage</bold><break/><bold>Agreement</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">[l]</td>
<td align="center" valign="top">1067</td>
<td align="center" valign="top">0.82</td>
<td align="center" valign="top">&lt;0.001</td>
<td align="center" valign="top">95.9%</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">[n]</td>
<td align="center" valign="top">1327</td>
<td align="center" valign="top">0.85</td>
<td align="center" valign="top">&lt;0.001</td>
<td align="center" valign="top">90.4%</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">[m&#809;]</td>
<td align="center" valign="top">801</td>
<td align="center" valign="top">0.48</td>
<td align="center" valign="top">&lt;0.001</td>
<td align="center" valign="top">92.5%</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">[&#331;&#809;]</td>
<td align="center" valign="top">801</td>
<td align="center" valign="top">0.46</td>
<td align="center" valign="top">&lt;0.001</td>
<td align="center" valign="top">79.3%</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">[&#331;]</td>
<td align="center" valign="top">798</td>
<td align="center" valign="top">0.60</td>
<td align="center" valign="top">&lt;0.001</td>
<td align="center" valign="top">84.5%</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">&#216;</td>
<td align="center" valign="top">1068</td>
<td align="center" valign="top">0.59</td>
<td align="center" valign="top">&lt;0.001</td>
<td align="center" valign="top">81.0%</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
</sec>
<sec>
<title>3. Results</title>
<sec>
<title>3.1. Production</title>
<sec>
<title>3.1.1. [n]&#8594;[l] merger</title>
<p>Data for the [n]&#8594;[l] merger (n = 1158) were analyzed in a logistic mixed effects regression model with the probability of [l] productions ([l] = 1, [n] = 0) as the dependent measure. All categorical variables were sum coded (Historical Pattern: /l/ = 1, /n/ = &#8211;1; Talker Age: Older = 1, Younger = &#8211;1; Talker Gender: Male = 1, Female = &#8211;1) and entered as possible main effects and interactions. There were random intercepts for Subject and Item, with Historical Pattern as a by-subject random slope.<xref ref-type="fn" rid="n4">4</xref></p>
<p>The model output is summarized in <xref ref-type="table" rid="T6">Table 6</xref>. There was a significant intercept, and a significant effect of Historical Pattern. These results indicate that [l] productions are overall more likely, and that [l] productions are even more likely on words that are historically pronounced with /l/. These results are visualized in <xref ref-type="fig" rid="F2">Figure 2</xref>, separated by the non-significant factors of Talker Age and Gender in order to transparently present the individual variation that is present. The figures present a barplot to indicate the group patterns, along with individual data points for each individual; these individual data points are somewhat transparent so that individual overlap is observable. Lines connect an individual&#8217;s data points for the two word classes. While the group averages indicate that both historically /l/ and /n/ words are usually pronounced with [l], <xref ref-type="fig" rid="F2">Figure 2</xref> illustrates how there is both group-level separation between historical categories (confirming the model result) and considerable variation on the individual level.</p>
<table-wrap id="T6">
<label>Table 6</label>
<caption>
<p>Model output for [n] and [l] merger in production. Significant factors (<italic>&#945;</italic> level = 0.05) are bolded.</p>
</caption>
<table>
<thead>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top"><bold>B</bold></td>
<td align="left" valign="top"><bold>Standard Error</bold></td>
<td align="left" valign="top"><bold><italic>z</italic>-value</bold></td>
<td align="left" valign="top"><bold><italic>p</italic>-value</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Intercept</bold></td>
<td align="left" valign="top"><bold>2.4026</bold></td>
<td align="left" valign="top"><bold>0.52</bold></td>
<td align="left" valign="top"><bold>4.62</bold></td>
<td align="left" valign="top"><bold>3.83E-06</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Historical Pattern</bold></td>
<td align="left" valign="top"><bold>1.1601</bold></td>
<td align="left" valign="top"><bold>0.3818</bold></td>
<td align="left" valign="top"><bold>3.038</bold></td>
<td align="left" valign="top"><bold>0.00238</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Age Group</td>
<td align="left" valign="top">0.2192</td>
<td align="left" valign="top">0.4088</td>
<td align="left" valign="top">0.536</td>
<td align="left" valign="top">0.59182</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Gender</td>
<td align="left" valign="top">0.2537</td>
<td align="left" valign="top">0.4084</td>
<td align="left" valign="top">0.621</td>
<td align="left" valign="top">0.5344</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Historical Pattern : Age Group</td>
<td align="left" valign="top">&#8211;0.297</td>
<td align="left" valign="top">0.2121</td>
<td align="left" valign="top">&#8211;1.4</td>
<td align="left" valign="top">0.16156</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Historical Pattern : Gender</td>
<td align="left" valign="top">0.1822</td>
<td align="left" valign="top">0.2112</td>
<td align="left" valign="top">0.863</td>
<td align="left" valign="top">0.38839</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Age Group : Gender</td>
<td align="left" valign="top">&#8211;0.0898</td>
<td align="left" valign="top">0.4077</td>
<td align="left" valign="top">&#8211;0.22</td>
<td align="left" valign="top">0.82567</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Historical Pattern : Age Group : Gender</td>
<td align="left" valign="top">0.1713</td>
<td align="left" valign="top">0.2109</td>
<td align="left" valign="top">0.812</td>
<td align="left" valign="top">0.41668</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F2">
<label>Figure 2</label>
<caption>
<p>Group means and individual data points for proportion [l] productions for historically /l/- and /n/-onset words, faceted by age and gender. The lines connecting data points connect values for an individual. The shading of the individual points is to allow the visualization of individual overlap.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g2.png"/>
</fig>
<p>To better visualize the individual-level data, we also present each individual&#8217;s mean proportion of the innovative variant for the two historical categories as a scatter plot in <xref ref-type="fig" rid="F3">Figure 3</xref>. Here, the bottom right quadrant represents conservative contrastive productions ([n] for /n/ and [l] for /l/), the top right quadrant represents innovative merged productions ([l] for /n/, [l] for /l/), the bottom left represents hypercorrective merged productions ([n] for /n/, [n] for /l/), and the top left represents a fully flipped contrastive production ([l] for /n/, [n] for /l/). Most participants are clustered in the top right quadrant, realizing /n/ as [l] the majority (or all) of the time, though there is considerable variation ranging to fully historically contrastive for a small number of participants in the bottom right corner. Participants who lean conservative include a mix of older and younger individuals. Five participants who are mainly younger (two younger men, two younger women, and one older woman) appear relatively merged in the hypercorrective direction, realizing a majority of both /n/ and /l/ words as [n]. As expected, none are contrastive in the opposite-to-historical direction. This visual inspection of the individual variation further supports the non-significance of age and gender as a predictor of innovative variant productions.</p>
<fig id="F3">
<label>Figure 3</label>
<caption>
<p>Proportion [l] for historically /n/ items (y-axis) and historically /l/ items (x-axis) for each individual. Women are plotted with circles and men with triangles. Older speakers are in red and younger speakers in blue. The values have been jittered to minimize overlap.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g3.png"/>
</fig>
</sec>
<sec>
<title>3.1.2. [&#331;&#809;]&#8594;[m&#809;] merger</title>
<p>Data for the [&#331;&#809;]&#8594;[m&#809;] merger (n = 872) were analyzed with a logistic mixed effects regression model in an identical fashion to those for the [n]&#8594;[l] merger, with the variant-specific variables coded analogously (i.e., dependent variable as the probability of [m&#809;] productions: [m&#809;] = 1, [&#331;&#809;] = 0; Historical Pattern: /m&#809;/ = 1, /&#331;&#809;/ = &#8211;1).</p>
<p>The model output is summarized in <xref ref-type="table" rid="T7">Table 7</xref>. There was a significant intercept, along with significant effects of Historical Pattern, Age Group, and Gender. There were no significant interactions. The results are visualized in <xref ref-type="fig" rid="F4">Figure 4</xref> separated by Historical Pattern, Age Group, and Gender with the bars showing the group pattern, overlaid with individual results. These visualizations corroborate the statistical output. Overall, talkers are more likely to use [m&#809;] for both historical word classes, but the effect of Historical Pattern confirms that [m&#809;] is more likely for historically /m&#809;/ words. In addition, older speakers are more likely to show the innovative pattern, using higher rates of [m&#809;], while male participants are significantly more likely to use [m&#809;] than female participants.</p>
<table-wrap id="T7">
<label>Table 7</label>
<caption>
<p>Model output for [&#331;&#809;] and [m&#809;] merger in production. Significant factors (<italic>&#945;</italic> level = 0.05) are bolded.</p>
</caption>
<table>
<thead>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top"><bold>B</bold></td>
<td align="left" valign="top"><bold>Standard Error</bold></td>
<td align="left" valign="top"><bold><italic>z-</italic>value</bold></td>
<td align="left" valign="top"><bold><italic>p-</italic>value</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Intercept</bold></td>
<td align="left" valign="top"><bold>3.21905</bold></td>
<td align="left" valign="top"><bold>0.57616</bold></td>
<td align="left" valign="top"><bold>5.587</bold></td>
<td align="left" valign="top"><bold>2.31E-08</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Historical Pattern</bold></td>
<td align="left" valign="top"><bold>1.08923</bold></td>
<td align="left" valign="top"><bold>0.49451</bold></td>
<td align="left" valign="top"><bold>2.203</bold></td>
<td align="left" valign="top"><bold>0.02762</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Age Group</bold></td>
<td align="left" valign="top"><bold>0.71872</bold></td>
<td align="left" valign="top"><bold>0.36322</bold></td>
<td align="left" valign="top"><bold>1.979</bold></td>
<td align="left" valign="top"><bold>0.04784</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Gender</bold></td>
<td align="left" valign="top"><bold>1.14734</bold></td>
<td align="left" valign="top"><bold>0.3618</bold></td>
<td align="left" valign="top"><bold>3.171</bold></td>
<td align="left" valign="top"><bold>0.00152</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Historical Pattern : Age Group</td>
<td align="left" valign="top">&#8211;0.2848</td>
<td align="left" valign="top">0.21884</td>
<td align="left" valign="top">&#8211;1.301</td>
<td align="left" valign="top">0.19312</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Historical Pattern : Gender</td>
<td align="left" valign="top">0.05902</td>
<td align="left" valign="top">0.21788</td>
<td align="left" valign="top">0.271</td>
<td align="left" valign="top">0.78649</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Age Group : Gender</td>
<td align="left" valign="top">&#8211;0.03475</td>
<td align="left" valign="top">0.35812</td>
<td align="left" valign="top">&#8211;0.097</td>
<td align="left" valign="top">0.92271</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Historical Pattern : Age Group : Gender</td>
<td align="left" valign="top">&#8211;0.04729</td>
<td align="left" valign="top">0.21276</td>
<td align="left" valign="top">&#8211;0.222</td>
<td align="left" valign="top">0.8241</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F4">
<label>Figure 4</label>
<caption>
<p>Group means and individual data points for proportion [m&#809;] productions for historically syllabic /m&#809;/ and /&#331;&#809;/ words, faceted by age and gender. The lines connecting data points connect values for an individual. The shading of the individual points is to allow the visualization of individual overlap.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g4.png"/>
</fig>
<p><xref ref-type="fig" rid="F5">Figure 5</xref> presents the individual proportion scatterplot for [&#331;&#809;]&#8594;[m&#809;], as described for [n]&#8594;[l]. The vast majority of participants are located in the top right quadrant, representing innovative productions where both /m&#809;/ and /&#331;&#809;/ are realized as [m&#809;] the majority of the time. Many individuals are fully merged, clustering in the top right corner. There is some variation in proportions of [m&#809;] productions, though no participants produce flipped categories, and two younger women produce a hypercorrective pattern (majority [&#331;&#809;] for both /&#331;&#809;/ and /m&#809;/). Overall, this visualization makes clear that the individuals who produce variable, conservative, or hypercorrective patterns are overwhelmingly younger women (along with some older women and younger men), confirming the model Age and Gender effects where women and younger participants skew towards producing [&#331;&#809;].</p>
<fig id="F5">
<label>Figure 5</label>
<caption>
<p>Proportion [m&#809;] for historically /&#331;&#809;/ items (y-axis) and historically /m&#809;/ items (x-axis) for each individual. Women are plotted with circles and men with triangles. Older speakers are in red and younger speakers in blue. The values have been jittered to minimize overlap.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g5.png"/>
</fig>
</sec>
<sec>
<title>3.1.3. [&#331;]&#8596;&#216; merger</title>
<p>Data for the [&#331;]&#8596;&#216; merger (n = 1023) were analyzed similarly to the two previous mergers with a logistic mixed effects regression model. The variant-specific variables were coded with null-initial assumed as the innovative variant (i.e., dependent variable as the probability of vowel-initial productions: &#216; = 1, [&#331;] = 0; Historical Pattern: &#216; = 1, [&#331;] = &#8211;1).</p>
<p>The model output is summarized in <xref ref-type="table" rid="T8">Table 8</xref>. There was a significant intercept, and a significant effect of Historical Pattern. The results are visualized in <xref ref-type="fig" rid="F6">Figure 6</xref> separated by Historical Pattern, along with the nonsignificant effects of Age Group and Gender. Again, the bars demonstrate the group pattern and individuals&#8217; results are presented as a top layer. Participants are more likely to use [&#331;] for all of these lexical items, with the effect of Historical Pattern indicating that vowel initial productions are more likely on historically vowel initial words, but these are still, overall, unlikely. Some clearly prefer vowel-initial onsets, however; individuals from three of the four demographic groups use vowel-initial onsets 100% of the time for both historically vowel-initial and [&#331;]-initial word classes.</p>
<table-wrap id="T8">
<label>Table 8</label>
<caption>
<p>Model output for vowel initial and initial [&#331;] merger in production. Significant factors (<italic>&#945;</italic> level = 0.05) are bolded.</p>
</caption>
<table>
<thead>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top"><bold>B</bold></td>
<td align="left" valign="top"><bold>Standard Error</bold></td>
<td align="left" valign="top"><bold><italic>z-</italic>value</bold></td>
<td align="left" valign="top"><bold><italic>p-</italic>value</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Intercept</bold></td>
<td align="left" valign="top"><bold>&#8211;2.83844</bold></td>
<td align="left" valign="top"><bold>0.64677</bold></td>
<td align="left" valign="top"><bold>&#8211;4.389</bold></td>
<td align="left" valign="top"><bold>1.14E-05</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Historical Pattern</bold></td>
<td align="left" valign="top"><bold>0.58628</bold></td>
<td align="left" valign="top"><bold>0.28279</bold></td>
<td align="left" valign="top"><bold>2.073</bold></td>
<td align="left" valign="top"><bold>0.0382</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Age Group</td>
<td align="left" valign="top">0.43279</td>
<td align="left" valign="top">0.59628</td>
<td align="left" valign="top">0.726</td>
<td align="left" valign="top">0.468</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Gender</td>
<td align="left" valign="top">0.60362</td>
<td align="left" valign="top">0.58385</td>
<td align="left" valign="top">1.034</td>
<td align="left" valign="top">0.3012</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Historical Pattern : Age Group</td>
<td align="left" valign="top">0.10844</td>
<td align="left" valign="top">0.16359</td>
<td align="left" valign="top">0.663</td>
<td align="left" valign="top">0.5074</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Historical Pattern : Gender</td>
<td align="left" valign="top">&#8211;0.09481</td>
<td align="left" valign="top">0.16497</td>
<td align="left" valign="top">&#8211;0.575</td>
<td align="left" valign="top">0.5655</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Age Group : Gender</td>
<td align="left" valign="top">0.62266</td>
<td align="left" valign="top">0.58525</td>
<td align="left" valign="top">1.064</td>
<td align="left" valign="top">0.2874</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Historical Pattern : Age Group : Gender</td>
<td align="left" valign="top">&#8211;0.22533</td>
<td align="left" valign="top">0.18756</td>
<td align="left" valign="top">&#8211;1.201</td>
<td align="left" valign="top">0.2296</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F6">
<label>Figure 6</label>
<caption>
<p>Group means and individual data points for proportion vowel-initial productions for historically [&#331;]- and null-initial words, faceted by age and gender. The lines connecting data points connect values for an individual. The shading of the individual points is to allow the visualization of individual overlap.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g6.png"/>
</fig>
<p>These results can be seen more clearly in the individual scatterplot (<xref ref-type="fig" rid="F7">Figure 7</xref>). Unlike the other two mergers, the vast majority of participants are located in the bottom left quadrant, representing merged productions where both historical null and [&#331;] are overwhelmingly realized as [&#331;]-initial, including many individuals who are fully merged (clustering in the bottom left corner). Although no age effect was detected in the model, visual inspection of the individual data provides some insight into trends of variation. Specifically, the few participants who show a comparatively conservative pattern (lower right quadrant) are all older while all but one of the participants who show merging towards null (top right quadrant) are younger.</p>
<fig id="F7">
<label>Figure 7</label>
<caption>
<p>Proportion vowel-initial productions for historically [&#331;]-initial items (y-axis) and historically null-initial items (x-axis) for each individual. Women are plotted with circles and men with triangles. Older speakers are in red and younger speakers in blue. The values have been jittered to minimize overlap.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g7.png"/>
</fig>
</sec>
<sec>
<title>3.1.4. Summary of production</title>
<p>Group-level production results indicate that all three Hong Kong Cantonese merger pairs maintain some form of community-wide contrast, as indicated by a statistically significant effect of historical category. However, underlying these community averages was a wide variety of merging patterns at the individual level. For [n]&#8594;[l], speakers ranged from maximally merged to maximally contrastive, while for [&#331;&#809;]&#8594;[m&#809;] and [&#331;]&#8596;&#216;, the majority of speakers were either fully merged or demonstrated an intermediate contrast. There was also a subset of individuals who were &#8216;hypercorrective&#8217; such that they produced mainly conservative variants for both historical categories (e.g., [n] for both /n/ and /l/). Although degree of contrastiveness varied substantially across individuals, in none of the three mergers is it predicted by demographic factors like age or gender (i.e., no significant interactions were found), which suggests that these mergers should not be characterized as changes-in-progress.</p>
</sec>
</sec>
<sec>
<title>3.2. Perception</title>
<sec>
<title>3.2.1. [n]&#8594;[l] merger</title>
<p>Null responses were removed, accounting for just over 1.5% of the data. The remaining data (n = 3265) were fit to a logistic mixed effects regression model predicting the likelihood of a historical /l/ word response (/l/ = 1, /n/ = 0). Continuum step was centered and scaled, and Talker Age and Gender were sum coded (Age: Older = 1, Younger = &#8211;1; Talker Gender: Male = 1, Female = &#8211;1). Subject was a random effect with Step as a by-subject random slope.<xref ref-type="fn" rid="n5">5</xref></p>
<p>The model output is reported in <xref ref-type="table" rid="T9">Table 9</xref>, and the results are visualized in <xref ref-type="fig" rid="F8">Figure 8</xref>. There was a significant intercept, indicating that /l/ word responses were more likely, and main effects of Step and Age. The interaction between Step and Age was also significant. While, overall, participant response functions varied predictably by continuum step with more [n]-like realizations receiving more /n/ word responses, younger listeners present comparatively more extreme responses at the continuum endpoints, resulting in a steeper response function.</p>
<table-wrap id="T9">
<label>Table 9</label>
<caption>
<p>Model output for [n]&#8594;[l] merger in perception. Significant factors (<italic>&#945;</italic> level = 0.05) are bolded.</p>
</caption>
<table>
<thead>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top"><bold>B</bold></td>
<td align="left" valign="top"><bold>Standard Error</bold></td>
<td align="left" valign="top"><bold><italic>z-</italic>value</bold></td>
<td align="left" valign="top"><bold><italic>p-</italic>value</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Intercept</bold></td>
<td align="left" valign="top"><bold>&#8211;0.44226</bold></td>
<td align="left" valign="top"><bold>0.11441</bold></td>
<td align="left" valign="top"><bold>&#8211;3.866</bold></td>
<td align="left" valign="top"><bold>0.000111</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Step (centered)</bold></td>
<td align="left" valign="top"><bold>&#8211;1.85118</bold></td>
<td align="left" valign="top"><bold>0.23359</bold></td>
<td align="left" valign="top"><bold>&#8211;7.925</bold></td>
<td align="left" valign="top"><bold>2.28E-15</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Age Group</bold></td>
<td align="left" valign="top"><bold>0.28777</bold></td>
<td align="left" valign="top"><bold>0.11327</bold></td>
<td align="left" valign="top"><bold>2.54</bold></td>
<td align="left" valign="top"><bold>0.01107</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Gender</td>
<td align="left" valign="top">0.10879</td>
<td align="left" valign="top">0.11218</td>
<td align="left" valign="top">0.97</td>
<td align="left" valign="top">0.332137</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Step : Age Group</bold></td>
<td align="left" valign="top"><bold>0.69128</bold></td>
<td align="left" valign="top"><bold>0.23094</bold></td>
<td align="left" valign="top"><bold>2.993</bold></td>
<td align="left" valign="top"><bold>0.002759</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Step : Gender</td>
<td align="left" valign="top">0.1023</td>
<td align="left" valign="top">0.22859</td>
<td align="left" valign="top">0.448</td>
<td align="left" valign="top">0.654494</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Age Group : Gender</td>
<td align="left" valign="top">0.05584</td>
<td align="left" valign="top">0.11223</td>
<td align="left" valign="top">0.498</td>
<td align="left" valign="top">0.618813</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Step : Age Group : Gender</td>
<td align="left" valign="top">&#8211;0.1741</td>
<td align="left" valign="top">0.22868</td>
<td align="left" valign="top">&#8211;0.761</td>
<td align="left" valign="top">0.446471</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F8">
<label>Figure 8</label>
<caption>
<p>Proportion /l/ word responses for the [n]&#8594;[l] merger by continuum step. Step 1 of the continuum is a canonical [l] pronunciation and Step 13 is a canonical [n] pronunciation. The dashed blue line and triangles represent the data of the younger listeners, while older listeners are shown in solid red lines and circles. The error bars show standard error.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g8.png"/>
</fig>
<p>To explore the individual differences in the discreteness of these lexical items, individual values were quantified using the by-subject contrast coefficient slope (CCS), following the methods provided by Casillas (<xref ref-type="bibr" rid="B12">2021</xref>). This value, which can be taken as a measure of category &#8216;crispness&#8217; (<xref ref-type="bibr" rid="B55">Morrison, 2007</xref>; <xref ref-type="bibr" rid="B13">Casillas &amp; Simonet, 2016</xref>), is an estimate of the slope of each individual&#8217;s sigmoid at its steepest point. More extreme values indicate more crisp or discrete perceptual categories, while values closer to zero are taken to indicate a less discrete contrast, which we interpret as evidence of a merger in this case. As a concept, this crispness value acknowledges that phonological categories can exist on a gradient in terms of their categorical contrasts (see, for example, <xref ref-type="bibr" rid="B27">Hall, 2013</xref>). These category crispness values were estimated by fitting new logistic regression models with continuum Step (centered and scaled) as the only fixed effect. The left panel in <xref ref-type="fig" rid="F9">Figure 9</xref> shows the individual values for the [n] and [l] contrast. There are two extreme outliers with crispness values below &#8211;5; both of these individuals had nearly discrete response functions such that the first six steps of the continuum were categorized 100% of the time as /l/ and the last six steps of the continuum was categorized 100% of the time as /n/. This contrasts with the majority of participants who had more gradient sigmoid functions. The central tendency of the value was negative (Mean = &#8211;0.35, Median = &#8211;0.10) and the range was substantial (&#8211;5.33, 0.09).<xref ref-type="fn" rid="n6">6</xref> These data, visualized in more fine-grained detail in <xref ref-type="fig" rid="F10">Figure 10</xref>, suggest that at the individual level, the degree of category crispness varied considerably. Younger women represent the most variable demographic group, demonstrating a very flat distribution and including comparatively many individuals with crisper categorization; younger men show a similar pattern which is notably flatter than both older groups.</p>
<fig id="F9">
<label>Figure 9</label>
<caption>
<p>Histograms illustrating the distributions of category crispness values for the three continua.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g9.png"/>
</fig>
<fig id="F10">
<label>Figure 10</label>
<caption>
<p>Histograms illustrating the distribution of individual category crispness scores for the [n]&#8594;[l] merger continuum; only crispness scores between &#8211;0.6 to 0.3 are presented to show the variation amongst individuals excluding the two extreme outliers. Density plots are underlaid to represent the distribution per demographic group. Women (top) and men (bottom) are plotted separately while older speakers are colored red and younger speakers colored blue.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g10.png"/>
</fig>
</sec>
<sec>
<title>3.2.2. [&#331;&#809;]&#8594;[m&#809;] merger</title>
<p>To analyze the [&#331;&#809;]&#8594;[m&#809;] merger, null responses were removed, accounting for just over 2% of the data. The remaining data (n = 3248) were fit to a logistic mixed effects regression model predicting the likelihood of a historical /m/ word response (/m&#809;/ = 1, /&#331;&#809;/ =0). All other model specifications were the same as [n]&#8594;[l].</p>
<p>The model output is reported in <xref ref-type="table" rid="T10">Table 10</xref>, and results are visualized in <xref ref-type="fig" rid="F11">Figure 11</xref>. There was a significant intercept, indicating that /m&#809;/ responses were more likely, and there were main effects of Step, a two-way interaction between Age and Gender, and a three-way interaction between Step, Age, and Gender. Older men (M = 0.64) and younger women (M = 0.62) were more likely than younger men (M = 0.54) and older women (M = 0.49) to categorize the continuum as /m&#809;/. These demographic variables interact with step, and <xref ref-type="fig" rid="F11">Figure 11</xref> makes clear that the perceptual changes by step are much smaller than those for the [n]&#8594;[l] merger. The word categorization functions suggest that all listener groups are quite advanced in the merger, with younger women presenting the biggest distinction between /m&#809;/ and /&#331;&#809;/.</p>
<table-wrap id="T10">
<label>Table 10</label>
<caption>
<p>Model output for the [&#331;&#809;]&#8594;[m&#809;] merger in perception. Significant factors (<italic>&#945;</italic> level = 0.05) are bolded.</p>
</caption>
<table>
<thead>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top"><bold>B</bold></td>
<td align="left" valign="top"><bold>Standard Error</bold></td>
<td align="left" valign="top"><bold><italic>z-</italic>value</bold></td>
<td align="left" valign="top"><bold><italic>p-</italic>value</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Intercept</bold></td>
<td align="left" valign="top"><bold>0.41768</bold></td>
<td align="left" valign="top"><bold>0.13979</bold></td>
<td align="left" valign="top"><bold>2.988</bold></td>
<td align="left" valign="top"><bold>0.002809</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Step (centered)</bold></td>
<td align="left" valign="top"><bold>&#8211;0.40383</bold></td>
<td align="left" valign="top"><bold>0.10457</bold></td>
<td align="left" valign="top"><bold>&#8211;3.862</bold></td>
<td align="left" valign="top"><bold>0.000113</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Age Group</td>
<td align="left" valign="top">&#8211;0.05142</td>
<td align="left" valign="top">0.13962</td>
<td align="left" valign="top">&#8211;0.368</td>
<td align="left" valign="top">0.712672</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Gender</td>
<td align="left" valign="top">0.0954</td>
<td align="left" valign="top">0.13971</td>
<td align="left" valign="top">0.683</td>
<td align="left" valign="top">0.494698</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Step : Age Group</td>
<td align="left" valign="top">0.15525</td>
<td align="left" valign="top">0.10419</td>
<td align="left" valign="top">1.49</td>
<td align="left" valign="top">0.136216</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Step : Gender</td>
<td align="left" valign="top">0.13829</td>
<td align="left" valign="top">0.10425</td>
<td align="left" valign="top">1.327</td>
<td align="left" valign="top">0.184662</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Age Group : Gender</bold></td>
<td align="left" valign="top"><bold>0.36936</bold></td>
<td align="left" valign="top"><bold>0.13979</bold></td>
<td align="left" valign="top"><bold>2.642</bold></td>
<td align="left" valign="top"><bold>0.008239</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Step : Age Group : Gender</bold></td>
<td align="left" valign="top"><bold>&#8211;0.2807</bold></td>
<td align="left" valign="top"><bold>0.10455</bold></td>
<td align="left" valign="top"><bold>&#8211;2.685</bold></td>
<td align="left" valign="top"><bold>0.007257</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F11">
<label>Figure 11</label>
<caption>
<p>Proportion /m&#809;/ word responses for the [&#331;&#809;]&#8594;[m&#809;] merger by continuum step with different panels for male and female listeners. Step 1 of the continuum is a canonical [m&#809;] pronunciation and Step 13 is a canonical [&#331;&#809;] pronunciation. The dashed blue line and triangles represent the data of the younger listeners, while older listeners are shown in solid red lines and circles. The error bars show standard error.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g11.png"/>
</fig>
<p>Individual differences in recognition performance for [&#331;&#809;]&#8594;[m&#809;] were also analyzed in terms of category crispness. The middle panel in <xref ref-type="fig" rid="F9">Figure 9</xref> presents the distribution of these values. The range of values is attenuated compared to [n]&#8594;[l] (&#8211;0.23, 0.08), and the measures of central tendency are lower (Mean = &#8211;0.03, Median = &#8211;0.014), indicating that the contrast is overall less crisp for /&#331;&#809;/ and /m&#809;/. <xref ref-type="fig" rid="F12">Figure 12</xref> additionally reveals that while most demographic groups accordingly present distributions sharply peaking near zero, younger women as a group present a flatter distribution with increased variability towards higher crispness values. The majority of values are negative, however, indicating consistency in the direction of the category distinction that does exist.</p>
<fig id="F12">
<label>Figure 12</label>
<caption>
<p>Histograms illustrating the distribution of individual category crispness scores for the [&#331;&#809;]&#8594;[m&#809;] merger continuum. Density plots are underlaid to represent the distribution per demographic group. Women (top) and men (bottom) are plotted separately while older listeners are colored red and younger listeners colored blue.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g12.png"/>
</fig>
</sec>
<sec>
<title>3.2.3. [&#331;-]&#8596;&#216; merger</title>
<p>Lastly, the responses to the [&#331;]&#8596;&#216; continua were analyzed in a similar manner. Null responses were removed, accounting for just over 1% of the data. The remaining data (n = 3277) were fit to a logistic mixed effects regression model predicting the likelihood of a historical vowel-initial word response (&#216; = 1, /&#331;/ =0). All other model specifications were the same as above.</p>
<p>The model output is reported in <xref ref-type="table" rid="T11">Table 11</xref>, and the results are visualized in <xref ref-type="fig" rid="F13">Figure 13</xref>. There was a significant intercept, indicating that [&#331;] responses were more likely, and there was a main effect of Step. As the two-way interaction between Step and Age (<italic>p</italic> = 0.05912) approached the threshold for statistical significance, we include it in <xref ref-type="fig" rid="F13">Figure 13</xref> for consideration. Younger listeners trended towards more categoricity in their response patterns.</p>
<table-wrap id="T11">
<label>Table 11</label>
<caption>
<p>Model output for the [&#331;]&#8596;&#216;-initial model in perception. Significant factors (<italic>&#945;</italic> level = 0.05) are bolded.</p>
</caption>
<table>
<thead>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top"><bold>B</bold></td>
<td align="left" valign="top"><bold>Standard Error</bold></td>
<td align="left" valign="top"><bold><italic>z-</italic>value</bold></td>
<td align="left" valign="top"><bold><italic>p-</italic>value</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Intercept</bold></td>
<td align="left" valign="top"><bold>0.21985</bold></td>
<td align="left" valign="top"><bold>0.09718</bold></td>
<td align="left" valign="top"><bold>2.262</bold></td>
<td align="left" valign="top"><bold>0.02368</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Step (centered)</bold></td>
<td align="left" valign="top"><bold>0.35784</bold></td>
<td align="left" valign="top"><bold>0.13257</bold></td>
<td align="left" valign="top"><bold>2.699</bold></td>
<td align="left" valign="top"><bold>0.00695</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Age Group</td>
<td align="left" valign="top">&#8211;0.06238</td>
<td align="left" valign="top">0.0971</td>
<td align="left" valign="top">&#8211;0.642</td>
<td align="left" valign="top">0.52062</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Gender</td>
<td align="left" valign="top">&#8211;0.04629</td>
<td align="left" valign="top">0.09708</td>
<td align="left" valign="top">&#8211;0.477</td>
<td align="left" valign="top">0.63344</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Step : Age Group</td>
<td align="left" valign="top">&#8211;0.25023</td>
<td align="left" valign="top">0.13259</td>
<td align="left" valign="top">&#8211;1.887</td>
<td align="left" valign="top">0.05912</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Step : Gender</td>
<td align="left" valign="top">&#8211;0.16481</td>
<td align="left" valign="top">0.13245</td>
<td align="left" valign="top">&#8211;1.244</td>
<td align="left" valign="top">0.21341</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Age Group : Gender</td>
<td align="left" valign="top">0.07286</td>
<td align="left" valign="top">0.09704</td>
<td align="left" valign="top">0.751</td>
<td align="left" valign="top">0.45271</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Step : Age Group : Gender</td>
<td align="left" valign="top">0.12243</td>
<td align="left" valign="top">0.13234</td>
<td align="left" valign="top">0.925</td>
<td align="left" valign="top">0.3549</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F13">
<label>Figure 13</label>
<caption>
<p>Proportion vowel-initial word responses for the [&#331;]&#8596;&#216;-initial merger by continuum step. Step 1 of the continuum is a canonical vowel-initial pronunciation and Step 13 is a canonical [&#331;] onset pronunciation. The dashed blue line and triangles represent the data of the younger listeners, while older listeners are shown in solid red lines and circles. The error bars show standard error.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g13.png"/>
</fig>
<p>Individual differences for [&#331;]&#8596;&#216; were also quantified in terms of category crispness. While the mean (Mean = 0.029) and median (Median = 0.009) both are positive, the range of values (&#8211;0.22, 0.31) spans 0 to a greater degree than those for the previous two continua. Individuals&#8217; data for these continua are shown in the right panel of <xref ref-type="fig" rid="F9">Figure 9</xref>. Not only do the values for [&#331;]&#8596;&#216; indicate that listeners are not at all crisp in their category boundary, the varying signs of the crispness value indicates that individuals differ in the direction of their categorization of these items. That is, individuals have different mappings of which lexical item is vowel-initial and which is [&#331;]-initial. <xref ref-type="fig" rid="F14">Figure 14</xref> indicates that younger women again stand out with more distributed scores, particularly including a slight skew towards crisper positive values. While younger men show a strongly peaked distribution centered on zero, there were also a few individuals who look rather different in showing relatively crisp categorization.</p>
<fig id="F14">
<label>Figure 14</label>
<caption>
<p>Histograms illustrating the distribution of individual category crispness scores for the [&#331;]&#8596;&#216;-initial merger continuum. Density plots are underlaid to represent the distribution per demographic group. Women (top) and men (bottom) are plotted separately while older listeners are colored red and younger listeners colored blue.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g14.png"/>
</fig>
</sec>
<sec>
<title>3.2.4. Summary of perception</title>
<p>Group-level perceptual results indicate that each merger is characterized by different patterns, but unlike production, age appears to be a relevant factor. Simplifying slightly, younger listeners were generally more categorical: significantly so for [n]&#8594;[l], a trending effect for [&#331;]&#8596;&#216;, and a case where younger women alone appear more categorical for [&#331;&#809;]&#8594;[m&#809;]. In other words, younger listeners were often less merged in recognition than older participants. Individual category crispness values further demonstrate that individuals varied from no differentiation between lexical items (present for all mergers) to fully categorical (for /n/ and /l/). Across mergers, the /n/ and /l/ categories were the most discrete, followed by /m&#809;/ and /&#331;&#809;/. The [&#331;] and null-initial lexical items elicited the least discrete patterns, and the +/&#8211; signs of the crispness values further indicate that listeners categorize these items in opposing directions.</p>
</sec>
</sec>
<sec>
<title>3.3. Production-Perception</title>
<p>To understand the relationship between perception and production at the individual level for the three mergers under investigation, we quantified the degree of mergedness for perception and production. Mergers in perception were quantified using the absolute value of the by-subject crispness values described above. Higher values indicate more discrete perceptual categories, while lower values are taken to indicate a merger of perceptual categories.</p>
<p>Mergers in production were quantified as the absolute value of the difference between proportions of the more novel pronunciation (i.e., [l], [m&#809;], null) for historical /l/, /m&#809;/, and null-onsets words and proportions of the more novel pronunciation for the historical /n/, /&#331;&#809;/, and [&#331;]-onset words. This quantification means that individuals who produce, for example, a full merger 100% of the time (e.g., 100% [l] for historical /l/ and /n/ words) will have an equivalent merger score as those who exclusively show hypercorrection (e.g., 0% [l] for historical /n/ and /l/ words; thus 100% [n] for both lexical classes) and those who exhibit a variable mix of pronunciations for both lexical sets (e.g., 50% [l] for historical /n/ and /l/ words). Crucially, however, participants of these types are demonstrating a lack of a reliable difference in pronunciation variants for the lexical sets, which we take as an indication of a merger of these categories at the lexical level.</p>
<p>Importantly, for the production-perception analysis, we removed the two perceptual crispness outliers (see <xref ref-type="fig" rid="F9">Figure 9</xref>), leaving 47 individuals for consideration. We did so because, first, their values were so exceptionally different (values less than &#8211;5 compared to an overall CCS median of &#8211;0.021 and IQR of 0.081) that it was neither possible to conduct any reasonable comparison between them and the remaining data, nor within the remaining data due to the expanded range. Second, based on visual inspection of individual response curves, the extreme jump in numerical value appears not to be linearly reflective of actual change in categoricity given that the next highest values within a reasonable range (approximately 0.6) looked similar and highly categorical;<xref ref-type="fn" rid="n7">7</xref> as a result, although treating the next highest values as maximum crispness is a simplification, using them to represent strongly categorical perceptual responses is a fair representation of the data and exclusion of the outliers is not expected to alter conclusions.</p>
<p><xref ref-type="fig" rid="F15">Figure 15</xref> visualizes the individual patterns for merger in perception and production for the three mergers. Merger in production is on the x-axis. For ease of interpretation, the x-axis is reversed in these visualizations to represent the time course of merger, which proceeds from 100% contrast (no merger) to 0% contrast (full merger). Merger in perception is on the y-axis, represented by absolute category crispness scaled to a range of zero to one.<xref ref-type="fn" rid="n8">8</xref> Low values along either dimension indicate a stronger merger, with values of zero indicating a complete merger of linguistic categories. A dashed line is included to represent the hypothetical fitted line if production and perception were perfectly correlated, assuming, as a simplification, the maximum value across all three mergers as maximum categoricity.</p>
<fig id="F15">
<label>Figure 15</label>
<caption>
<p>Degree of merger in perception (y-axis) versus production (x-axis) for the three mergers. Merger in perception is the absolute value of category crispness and the merger in production is quantified as the absolute value of the difference in the proportions of the novel pronunciation for the two lexical sets for each merger. The range of the x-axis is from 1 (fully contrastive) to 0 (fully merged) to reflect the time course of change. The solid lines represent fitted lines to the data while the dashed lines represent a hypothetical fitted line if production and perception were perfectly correlated.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g15.png"/>
</fig>
<p>These measures of merger in perception and production are moderately correlated for the [n]&#8594;[l] merger [t(47) = 4.10, <italic>r</italic> = 0.51, <italic>p</italic> = 0.0002]<xref ref-type="fn" rid="n9">9</xref> and [&#331;&#809;]&#8594;[m&#809;] merger [t(47) = 4.13, <italic>r</italic> = 0.52, <italic>p</italic> = 0.001]. These positive correlations are represented as negative in the plots due to x-axis reversal, but indicate the degree to which perception and production move in tandem. There was no relationship between merger in perception and production for the [&#331;]&#8596;&#216; merger [t(47) = &#8211;0.06, <italic>r</italic> = &#8211;0.009, <italic>p</italic> = 0.95]. The regression lines in the panels present these relationships. Note that the y-axes on these figures are uniform to illustrate the different distributions of crispness values, and this makes the strength of the correlation for [&#331;&#809;]&#8594;[m&#809;] look substantially less strong than for [n]&#8594;[l], which is not the case numerically.</p>
<p>To assess the direction of misalignment within an individual, we calculate the difference between scaled production and perception values per merger (DiffPP, following <xref ref-type="bibr" rid="B60">Pinget et al., 2020</xref>). Zero indicates completely aligned production and perception, regardless of whether individuals are fully merged, fully contrastive, or at an intermediate value. Scores above zero represent individuals whose production is more contrastive (less merged) than perception while scores below zero represent individuals whose perception is more contrastive (less merged) than production. Visualized in <xref ref-type="fig" rid="F16">Figure 16</xref>, this provides another way to look at the data, in essence representing the direction and distance of each individual from the hypothetical perfect correlation line in <xref ref-type="fig" rid="F15">Figure 15</xref>.</p>
<fig id="F16">
<label>Figure 16</label>
<caption>
<p>DiffPP scores, calculated as the difference of production and perception measures (y-axis) versus production (x-axis, reversed) for the three mergers. DiffPP scores above zero represent individuals whose production is more contrastive (less merged) than perception while scores below zero represent individuals whose perception is more contrastive (less merged) than production.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g16.png"/>
</fig>
<p>The left panel in <xref ref-type="fig" rid="F15">Figure 15</xref> illustrates the reliable correlation between merger in /n/ and /l/ production and perceptual category crispness in a forced choice task of word selection. Individuals who make smaller differences in their rates of [l] production for historical /l/ and /n/ categories are more likely to be merged in perception; this is seen as the clustering in the bottom right corner. At the same time, the subset of participants who show a difference in [l] pronunciation rates for historical /n/ and /l/ words are more likely to exhibit a crisper perceptual contrast. Nevertheless, the relationship between production and perception is far from perfect. As seen in the left panel of <xref ref-type="fig" rid="F16">Figure 16</xref>, many individuals show production-perception misalignment by falling above or below the zero line. A number of observations are notable. First, multiple individuals who produce little to no contrast (on the right side of the plot) fall below zero, demonstrating contrast in perception despite none in production. Conversely, misaligned individuals who produce some contrast (roughly greater than 0.2) tend to fall above zero, demonstrating more contrast in production than perception. In other words, perception appears to lead production in merger, but when merger is (nearly) complete in production, perception appears to lag (i.e., retain contrast). Finally, age appears to play a role such that younger individuals appear more likely to show lower DiffPP scores relative to older individuals with similar production patterns. In particular, three younger women defy the general trend, showing intermediate production contrast but with comparatively greater perceptual categoricity than production.</p>
<p>The middle panel in <xref ref-type="fig" rid="F15">Figure 15</xref> illustrates the reliable correlation between merger in /m&#809;/ and /&#331;&#809;/ production and perceptual category crispness in a forced choice task of word selection. Specifically, individuals who exhibit a smaller difference in their rates of production of the more innovative [m&#809;] for historically /m&#809;/ and /&#331;&#809;/ words are more likely to be merged in perception, while those producing larger contrasts were more likely to show crisper perceptual distinction. The distribution of perceptual crispness values is contracted compared to those values for the [n]&#8594;[l] merger, indicating that, overall, this contrast is less perceptually robust.<xref ref-type="fn" rid="n10">10</xref> Again, while there is a significant correlation, the relationship is not isomorphic: There is substantial variability in the alignment of production of these variants and listeners&#8217; ability to systematically distinguish the lexical items by historical category in perception. Taking the maximum perceptual crispness value for [n]&#8594;[l] as the upper limit,<xref ref-type="fn" rid="n11">11</xref> the overall pattern in the middle panel of <xref ref-type="fig" rid="F16">Figure 16</xref> is that individuals who are not fully merged show more contrastiveness in production than perception. Nearly all participants fall roughly at or above zero; a few individuals do show a small degree of misalignment towards perception, but otherwise, if not aligned, perception leads production. In addition, younger participants tend to show lower DiffPP scores, indicating less alignment skew towards production contrast.</p>
<p>Lastly, the third panel of <xref ref-type="fig" rid="F15">Figure 15</xref> illustrates the lack of correlation between the merger of historically [&#331;] and vowel-initial items in production and perceptual category crispness in a forced choice task of word selection (of <italic>aak1</italic> &#8216;shake&#8217; and <italic>ngaak1</italic> &#8216;deceive,&#8217; specifically). The range of perceptual crispness values are similar to those for [&#331;&#809;]&#8594;[m&#809;], but the production range is more limited. This reaffirms the production results where individuals either mainly produce [&#331;] or null-initial. In <xref ref-type="fig" rid="F16">Figure 16</xref>, the vast majority of participants merge [&#331;] and null-initial in both production and perception but those who do not follow a similar pattern to [n]&#8594;[l]: Those who produce little to no contrast fall below zero (perception lags behind production) while misaligned individuals who produce some contrast (roughly greater than 0.2) fall above zero (perception leads production). Further, there appears to be an age-based pattern whereby younger individuals make up the majority of production-merged individuals who fall below zero, showing a contrast in perception despite none in production.</p>
<sec>
<title>3.3.1. Summary of production-perception</title>
<p>Production-perception analyses, which correlate an individual&#8217;s degree of merger in production to their degree of merger in perception, reveal a mixed bag of results. We find moderate production-perception correlations for [n]&#8594;[l] and [&#331;&#809;]&#8594;[m&#809;] despite different degrees of overall mergedness, but no correlation for [&#331;]&#8596;&#216;. At the same time, while all mergers reveal a similar overall trend of misalignment such that if not aligned, production remains more contrastive than perception, [n]&#8594;[l] and [&#331;]&#8596;&#216; show an additional pattern where individuals with merged production show contrast in perception despite little to none in production. Finally, age-based trends suggest that younger individuals are less misaligned than older individuals in the direction of production contrast but more misaligned in the direction of perception contrast.</p>
</sec>
</sec>
</sec>
<sec>
<title>4. Discussion</title>
<p>The two aims of the study were to (1) test generalizations of the production-perception link and (2) describe apparent-time patterning of the merger variants in production and perception. We first discuss the descriptive data relative to previous documentation of these mergers (Sections 4.1 and 4.2). As this study was not designed to provide a definitive or comprehensive overview of the completion status of the mergers, we discuss various possible interpretations consistent with the data but do not diagnose the situation further. Then, we discuss the production-perception results situated in the sound change literature (Sections 4.3 and 4.4). <xref ref-type="fig" rid="T12">Tables 12</xref>, <xref ref-type="fig" rid="T13">13</xref>, <xref ref-type="fig" rid="T14">14</xref> summarize the full set of findings for each merger.</p>
<fig id="T12">
<label>Table 12</label>
<caption>
<p>Summary of results for <italic>[n]&#8594;[l]</italic>. Individual productions patterns (n=49) are described based on their proportion of [l] for /n/ as either <italic>innovative</italic> (&gt;0.75), <italic>variable</italic> (0.75&#8211;0.25), or <italic>conservative</italic> (&lt;0.25); if proportion of [l] for /l/ was under 0.5, individuals were instead described as <italic>hypercorrective</italic>. Individual perception patterns (n=51) are described based on scaled category crispness scores as <italic>merged</italic> (&lt;0.25), <italic>intermediate</italic> (0.25&#8211;0.75), or <italic>categorical</italic> (&gt;0.75).</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g17.png"/>
</fig>
<fig id="T13">
<label>Table 13</label>
<caption>
<p>Summary of results for <italic>[&#331;&#809;]&#8594;[m&#809;]</italic>. Individual productions patterns (n=49) are described based on their proportion of [m&#809;] for /&#331;&#809;/ as either <italic>innovative</italic> (&gt;0.75), <italic>variable</italic> (0.75&#8211;0.25), or <italic>conservative</italic> (&lt;0.25); if proportion of [m&#809;] for /m&#809;/ was under 0.5, individuals were instead described as <italic>hypercorrective</italic>. Individual perception patterns (n=51) are described based on scaled category crispness scores as <italic>merged</italic> (&lt;0.25), <italic>intermediate</italic> (0.25&#8211;0.75), or <italic>categorical</italic> (&gt;0.75).</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g18.png"/>
</fig>
<fig id="T14">
<label>Table 14</label>
<caption>
<p>Summary of results for <italic>[&#331;]&#8596;&#216;</italic>. Individual productions patterns (n=49) are described based on their proportion of &#216; for /&#331;/ as either <italic>innovative</italic> (&gt;0.75), <italic>variable</italic> (0.75&#8211;0.25), or <italic>conservative</italic> (&lt;0.25); if proportion of &#216; for &#216; was under 0.5, individuals were instead described as <italic>hypercorrective</italic>. Individual perception patterns (n=51) are described based on scaled category crispness scores as <italic>merged</italic> (&lt;0.25), <italic>intermediate</italic> (0.25&#8211;0.75), or <italic>categorical</italic> (&gt;0.75).</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="labphon-13-6461-g19.png"/>
</fig>
<sec>
<title>4.1. Past versus present in production</title>
<p>In To et al. (<xref ref-type="bibr" rid="B71">2015</xref>), the [n]&#8594;[l] merger appeared near-complete in both adults and children (mean rates of [l] for /n/ around 94%), but in the current data, /n/ is realized as [l] between 54% (younger women) to 71% (older women) of the time. Within each group, speakers also varied considerably as to whether they realized /n/ as [l], ranging from 0% to 100%&#8212;in fact, 13 of the 49 speakers were fully merged. Further, no age or gender pattern was evident for speakers who were mainly contrastive. This suggests that the linguistic situation is more complicated than a simple story of change-in-progress where the contrast is maintained by the older generation. In addition to variation in /n/, there was variation in the amount of [l] produced for /l/, which has been generally ignored in previous literature due to the assumption that /l/ was not changing. That some speakers produce a significant amount of [n] for /l/ (up to 100% in one speaker&#8217;s case) confirms that [l]&#8594;[n] &#8216;hypercorrection&#8217; does exist, contrary to Bourgerie&#8217;s (<xref ref-type="bibr" rid="B11">1990</xref>) assertion. For at least these individuals, it seems that the social meaning of &#8216;properness&#8217; indexed to [n] (<xref ref-type="bibr" rid="B59">Pan, 1981</xref>) can be applied to both historical /n/ and /l/. Altogether, this suggests that the two variants [l] and [n] are dually-mapped to the same lexical items regardless of historical lexical class (<xref ref-type="bibr" rid="B62">Samuel &amp; Larraza, 2015</xref>); this is indicative of a merger at the lexical level (<xref ref-type="bibr" rid="B67">Soo &amp; Babel, 2020</xref>). Listeners have implicit knowledge&#8212;and possibly explicit awareness, given the pronunciation campaigns and social meaning attributed to the variants&#8212;that /n/ and /l/ are merged, but they lack the knowledge of which items were historically /l/, allowing for both [n] and [l] pronunciation variants for both lexical classes.</p>
<p>The [&#331;&#809;]&#8594;[m&#809;] merger was likewise reported by To et al. (<xref ref-type="bibr" rid="B71">2015</xref>) to be near-complete. While older adults (particularly men) show comparable or slightly decreased production of [m&#809;] for /&#331;&#809;/ in the current data (70&#8211;100% now versus 91&#8211;100% before), the younger cohort shows clearly decreased [m&#809;] usage roughly 15 years later (50&#8211;75% now versus 97&#8211;100% before). While interactions with age were not statistically significant, this result seems linked to trending differences in historical category merger across demographic groups: Nearly all older men are fully merged (92.6% [m&#809;] for /&#331;&#809;/, 98.8% [m&#809;] for /m&#809;/), many older women (71.6% [m&#809;] for /&#331;&#809;/, 90.5% [m&#809;] for /m&#809;/) and younger men (77.6% [m&#809;] for /&#331;&#809;/, 97.8% [m&#809;] for /m&#809;/) are relatively merged, but a sizable number of younger women make moderate-to-large contrasts (50.4% [m&#809;] for /&#331;&#809;/, 82.9% [m&#809;] for /m&#809;/). We additionally see gender differences in the preferred variant influenced by some women using high rates of both hypercorrective and conservative [&#331;&#809;]. Again, this behaviour indicates dual-mapping of [&#331;&#809;] to both historical /&#331;&#809;/ and /m&#809;/ lexical items, implying a certain degree of lexical merger.</p>
<p>For the [&#331;]&#8596;&#216; merger, children in To et al. (<xref ref-type="bibr" rid="B71">2015</xref>) produced more null onsets for both historical [&#331;] and null-initial (65.0% and 94.2%) while adults used the historical variants more often (37.5% and 68.7%). Yet, rather than a continuing trend of increased null, the current findings indicate that all groups produce <italic>less</italic> null-initial for both historical categories, at mean rates below 33.6% null (i.e., above 66.4% [&#331;]). Individual patterns show that there is large variation in behaviour of merging and choice of variant that, to a certain degree, pattern relative to age. Descriptively, those who show a higher degree of merger or are merged to null-initial are more likely younger, while those who maintain distinction between lexical classes are more likely older. Thus, while merger of the [&#331;]- and &#216;-initial lexical classes may indeed have progressed, variant preference seems to have reversed dramatically.</p>
<sec>
<title>4.1.1. What has changed?</title>
<p>Several factors could plausibly lead to the discrepancies between current results and prior predictions about the trajectory of these consonant mergers. We consider here four possible sources: methodological differences, style-shifting, age-grading, and community-wide change.</p>
<p>In terms of methodology, we used several words per merger pair as opposed to the single word used in To et al. (<xref ref-type="bibr" rid="B71">2015</xref>). Rather than representing a true difference in norms over time, the larger set of lexical items could have simply provided a more accurate depiction of variant usage across the lexicon. However, by-word examination of the current results rules out this explanation: Production of the same words used in To et al. (<xref ref-type="bibr" rid="B71">2015</xref>) demonstrates consistently lower rates of [l], [m&#809;], and null-initial.<xref ref-type="fn" rid="n12">12</xref> Specifically, the group rates of [l] for <italic>naam4</italic> &#8216;male&#8217; are similar to the aggregate values, ranging from 35.9% for young women to 74.1% for older men (compared to 94%). Similarly, rates of [m&#809;] for <italic>ng5</italic> &#8216;five&#8217; range from 51.2% to 88.9% (compared to 95&#8211;98%) while rates of null-initial for <italic>ngau4</italic> &#8216;cow&#8217; range from 2.8% to 22.2% (compared to 37&#8211;65%).</p>
<p>Another methodological difference between our study and the previous offers the possibility that speakers may have been style-shifting to a formal register in the current data given our lab-based reading task, but would have produced more innovative variants in a more casual setting without the influence of orthography (e.g., the picture naming task in <xref ref-type="bibr" rid="B71">To et al., 2015</xref>). This interpretation could be congruent with the across-the-board increases in [n] and [&#331;]. Given the prescriptivist norms in the local context, task-based style-shifting is plausible and cannot be ruled out as an explanation for our diverging results (a point we return to below).</p>
<p>A third possible scenario is that of age-grading<xref ref-type="fn" rid="n13">13</xref> due to linguistic marketplace pressure, wherein individuals change their production patterns over different life stages, linked to the social value of &#8216;standard&#8217; variants (for reviews of age-grading and lifespan changes, see <xref ref-type="bibr" rid="B73">Wagner, 2012</xref>; <xref ref-type="bibr" rid="B15">Cheshire, 2006</xref>). If age-preferential language use were the explanation in the current data, the generation studied in To et al. (<xref ref-type="bibr" rid="B71">2015</xref>) as children may have begun to use more of the &#8216;formal&#8217; prestige variants, namely [n] and [&#331;], now that they are young adults and likely in more professional settings. Like style-shifting, this could explain the increase in conservative variants in the younger generation across all three mergers. On the other hand, interpretation of the results as age-grading is less clearly motivated for older adults based on these data. For one, the adult group in To et al. (<xref ref-type="bibr" rid="B71">2015</xref>) cannot be directly matched to our older adult sample, and two, it is not fully clear what an age-grading explanation would predict for this generation, which spans middle-aged to retirement-aged individuals. Future data collection specifically targeting age-grading will be necessary to adjudicate such an interpretation.</p>
<p>Lastly, production norms may have changed due to reversal of the phonological merger in younger generations. Given that older men appear to be merging [&#331;&#809;]&#8594;[m&#809;] at a similar rate as previously reported, the current results could be consistent with an incipient reversal. For [l]&#8594;[n] and [&#331;]&#8596;&#216;, however, younger speakers did not make a larger contrast relative to the older generation. Perhaps these age-undifferentiated results reflect a community-wide change of language ideologies due to increase in awareness of the &#8216;proper&#8217; social meaning of [n] and [&#331;] since the advent of the &#8216;proper pronunciation&#8217; campaigns in the late 2000s. Notably these events occurred after To et al. (<xref ref-type="bibr" rid="B71">2015</xref>) conducted their study. This social shift may have led to more prestige-related style-shifting in the present day as compared to the past.<xref ref-type="fn" rid="n14">14</xref></p>
<p>If changes in production were indeed driven by an ideological shift, the age-related results for [&#331;&#809;]&#8594;[m&#809;] could be seen as representing an effect of register that is particularly pronounced in younger speakers; that is, the social association of [&#331;&#809;] with &#8216;properness&#8217; may not exist as strongly for older speakers (or, alternatively, older speakers may be less invested in abiding to newer social norms). Moreover, women were more likely to use high rates of [&#331;&#809;] for both categories, including historical /m&#809;/. Given that women have been suggested to (a) be more aware of social stigma and (b) use more prestige variants (e.g., <xref ref-type="bibr" rid="B43">Labov, 1990</xref>), this is compatible with the interpretation that prestige motivates the maintenance of contrast or overall increased use of the conservative variant. The sharp shift in [&#331;]&#8596;&#216; norms from majority null-initial in To et al. (<xref ref-type="bibr" rid="B71">2015</xref>) to majority [&#331;] in the current study could also stem from drastically increased salience of the social meaning of [&#331;] as &#8216;proper,&#8217; aligning with Chen&#8217;s (<xref ref-type="bibr" rid="B14">2018</xref>) assertion that null-initial is particularly stigmatized as &#8216;lazy.&#8217;</p>
<p>It is clear that, to fully understand the patterns found in these data, speech style and attitudes must be accounted for. Thus, limitations of the current study include a lack of consideration for multiple registers in the production task and attitudes about &#8216;lazy pronunciation.&#8217; Accounting for variable styles would have allowed for more conclusive interpretation of descriptive results, especially in comparison to previous literature. At the same time, while we were limited in our methodological choices due to constraints of the stimuli, the formal style elicited in our task does not invalidate our findings, including those of the production-perception analyses. Speakers have a repertoire at their disposal, and we tapped into a particular style of production, which may well have been influenced by our methodological choices; regardless, we need to account for the linguistic knowledge presented to us. Moreover, our choice of an isolated, context-free, single word production task accompanied by orthography is well matched with the perception task, which is characterized by all those features as well. This well-matched formality allows for interpretable (mis)alignment patterns across production and perception.</p>
<p>With respect to &#8216;lazy pronunciation,&#8217; although the exit interviews confirm that all participants knew what &#8216;lazy pronunciation&#8217; was, we did not assess awareness or attitudes for each particular merger. Attitudes can vary substantially, potentially influencing patterns of both production and perception. For example, although many participants characterized the mergers as &#8216;incorrect&#8217; and &#8216;lazy,&#8217; one young man expressed more neutral or positive sentiments, describing the pronunciations as &#8216;more convenient&#8217; (see also <xref ref-type="bibr" rid="B53">Lin, Yao, &amp; Luo, 2021</xref> on both positive and negative attitudes about the younger generation&#8217;s accent). Knowledge of the degree of stigmatization for each merger, along with individual attitudes towards the stigmatization, could help contextualize variation across individuals as well as any potential community-level reversals that are motivated by group orientation towards or away from the existing social meanings of the merger variants (see, e.g., <xref ref-type="bibr" rid="B46">Labov, Rosenfelder, &amp; Fruehwald, 2013</xref>; <xref ref-type="bibr" rid="B70">Tamminga, 2019</xref>). Continued research is needed to document whether merger reversals eventually emerge in production across the Hong Kong community; this work would benefit from comparison of multiple speech styles, including more naturalistic speech, as well as inclusion of awareness and attitude assessments to form a clearer picture of production norms across both age and context.</p>
</sec>
</sec>
<sec>
<title>4.2. Perception versus production in the community</title>
<p>Unlike production, a general theme in perception is that younger participants, particularly women, responded to historical lexical classes more categorically than older participants, though the degree of categoricity was objectively low for mergers involving [&#331;].<xref ref-type="fn" rid="n15">15</xref> Considering that age effects are not robustly supported by evidence in production, how can the perceptual age effects be interpreted? Younger listeners have had more exposure to the historical contrasts due to growing up with &#8216;proper pronunciation&#8217; prescriptivism and language contact with Mandarin. One possibility is that, due to these experiences, they may have adapted to flexibly recognize the contrast in perception, regardless of whether they make a contrast between the two in their own productions (see <xref ref-type="bibr" rid="B62">Samuel &amp; Larraza, 2015</xref> on adaptation to variable pronunciations or &#8216;errors&#8217;; <xref ref-type="bibr" rid="B69">Sumner &amp; Samuel, 2009</xref> on covert experience with pronunciation variants on word recognition timing). Cognitively, this would be a dual-mapping of two pronunciation variants to a single phonological or lexical category for a style or register difference. This accommodation to phonetic variation is akin to interpretations of late-stage sound changes where perception &#8216;lags&#8217; behind production due to the more advanced (e.g., younger) listeners&#8217; need to maintain these representations to cope with speech produced by less advanced (e.g., older) speakers (see <xref ref-type="bibr" rid="B60">Pinget et al., 2020</xref>; <xref ref-type="bibr" rid="B17">Coetzee et al., 2018</xref>). In other words, they are flexible in perception while showing stability in production.</p>
<p>Another possibility is that, rather than simply demonstrating perceptual flexibility, younger speaker-listeners are in fact beginning to reverse the mergers in their recognition of lexical items, prior to showing clear signs of reversal in production. That is, a phonological change across generations may truly be occurring, where perception leads production in the opposition direction to the historical change. As suggested earlier, this explanation could align with qualitative age-based trends found in [&#331;&#809;]&#8594;[m&#809;] production. Only future investigations on the continuing trajectory of these variants can shed light on these open questions.</p>
<p>An alternative explanation does not rely on listeners&#8217; decontextualized phonological representations but involves social representations construed from the model speaker&#8217;s voice. That is, differences may have arisen due to younger listeners perceiving the model speaker as a peer while older listeners perceived the same speaker as younger (<xref ref-type="bibr" rid="B32">Hay et al., 2006</xref>; <xref ref-type="bibr" rid="B40">Koops et al., 2008</xref>). Specifically, if there is an association between the younger generation and increased &#8216;lazy pronunciation&#8217; held by older speakers (see discussion in <xref ref-type="bibr" rid="B53">Lin et al., 2021</xref>), we might expect that younger listeners (without such an association) perceive the stimuli more veridically while older listeners perceive the stimuli with less distinction. This possibility cannot be teased apart from other possible interpretations and must be left to future research.</p>
</sec>
<sec>
<title>4.3. Individual evidence for the production-perception link</title>
<p>Moving on to the production-perception link, our individual-level production-perception results reveal a link between production and perception for two consonantal mergers, corroborating recent positive findings from the sound change literature. In particular, it allows us to more confidently generalize the findings from Pinget et al. (<xref ref-type="bibr" rid="B60">2020</xref>) that a production-perception link can be found for <italic>consonantal mergers</italic>, just like more frequently-studied types of vowel-related sound change. Given the rather different properties of mergers to shifting-type changes (e.g., loss of contrast), the consistency of detecting an individual-level production-perception relationship is interesting and strengthens the claim of an underlying mechanistic link between systems that drives sound change.</p>
<p>On the contrary, no production-perception correlation was found for the [&#331;]&#8596;&#216; merger. Since the other two merger pairs demonstrated a correlation, the type of change (i.e., merger rather than cue shifting, for example) cannot be the constraining factor, contributing evidence against the hypothesis that production-perception relationships vary by type of change. Still, it is possible that the specific nature of this change is relevant. That is, the other two mergers are &#8216;standard&#8217; contrast-loss mergers, unlike [&#331;]&#8596;&#216; which was originally allophonic and exhibited bidirectional merging. The nature of this change (including the changes in direction) may have contributed to the apparently unlinked representations. Given this, it may be beneficial for future research to consider more fine-grained variability among types of change and how it may influence the existence of a production-perception link. We additionally note that these findings could have resulted from the choice of perceptual stimuli and task, which involved lexical identification between items in a purported minimal pair that did not follow historical tone allophony patterns (i.e., involved an exception). With the current data, it is not possible to tease apart whether this null effect is due to the nature of this particular merger or aspects of the experiment. Future work would benefit from examining the [&#331;]&#8596;&#216; merger further in perception.</p>
</sec>
<sec>
<title>4.4. Direction of production-perception misalignment</title>
<p>Despite correlation differences, similar patterns of misalignment were uncovered across all three merger pairs. In general, individuals making a contrast in production showed comparatively less perceptual contrast, indicating that perceptual merger is more advanced than production merger. On the other hand, many individuals with nearly or fully merged production for [n]&#8594;[l] and [&#331;]&#8596;&#216;&#8212;if not &#8216;realigned&#8217; through full merger in production and perception&#8212;in fact show evidence of categoricity in perception; this suggests that at these late stages of production merger, perception is less advanced, or lagging. Given the data at hand, these results fall in line with previous reports of production-perception misalignment with regards to stage of change (<xref ref-type="bibr" rid="B60">Pinget et al., 2020</xref>; <xref ref-type="bibr" rid="B17">Coetzee et al., 2018</xref>). However, the switch from perception-led to production-led change was reported in Pinget et al. (<xref ref-type="bibr" rid="B60">2020</xref>) to be roughly at the halfway point of merger while here, only those who were nearly or fully merged exhibited production leading perception. Whether this difference relates to the fact that the present mergers appear to have reached a state of socially conditioned variation rather than continued merger or whether such a pattern is dependent on the particular situation requires further investigation.</p>
<p>In addition, younger participants on the whole tend towards less misalignment in the direction of production contrast and/or more in the direction of perceptual contrast. This confirms that community-level perceptual and production results, where younger individuals showed more contrast in perception, can be sourced to the individual level. A final observation is noteworthy. Unlike the other two mergers, few individuals showed misalignment in the direction of perceptual contrast for the [&#331;&#809;]&#8594;[m&#809;] merger, even for those merged in production. Why does this difference arise? While speculative, it seems conceivable that [&#331;&#809;]&#8594;[m&#809;] operates on a reduced perceptual scale due to the lack of acoustic-perceptual salience for nasals (<xref ref-type="bibr" rid="B28">Harnsberger, 2000</xref>; <xref ref-type="bibr" rid="B56">Narayan, 2008</xref>); if so, a different approach to assessing misalignment may be needed in future in work that attempts to compare across multiple cases of variation and change.</p>
</sec>
</sec>
<sec>
<title>5. Conclusion</title>
<p>This study investigates production and perception of the [n]&#8594;[l], [&#331;&#809;]&#8594;[m&#809;], and [&#331;]&#8596;&#216; mergers in Hong Kong Cantonese, on one hand documenting the present-day status of these long-standing sound changes and on the other hand extending hypotheses about the production-perception relationship in the context of sound change. Production results demonstrate substantial amounts of individual variability but a lack of clear age- or gender-related variation, indicating that the picture for each merger is less straightforward than continued or completed merger. Perceptual results find that younger speaker-listeners are less merged than the older generation, potentially due to adaptation to high phonetic variability. While this project did not collect the data necessary to fully diagnose the current stage of the mergers, the results do suggest that [n]&#8594;[l] and [&#331;]&#8596;&#216; no longer appear to be changes-in-progress, though incipient reversal is a possibility for [&#331;&#809;]&#8594;[m&#809;]; instead, usage of the formal variants appear inflated compared to previous records, suggesting that &#8216;proper pronunciation&#8217; ideologies are at play. Another possibility is that the mergers have stabilized into context-conditioned style-shifting, a scenario that Labov (<xref ref-type="bibr" rid="B45">2001</xref>) proposes may historically be more common than changes going to completion (p. 75). Future research can disentangle these possibilities.</p>
<p>A birds-eye view of our results on the production-perception link reveals that, on the whole, it parallels the literature: Some level of linkage evidently exists between production and perception systems, though it is not always strong and not always found. We show here that, in the same population using the same methodology, a production-perception correlation can be found moderately for two mergers yet not at all for another. Our finding of production-perception coupling for two additional consonant mergers specifically bolsters the conclusion that convergent results in the literature are generalizable beyond certain types of sound change. However, while we aimed to use comparable measures for production and perception, we also acknowledge that our experimental choices may still have impacted results. Ultimately, we do not solve the problem of methodological inconsistency, but we urge future research to take these issues into careful consideration.</p>
<p>Interestingly, when a correlation was detected, they were of very similar magnitude, even though these mergers are characterized by rather different profiles (e.g., functional load), trajectories, and perceptual salience. This direct comparison of magnitudes indicates that there is a modicum of consistency about the production-perception relationship across cases; notably, such a comparison is difficult to make across studies due to the variability in context, methodology, participants, and more (see <xref ref-type="bibr" rid="B60">Pinget et al., 2020</xref>, who nevertheless report that previous studies often find moderate correlations).</p>
<p>That production and perception systems constrain but do not determine each other is sensible as we expect flexibility in perception beyond variability exhibited in production (e.g., we understand and accept a much larger range of speech than we tend to produce; see <xref ref-type="bibr" rid="B62">Samuel &amp; Larraza, 2015</xref>). It cannot simply be that all exposure in perception affects production, as effects of exposure in stored representations are most likely mediated by encoding (see attention-weighting approaches to exemplar theoretic models of speech perception, <xref ref-type="bibr" rid="B68">Sumner et al., 2014</xref>; <xref ref-type="bibr" rid="B19">Drager &amp; Kirtley, 2016</xref>; <xref ref-type="bibr" rid="B34">Johnson, 1997</xref>). In the same way, production is mediated by other factors, such as speaker agency, and does not reflect purely linguistic aspects of experience. If this is the case, however, what is the expected extent of transfer between perception and production, and what cognitive factors influence the degree of coupling?</p>
<p>One route forward is to consider moving beyond a search for evidence on the existence of a production-perception link to a more nuanced examination of when and to what degree we are able to detect a relationship. In this vein, we encourage researchers to apply a controlled comparative approach with a larger range of case studies in order to paint a broader typology of sound change and the extent of coupling between production-perception systems. By focusing attention on theorizing why there would not be a link for any particular case, what mechanisms underlie such connections, and what regulates the strength of relationship across controlled groups, variables, or tasks, we can move closer to understanding the persistently mixed results of production-perception studies and the underlying cognitive processes.</p>
</sec>
<sec sec-type="supplementary-material">
<title>Additional File</title>
<p>The additional file for this article can be found as follows:</p>
<supplementary-material id="S1" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.16995/labphon.6461.s1">
<!--[<inline-supplementary-material xlink:title="local_file" xlink:href="labphon-13-6461-s1.pdf">labphon-13-6461-s1.pdf</inline-supplementary-material>]-->
<label>Appendix</label>
<caption>
<p>Word list for production task. DOI: <uri>https://doi.org/10.16995/labphon.6461.s1</uri></p>
</caption>
</supplementary-material>
</sec>
</body>
<back>
<fn-group>
<fn id="n1"><p>In a study on the frequencies of all Cantonese word initial sounds, Ng and Kwok (<xref ref-type="bibr" rid="B57">2004</xref>) do report that, comparing to an earlier survey from the 1970s (<xref ref-type="bibr" rid="B22">Fok, 1979</xref>), the overall occurrence of [&#331;-] dropped from 2.9% to 1.91%. This suggests that there has been a general decrease of [&#331;] in favor of &#216;.</p></fn>
<fn id="n2"><p>While including orthography in our tasks departs from methodology in previous research (e.g., picture naming) and likely elicits particular styles of speech that are more formal than spontaneous speech, because target word selection was constrained by the limited options for the [&#331;&#809;]&#8594;[m&#809;] and [&#331;-]&#8596;&#216; mergers (especially with the requirement of minimal pairs for the perception task), the available target words were not easily picturable. In order to maintain comparability across domains, we chose to include orthography for both production (reading) and perception (word selection) tasks; thus, production and perception tasks were well-matched methodologically. For more discussion, see Section 4.1.1.</p></fn>
<fn id="n3"><p>Interviews were not conducted fully in Cantonese mainly due to limitations of available personnel, but an added advantage is that we minimize the potential for an interviewers&#8217; pronunciation of Cantonese (e.g., if they used merged variants) to have influenced participants&#39; response to minimal pair comparisons and &#8216;lazy pronunciation&#8217;-related questions. In addition, we note that because participants only interacted with the experimenter outside the main task and that the post-task interview was conducted only for additional context, there is no reason to expect this choice to impact the study results in any meaningful way.</p></fn>
<fn id="n4"><p>The model syntax used was: glmer(ProportionL ~ HistoricalPattern * AgeGroup * Gender + (1+HistoricalPattern&#124;Subject) + (1&#124;Item), family=&#8221;binomial&#8221;).</p></fn>
<fn id="n5"><p>Model syntax: glmer(PropNovel ~ Step_centered * AgeGroup * Gender + (1+Step_centered&#124;Subject), family = &#8220;binomial&#8221;)</p></fn>
<fn id="n6"><p>As a comparison point to facilitate interpretation, Casillas and Simonet (<xref ref-type="bibr" rid="B13">2016</xref>) report mean group crispness values of 0.42, 0.21, and 0.11 for monolingual English speakers, Spanish-speaking early learners of English, and Spanish-speaking late learners of English, respectively, for perception of the English /&#230;/-/&#593;/ contrast.</p></fn>
<fn id="n7"><p>Because the distance was too great, attempts to apply transformations for skewed data were not successful in creating a reasonable distribution; thus, we ultimately chose to remove the outliers.</p></fn>
<fn id="n8"><p>Due to the removal of the two outliers prior to scaling and visualizations, the maximum scaled value in Figures 12 and 13 represents the next highest CCS value, which was 0.632.</p></fn>
<fn id="n9"><p>To be conservative, the correlation provided in text includes the two extreme values. Removing those two data points from the correlation does not affect the direction or interpretation of the results [t(45) = 4.54, <italic>r</italic> = 0.56, <italic>p</italic> &lt; 0.001].</p></fn>
<fn id="n10"><p>It&#8217;s worth noting that regardless of its participation in a merger, the auditory-acoustic contrast between syllabic nasals compared to /n/ and /l/ is less robust.</p></fn>
<fn id="n11"><p>While this may not be a foolproof approach due to the less perceptually robust nature of nasal place of articulation contrasts, the use of the upper boundary of [n]&#8594;[l] crispness values serves as an estimate, given that &#8216;maximum&#8217; perceptual crispness values are unknown for this particular contrast.</p></fn>
<fn id="n12"><p>A direct comparison could not be made for the historical null-initial word, as the word <italic>uk1</italic> &#8216;house&#8217; used by To et al. (<xref ref-type="bibr" rid="B71">2015</xref>) was not used in the current study.</p></fn>
<fn id="n13"><p>See Wagner (<xref ref-type="bibr" rid="B73">2012</xref>) for discussion on the multiple definitions or criteria of the term &#8216;age-grading&#8217; (community stability, repeating patterns across generations, marketplace pressure) and why there isn&#8217;t a clear, consensus differentiation between &#8216;age-grading&#8217; and &#8216;lifespan change.&#8217; We reference a particular approach to age-grading described by Wagner (<xref ref-type="bibr" rid="B73">2012</xref>), but we recognize it is not unanimously accepted.</p></fn>
<fn id="n14"><p>Merger reversal across contexts and context-based style-shifting cannot be distinguished in the current data, as we do not compare contexts. As well, they may not be mutually exclusive: More prestige-related style-shifting could co-occur with and/or precede incipient reversals in production outside of formal contexts, both motivated by increased awareness and attitudes towards the social meanings of the merger variants. As we propose these outcomes to have the same source, we discuss them together.</p></fn>
<fn id="n15"><p>We acknowledge the possibility that an age-based effect of hearing sensitivity (or cognitive processing abilities) may have influenced perceptual results to some extent and note that we did not specifically control for this confounding factor. At the same time, because younger men often patterned with older individuals (most noticeably for [&#331;&#809;]&#8594;[m&#809;]), we take the position that the age trend cannot simply be reduced to a difference in auditory-perceptual abilities.</p></fn>
</fn-group>
<ack>
<title>Acknowledgements</title>
<p>We thank Chang Liu for conducting data collection at the Hong Kong Polytechnic University. Thank you to all the members of the UBC Speech in Context Lab, particularly Zoe Lam, Kristy Chang, Ivan Fong, Sophie Bishop, Cassandra Savage, Shannon Briggs, and Rachel Soo, for help with stimuli creation, auditory coding, data analysis, and discussion of the project at various stages. Finally, we appreciate all the audience feedback that we received at the 4th and 5th Workshops on Sound Change, the NorthWest Phonetics &amp; Phonology Conference, and the UBC Language Sciences Undergraduate Research Conference.</p>
</ack>
<sec>
<title>Funding Information</title>
<p>LSPC is supported in part by funding from the Social Sciences and Humanities Research Council of Canada (SSHRC). This research was partially funded by the UBC Language Sciences Initiative and a SSHRC grant awarded to MB, as well as a research grant (P0001897) from the Hong Kong Polytechnic University awarded to YY.</p>
</sec>
<sec>
<title>Competing Interests</title>
<p>The authors have no competing interests to declare.</p>
</sec>
<ref-list>
<ref id="B1"><label>1</label><mixed-citation publication-type="book"><string-name><surname>Ball</surname>, <given-names>J. D.</given-names></string-name> (<year>1907</year>). <source>Cantonese Made Easy: A Book of Simple Sentences in the Cantonese Dialect, with Free and Literal Translations, and Directions for the Rendering of English Grammatical Forms in Chinese</source> (<edition>3rd</edition> ed.). <publisher-name>Kelly &amp; Walsh, Limited</publisher-name>.</mixed-citation></ref>
<ref id="B2"><label>2</label><mixed-citation publication-type="thesis"><string-name><surname>Bauer</surname>, <given-names>R. S.</given-names></string-name> (<year>1982a</year>). <source>Cantonese Sociolinguistic Patterns: Correlating Social Characteristics of Speakers with Phonological Variables in Hong Kong Cantonese</source> (Doctoral dissertation, <publisher-name>University of California, Berkeley</publisher-name>). <uri>https://escholarship.org/uc/item/1sn2f75h</uri></mixed-citation></ref>
<ref id="B3"><label>3</label><mixed-citation publication-type="journal"><string-name><surname>Bauer</surname>, <given-names>R. S.</given-names></string-name> (<year>1982b</year>). <article-title>Lexical Diffusion in Hong Kong Cantonese: &#8220;Five&#8221; Leads the Way</article-title>. <source>Proceedings of the 8th Annual Meeting of the Berkeley Linguistics Society</source>, <fpage>550</fpage>&#8211;<lpage>561</lpage>. DOI: <pub-id pub-id-type="doi">10.3765/bls.v8i0.2037</pub-id></mixed-citation></ref>
<ref id="B4"><label>4</label><mixed-citation publication-type="webpage"><string-name><surname>Bauer</surname>, <given-names>R. S.</given-names></string-name> (<year>1986</year>). <article-title>The Microhistory of a sound change in progress in Hong Kong Cantonese / &#39321;&#28207;&#31908;&#35821;&#20013;&#19968;&#20010;&#27491;&#22312;&#36827;&#34892;&#30340;&#38899;&#21464;&#30340;&#24494;&#35266;&#21382;&#21490;</article-title>. <source>Journal of Chinese Linguistics</source>, <volume>14</volume>(<issue>1</issue>), <fpage>1</fpage>&#8211;<lpage>42</lpage>. <uri>http://www.jstor.org/stable/23754216</uri></mixed-citation></ref>
<ref id="B5"><label>5</label><mixed-citation publication-type="webpage"><string-name><surname>Bauer</surname>, <given-names>R. S.</given-names></string-name>, &amp; <string-name><surname>Benedict</surname>, <given-names>P. K.</given-names></string-name> (<year>1997</year>). <chapter-title>Modern Cantonese Phonology</chapter-title>. In <source>Modern Cantonese Phonology</source>. <publisher-name>De Gruyter Mouton</publisher-name>. <uri>http://www.degruyter.com/document/doi/10.1515/9783110823707/html</uri>. DOI: <pub-id pub-id-type="doi">10.1515/9783110823707</pub-id></mixed-citation></ref>
<ref id="B6"><label>6</label><mixed-citation publication-type="journal"><string-name><surname>Bauer</surname>, <given-names>R. S.</given-names></string-name>, <string-name><surname>Cheung</surname>, <given-names>K.</given-names></string-name>, &amp; <string-name><surname>Cheung</surname>, <given-names>P.</given-names></string-name> (<year>2003</year>). <article-title>Variation and merger of the rising tones in Hong Kong Cantonese</article-title>. <source>Language Variation and Change</source>, <volume>15</volume>(<issue>02</issue>). DOI: <pub-id pub-id-type="doi">10.1017/S0954394503152039</pub-id></mixed-citation></ref>
<ref id="B7"><label>7</label><mixed-citation publication-type="book"><string-name><surname>Beddor</surname>, <given-names>P. S.</given-names></string-name> (<year>2015</year>). <chapter-title>The Relation between Language Users&#8217; Perception and Production Repertoires</chapter-title>. In <collab>The Scottish Consortium for ICPhS 2015</collab> (Ed.), <source>Proceedings of the 18th International Congress of Phonetic Sciences</source> (pp. <fpage>1</fpage>&#8211;<lpage>9</lpage>). <publisher-name>the University of Glasgow</publisher-name>.</mixed-citation></ref>
<ref id="B8"><label>8</label><mixed-citation publication-type="journal"><string-name><surname>Beddor</surname>, <given-names>P. S.</given-names></string-name>, <string-name><surname>Coetzee</surname>, <given-names>A. W.</given-names></string-name>, <string-name><surname>Styler</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>McGowan</surname>, <given-names>K. B.</given-names></string-name>, &amp; <string-name><surname>Boland</surname>, <given-names>J. E.</given-names></string-name> (<year>2018</year>). <article-title>The time course of individuals&#8217; perception of coarticulatory information is linked to their production: Implications for sound change</article-title>. <source>Language</source>, <volume>94</volume>(<issue>4</issue>), <fpage>931</fpage>&#8211;<lpage>968</lpage>. DOI: <pub-id pub-id-type="doi">10.1353/lan.2018.0051</pub-id></mixed-citation></ref>
<ref id="B9"><label>9</label><mixed-citation publication-type="webpage"><string-name><surname>Birdsong</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Gertken</surname>, <given-names>L. M.</given-names></string-name>, &amp; <string-name><surname>Amengual</surname>, <given-names>M.</given-names></string-name> (<year>2012</year>). <source>Bilingual Language Profile: An Easy-to-Use Instrument to Assess Bilingualism</source>. <publisher-name>Centre for Open Educational Resources and Language Learning, University of Texas at Austin</publisher-name>. <uri>https://sites.la.utexas.edu/bilingual/</uri></mixed-citation></ref>
<ref id="B10"><label>10</label><mixed-citation publication-type="webpage"><string-name><surname>Boersma</surname>, <given-names>P.</given-names></string-name>, &amp; <string-name><surname>Weenink</surname>, <given-names>D.</given-names></string-name> (<year>2015</year>). <source>Praat: Doing phonetics by computer</source> (5.4.08) [Computer software]. <uri>www.praat.org</uri></mixed-citation></ref>
<ref id="B11"><label>11</label><mixed-citation publication-type="thesis"><string-name><surname>Bourgerie</surname>, <given-names>D. S.</given-names></string-name> (<year>1990</year>). <source>A quantitative study of sociolinguistic variation in Cantonese</source> (Doctoral dissertation, <publisher-name>The Ohio State University</publisher-name>). <uri>http://search.proquest.com/docview/303872748/abstract/856EDC18ADFD4B8DPQ/1</uri></mixed-citation></ref>
<ref id="B12"><label>12</label><mixed-citation publication-type="webpage"><string-name><surname>Casillas</surname>, <given-names>J. V.</given-names></string-name> (<year>2021</year>). <source>Exploring phonemic boundaries using logistic regression</source>. <uri>https://www.jvcasillas.com/posts/2021-05-15_logistic_regression_and_phonemic_boundaries/</uri></mixed-citation></ref>
<ref id="B13"><label>13</label><mixed-citation publication-type="journal"><string-name><surname>Casillas</surname>, <given-names>J. V.</given-names></string-name>, &amp; <string-name><surname>Simonet</surname>, <given-names>M.</given-names></string-name> (<year>2016</year>). <article-title>Production and perception of the English /&#230;/&#8211;/&#593;/ contrast in switched-dominance speakers</article-title>. <source>Second Language Research</source>, <volume>32</volume>(<issue>2</issue>), <fpage>171</fpage>&#8211;<lpage>195</lpage>. DOI: <pub-id pub-id-type="doi">10.1177/0267658315608912</pub-id></mixed-citation></ref>
<ref id="B14"><label>14</label><mixed-citation publication-type="journal"><string-name><surname>Chen</surname>, <given-names>K. H. Y.</given-names></string-name> (<year>2018</year>). <article-title>Ideologies of Language Standardization: The Case of Cantonese in Hong Kong</article-title>. In <string-name><given-names>J. W.</given-names> <surname>Tollefson</surname></string-name> &amp; <string-name><given-names>M.</given-names> <surname>P&#233;rez-Milans</surname></string-name> (Eds.), <source>The Oxford Handbook of Language Policy and Planning</source> (Vol. <volume>1</volume>). DOI: <pub-id pub-id-type="doi">10.1093/oxfordhb/9780190458898.013.22</pub-id></mixed-citation></ref>
<ref id="B15"><label>15</label><mixed-citation publication-type="book"><string-name><surname>Cheshire</surname>, <given-names>Jenny</given-names></string-name>. <year>2006</year>. <chapter-title>Age- and generation-specific use of language. Sociolinguistics: An international handbook of the science of language and society</chapter-title>, ed. by <string-name><given-names>U.</given-names> <surname>Ammond</surname></string-name>, <string-name><given-names>N.</given-names> <surname>Dittmar</surname></string-name> and <string-name><given-names>K.</given-names> <surname>Mattheier</surname></string-name>, <fpage>1552</fpage>&#8211;<lpage>63</lpage>. <publisher-loc>Berlin, Germany</publisher-loc>: <publisher-name>Walter de Gruyter</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1515/9783110171488.2.8.1552</pub-id></mixed-citation></ref>
<ref id="B16"><label>16</label><mixed-citation publication-type="journal"><string-name><surname>Coetzee</surname>, <given-names>A. W.</given-names></string-name> (<year>2018</year>). <article-title>Individual and community level variation in phonetics and phonology</article-title>. In <string-name><given-names>R.</given-names> <surname>Mesthrie</surname></string-name> &amp; <string-name><given-names>D.</given-names> <surname>Bradley</surname></string-name> (Eds.), <source>The Dynamics of Language: Plenary and focus lectures from the 20th International Congress of Linguists</source> (p. <fpage>17</fpage>).</mixed-citation></ref>
<ref id="B17"><label>17</label><mixed-citation publication-type="journal"><string-name><surname>Coetzee</surname>, <given-names>A. W.</given-names></string-name>, <string-name><surname>Beddor</surname>, <given-names>P. S.</given-names></string-name>, <string-name><surname>Shedden</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Styler</surname>, <given-names>W.</given-names></string-name>, &amp; <string-name><surname>Wissing</surname>, <given-names>D.</given-names></string-name> (<year>2018</year>). <article-title>Plosive voicing in Afrikaans: Differential cue weighting and tonogenesis</article-title>. <source>Journal of Phonetics</source>, <volume>66</volume>, <fpage>185</fpage>&#8211;<lpage>216</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.wocn.2017.09.009</pub-id></mixed-citation></ref>
<ref id="B18"><label>18</label><mixed-citation publication-type="book"><string-name><surname>Dimov</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Katseff</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Johnson</surname>, <given-names>K.</given-names></string-name> (<year>2012</year>). <chapter-title>Social and personality variables in compensation for altered auditory feedback</chapter-title>. In <string-name><given-names>M.-J.</given-names> <surname>Sol&#233;</surname></string-name> &amp; <string-name><given-names>D.</given-names> <surname>Recasens</surname></string-name> (Eds.), <source>The initiation of sound change: Perception, production, and social factors</source> (Vol. <volume>323</volume>, pp. <fpage>185</fpage>&#8211;<lpage>210</lpage>). <publisher-name>John Benjamins Publishing Company</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1075/cilt.323.15dim</pub-id></mixed-citation></ref>
<ref id="B19"><label>19</label><mixed-citation publication-type="book"><string-name><surname>Drager</surname>, <given-names>K.</given-names></string-name>, &amp; <string-name><surname>Kirtley</surname>, <given-names>M. J.</given-names></string-name> (<year>2016</year>). <chapter-title>Awareness, Salience, and Stereotypes in Exemplar-Based Models of Speech Production and Perception</chapter-title>. In <string-name><given-names>A. M.</given-names> <surname>Babel</surname></string-name> (Ed.), <source>Awareness and Control in Sociolinguistic Research</source> (pp. <fpage>1</fpage>&#8211;<lpage>24</lpage>). <publisher-name>Cambridge University Press</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1017/CBO9781139680448.003</pub-id></mixed-citation></ref>
<ref id="B20"><label>20</label><mixed-citation publication-type="journal"><string-name><surname>Evans</surname>, <given-names>B. G.</given-names></string-name>, &amp; <string-name><surname>Iverson</surname>, <given-names>P.</given-names></string-name> (<year>2007</year>). <article-title>Plasticity in vowel perception and production: A study of accent change in young adults</article-title>. <source>The Journal of the Acoustical Society of America</source>, <volume>121</volume>(<issue>6</issue>), <fpage>3814</fpage>&#8211;<lpage>3826</lpage>. DOI: <pub-id pub-id-type="doi">10.1121/1.2722209</pub-id></mixed-citation></ref>
<ref id="B21"><label>21</label><mixed-citation publication-type="journal"><string-name><surname>Evans</surname>, <given-names>K. E.</given-names></string-name>, <string-name><surname>Munson</surname>, <given-names>B.</given-names></string-name>, &amp; <string-name><surname>Edwards</surname>, <given-names>J.</given-names></string-name> (<year>2018</year>). <article-title>Does Speaker Race Affect the Assessment of Children&#8217;s Speech Accuracy? A Comparison of Speech-Language Pathologists and Clinically Untrained Listeners</article-title>. <source>Language, Speech, and Hearing Services in Schools</source>, <volume>49</volume>(<issue>4</issue>), <fpage>906</fpage>&#8211;<lpage>921</lpage>. DOI: <pub-id pub-id-type="doi">10.1044/2018_LSHSS-17-0120</pub-id></mixed-citation></ref>
<ref id="B22"><label>22</label><mixed-citation publication-type="book"><string-name><surname>Fok</surname>, <given-names>C. Y. Y.</given-names></string-name> (<year>1979</year>). <chapter-title>The frequency of occurrence of speech sounds and tones in Cantonese</chapter-title>. In <string-name><given-names>L.</given-names> <surname>Robert</surname></string-name> (Ed.), <source>Hong Kong Language Paper</source> (pp. <fpage>150</fpage>&#8211;<lpage>157</lpage>). <publisher-name>Hong Kong University Press</publisher-name>.</mixed-citation></ref>
<ref id="B23"><label>23</label><mixed-citation publication-type="journal"><string-name><surname>Fridland</surname>, <given-names>V.</given-names></string-name>, &amp; <string-name><surname>Kendall</surname>, <given-names>T.</given-names></string-name> (<year>2012</year>). <article-title>Exploring the relationship between production and perception in the mid front vowels of U.S. English</article-title>. <source>Lingua</source>, <volume>122</volume>(<issue>7</issue>), <fpage>779</fpage>&#8211;<lpage>793</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.lingua.2011.12.007</pub-id></mixed-citation></ref>
<ref id="B24"><label>24</label><mixed-citation publication-type="webpage"><string-name><surname>Gamer</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Lemon</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Fellows</surname>, <given-names>I.</given-names></string-name>, &amp; <string-name><surname>Singh</surname>, <given-names>P.</given-names></string-name> (<year>2019</year>). <source>irr: Various Coefficients of Interrater Reliability and Agreement</source> (0.84.1) [Computer software]. <uri>https://rdrr.io/cran/irr/</uri></mixed-citation></ref>
<ref id="B25"><label>25</label><mixed-citation publication-type="book"><string-name><surname>Garrett</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>Johnson</surname>, <given-names>K.</given-names></string-name> (<year>2013</year>). <chapter-title>Phonetic bias in sound change</chapter-title>. In <string-name><given-names>A. C. L.</given-names> <surname>Yu</surname></string-name> (Ed.), <source>Origins of Sound Change</source> (pp. <fpage>51</fpage>&#8211;<lpage>97</lpage>). <publisher-name>Oxford University Press</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1093/acprof:oso/9780199573745.003.0003</pub-id></mixed-citation></ref>
<ref id="B26"><label>26</label><mixed-citation publication-type="book"><string-name><surname>Grosvald</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name><surname>Corina</surname>, <given-names>D. P.</given-names></string-name> (<year>2012</year>). <chapter-title>The production and perception of sub-phonemic vowel contrasts and the role of the listener in sound change</chapter-title>. In <string-name><given-names>M.-J.</given-names> <surname>Sol&#233;</surname></string-name> &amp; <string-name><given-names>D.</given-names> <surname>Recasens</surname></string-name> (Eds.), <source>The initiation of sound change: Perception, production, and social factors</source> (Vol. <volume>323</volume>, pp. <fpage>77</fpage>&#8211;<lpage>100</lpage>). <publisher-name>John Benjamins Publishing Company</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1075/cilt.323.08gro</pub-id></mixed-citation></ref>
<ref id="B27"><label>27</label><mixed-citation publication-type="journal"><string-name><surname>Hall</surname>, <given-names>K. C.</given-names></string-name> (<year>2013</year>). <article-title>A typology of intermediate phonological relationships</article-title>. <source>The Linguistic Review</source>, <volume>30</volume>(<issue>2</issue>), <fpage>215</fpage>&#8211;<lpage>275</lpage>. DOI: <pub-id pub-id-type="doi">10.1515/tlr-2013-0008</pub-id></mixed-citation></ref>
<ref id="B28"><label>28</label><mixed-citation publication-type="journal"><string-name><surname>Harnsberger</surname>, <given-names>J.</given-names></string-name> (<year>2000</year>). <article-title>On the relationship between identification and discrimination of non-native nasal consonants</article-title>. <source>Journal of the Acoustical Society of America</source>, <volume>110</volume>, <fpage>489</fpage>&#8211;<lpage>503</lpage>. DOI: <pub-id pub-id-type="doi">10.1121/1.1371758</pub-id></mixed-citation></ref>
<ref id="B29"><label>29</label><mixed-citation publication-type="webpage"><string-name><surname>Harrington</surname>, <given-names>J.</given-names></string-name> (<year>2012</year>). <article-title>The coarticulatory basis of diachronic high back vowel fronting</article-title>. <source>The Initiation of Sound Change</source>, <fpage>103</fpage>&#8211;<lpage>122</lpage>. <uri>http://www.jbe.platform.com/content/books/9789027273666-cilt.323.10har</uri>. DOI: <pub-id pub-id-type="doi">10.1075/cilt.323.10har</pub-id></mixed-citation></ref>
<ref id="B30"><label>30</label><mixed-citation publication-type="journal"><string-name><surname>Harrington</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Kleber</surname>, <given-names>F.</given-names></string-name>, &amp; <string-name><surname>Reubold</surname>, <given-names>U.</given-names></string-name> (<year>2008</year>). <article-title>Compensation for coarticulation, /u/-fronting, and sound change in standard southern British: An acoustic and perceptual study</article-title>. <source>The Journal of the Acoustical Society of America</source>, <volume>123</volume>(<issue>5</issue>), <fpage>2825</fpage>&#8211;<lpage>2835</lpage>. DOI: <pub-id pub-id-type="doi">10.1121/1.2897042</pub-id></mixed-citation></ref>
<ref id="B31"><label>31</label><mixed-citation publication-type="book"><string-name><surname>Hashimoto</surname>, <given-names>O. Y.</given-names></string-name> (<year>1972</year>). <source>Phonology of Cantonese</source> (Vol. <volume>1</volume>). <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation></ref>
<ref id="B32"><label>32</label><mixed-citation publication-type="journal"><string-name><surname>Hay</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Warren</surname>, <given-names>P.</given-names></string-name>, &amp; <string-name><surname>Drager</surname>, <given-names>K.</given-names></string-name> (<year>2006</year>). <article-title>Factors influencing speech perception in the context of a merger-in-progress</article-title>. <source>Journal of Phonetics</source>, <volume>34</volume>(<issue>4</issue>), <fpage>458</fpage>&#8211;<lpage>484</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.wocn.2005.10.001</pub-id></mixed-citation></ref>
<ref id="B33"><label>33</label><mixed-citation publication-type="book"><string-name><surname>Hickey</surname>, <given-names>R.</given-names></string-name> (<year>2012</year>). <chapter-title>Internally- and externally-motivated language change</chapter-title>. In <string-name><given-names>J. M.</given-names> <surname>Hern&#225;ndez-Campoy</surname></string-name> &amp; <string-name><given-names>J. C.</given-names> <surname>Conde-Silvestre</surname></string-name> (Eds.), <source>The Handbook of Historical Sociolinguistics</source> (pp. <fpage>387</fpage>&#8211;<lpage>407</lpage>). <publisher-name>Wiley-Blackwell</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1002/9781118257227.ch21</pub-id></mixed-citation></ref>
<ref id="B34"><label>34</label><mixed-citation publication-type="book"><string-name><surname>Johnson</surname>, <given-names>K.</given-names></string-name> (<year>1997</year>). <chapter-title>Speech perception without speaker normalization: An exemplar model</chapter-title>. In <string-name><given-names>J.</given-names> <surname>Mullennix</surname></string-name> (Ed.), <source>Talker Variability in Speech Processing</source>. (pp. <fpage>145</fpage>&#8211;<lpage>165</lpage>.). <publisher-name>Academic Press</publisher-name>.</mixed-citation></ref>
<ref id="B35"><label>35</label><mixed-citation publication-type="thesis"><string-name><surname>Kataoka</surname>, <given-names>R.</given-names></string-name> (<year>2011</year>). <source>Phonetic and Cognitive Bases of Sound Change</source> (Doctoral dissertation, <publisher-name>University of California, Berkeley</publisher-name>). <uri>https://escholarship.org/content/qt3m20112c/qt3m20112c.pdf</uri></mixed-citation></ref>
<ref id="B36"><label>36</label><mixed-citation publication-type="journal"><string-name><surname>Kawahara</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Morise</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Takahashi</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Nisimura</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Irino</surname>, <given-names>T.</given-names></string-name>, &amp; <string-name><surname>Banno</surname>, <given-names>H.</given-names></string-name> (<year>2008</year>). <article-title>Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0, and aperiodicity estimation</article-title>. <source>Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing</source>, <fpage>3933</fpage>&#8211;<lpage>3936</lpage>. DOI: <pub-id pub-id-type="doi">10.1109/ICASSP.2008.4518514</pub-id></mixed-citation></ref>
<ref id="B37"><label>37</label><mixed-citation publication-type="journal"><string-name><surname>Kim</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Clayards</surname>, <given-names>M.</given-names></string-name> (<year>2019</year>). <article-title>Individual differences in the link between perception and production and the mechanisms of phonetic imitation</article-title>. <source>Language, Cognition and Neuroscience</source>, <volume>34</volume>(<issue>6</issue>), <fpage>769</fpage>&#8211;<lpage>786</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/23273798.2019.1582787</pub-id></mixed-citation></ref>
<ref id="B38"><label>38</label><mixed-citation publication-type="journal"><string-name><surname>Kleber</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Harrington</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Reubold</surname>, <given-names>U.</given-names></string-name> (<year>2012</year>). <article-title>The Relationship between the Perception and Production of Coarticulation during a Sound Change in Progress</article-title>. <source>Language and Speech</source>, <volume>55</volume>(<issue>3</issue>), <fpage>383</fpage>&#8211;<lpage>405</lpage>. DOI: <pub-id pub-id-type="doi">10.1177/0023830911422194</pub-id></mixed-citation></ref>
<ref id="B39"><label>39</label><mixed-citation publication-type="journal"><string-name><surname>Kong</surname>, <given-names>E. J.</given-names></string-name>, &amp; <string-name><surname>Edwards</surname>, <given-names>J.</given-names></string-name> (<year>2016</year>). <article-title>Individual differences in categorical perception of speech: Cue weighting and executive function</article-title>. <source>Journal of Phonetics</source>, <volume>59</volume>, <fpage>40</fpage>&#8211;<lpage>57</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.wocn.2016.08.006</pub-id></mixed-citation></ref>
<ref id="B40"><label>40</label><mixed-citation publication-type="webpage"><string-name><surname>Koops</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Gentry</surname>, <given-names>E.</given-names></string-name>, &amp; <string-name><surname>Pantos</surname>, <given-names>A.</given-names></string-name> (<year>2008</year>). <article-title>The effect of perceived speaker age on the perception of PIN and PEN vowels in Houston, Texas</article-title>. <source>University of Pennsylvania Working Papers in Linguistics</source>, <volume>14</volume>(<issue>2</issue>). <uri>https://repository.upenn.edu/pwpl/vol14/iss2/12</uri></mixed-citation></ref>
<ref id="B41"><label>41</label><mixed-citation publication-type="journal"><string-name><surname>Kuang</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Cui</surname>, <given-names>A.</given-names></string-name> (<year>2018</year>). <article-title>Relative cue weighting in production and perception of an ongoing sound change in Southern Yi</article-title>. <source>Journal of Phonetics</source>, <volume>71</volume>, <fpage>194</fpage>&#8211;<lpage>214</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.wocn.2018.09.002</pub-id></mixed-citation></ref>
<ref id="B42"><label>42</label><mixed-citation publication-type="thesis"><string-name><surname>Kwon</surname>, <given-names>H.</given-names></string-name> (<year>2015</year>). <source>Cue Primacy and Spontaneous Imitation: Is Imitation Phonetic or Phonological?</source> (Doctoral dissertation, <publisher-name>University of Michigan</publisher-name>).</mixed-citation></ref>
<ref id="B43"><label>43</label><mixed-citation publication-type="journal"><string-name><surname>Labov</surname>, <given-names>W.</given-names></string-name> (<year>1990</year>). <article-title>The intersection of sex and social class in the course of linguistic change</article-title>. <source>Language Variation and Change</source>, <volume>2</volume>(<issue>2</issue>), <fpage>205</fpage>&#8211;<lpage>254</lpage>. DOI: <pub-id pub-id-type="doi">10.1017/S0954394500000338</pub-id></mixed-citation></ref>
<ref id="B44"><label>44</label><mixed-citation publication-type="book"><string-name><surname>Labov</surname>, <given-names>W.</given-names></string-name> (<year>1994</year>). <source>Principles of linguistic change. Vol. 1: Internal factors</source>. <publisher-name>Blackwell</publisher-name>.</mixed-citation></ref>
<ref id="B45"><label>45</label><mixed-citation publication-type="book"><string-name><surname>Labov</surname>, <given-names>W.</given-names></string-name> (<year>2001</year>). <source>Principles of linguistic change. Vol. 2: Social factors</source> (Digital print). <publisher-name>Blackwell</publisher-name>.</mixed-citation></ref>
<ref id="B46"><label>46</label><mixed-citation publication-type="webpage"><string-name><surname>Labov</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Rosenfelder</surname>, <given-names>I.</given-names></string-name>, &amp; <string-name><surname>Fruehwald</surname>, <given-names>J.</given-names></string-name> (<year>2013</year>). <article-title>One hundred years of sound change in Philadelphia: Linear incrementation, reversal, and reanalysis</article-title>. <source>Language</source>, <volume>89</volume>(<issue>1</issue>), <fpage>30</fpage>&#8211;<lpage>65</lpage>. <uri>http://www.jstor.org/stable/23357721</uri>. DOI: <pub-id pub-id-type="doi">10.1353/lan.2013.0015</pub-id></mixed-citation></ref>
<ref id="B47"><label>47</label><mixed-citation publication-type="journal"><string-name><surname>Lai</surname>, <given-names>M.-L.</given-names></string-name> (<year>2001</year>). <article-title>Hong Kong Students&#8217; Attitudes Towards Cantonese, Putonghua and English After the Change of Sovereignty</article-title>. <source>Journal of Multilingual and Multicultural Development</source>, <volume>22</volume>(<issue>2</issue>), <fpage>112</fpage>&#8211;<lpage>133</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/01434630108666428</pub-id></mixed-citation></ref>
<ref id="B48"><label>48</label><mixed-citation publication-type="journal"><string-name><surname>Lai</surname>, <given-names>M. L.</given-names></string-name> (<year>2011</year>). <article-title>Cultural identity and language attitudes &#8211; into the second decade of postcolonial Hong Kong</article-title>. <source>Journal of Multilingual and Multicultural Development</source>, <volume>32</volume>(<issue>3</issue>), <fpage>249</fpage>&#8211;<lpage>264</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/01434632.2010.539692</pub-id></mixed-citation></ref>
<ref id="B49"><label>49</label><mixed-citation publication-type="book"><string-name><surname>Lau</surname>, <given-names>S.</given-names></string-name> (<year>1977</year>). <source>A Practical Cantonese-English Dictionary</source>. <publisher-name>The Government Printer</publisher-name>.</mixed-citation></ref>
<ref id="B50"><label>50</label><mixed-citation publication-type="journal"><string-name><surname>Lee</surname>, <given-names>K. S.</given-names></string-name>, &amp; <string-name><surname>Leung</surname>, <given-names>W. M.</given-names></string-name> (<year>2012</year>). <article-title>The status of Cantonese in the education policy of Hong Kong</article-title>. <source>Multilingual Education</source>, <volume>2</volume>(<issue>1</issue>), <fpage>2</fpage>. DOI: <pub-id pub-id-type="doi">10.1186/10.1186/2191-5059-2-2</pub-id></mixed-citation></ref>
<ref id="B51"><label>51</label><mixed-citation publication-type="journal"><string-name><surname>Leung</surname>, <given-names>M.-T.</given-names></string-name>, <string-name><surname>Law</surname>, <given-names>S.-P.</given-names></string-name>, &amp; <string-name><surname>Fung</surname>, <given-names>S.-Y.</given-names></string-name> (<year>2004</year>). <article-title>Type and token frequencies of phonological units in Hong Kong Cantonese</article-title>. <source>Behavior Research Methods, Instruments, &amp; Computers</source>, <volume>36</volume>(<issue>3</issue>), <fpage>500</fpage>&#8211;<lpage>505</lpage>. DOI: <pub-id pub-id-type="doi">10.3758/BF03195596</pub-id></mixed-citation></ref>
<ref id="B52"><label>52</label><mixed-citation publication-type="journal"><string-name><surname>Lev-Ari</surname>, <given-names>S.</given-names></string-name> (<year>2017</year>). <article-title>Talking to fewer people leads to having more malleable linguistic representations</article-title>. <source>PLOS ONE</source>, <volume>12</volume>(<issue>8</issue>), <elocation-id>e0183593</elocation-id>. DOI: <pub-id pub-id-type="doi">10.1371/journal.pone.0183593</pub-id></mixed-citation></ref>
<ref id="B53"><label>53</label><mixed-citation publication-type="journal"><string-name><surname>Lin</surname>, <given-names>Y.</given-names></string-name>, <string-name><surname>Yao</surname>, <given-names>Y.</given-names></string-name>, &amp; <string-name><surname>Luo</surname>, <given-names>J.</given-names></string-name> (<year>2021</year>). <article-title>Phonetic accommodation of tone: Reversing a tone merger-in-progress via imitation</article-title>. <source>Journal of Phonetics</source>, <volume>87</volume>, <elocation-id>101060</elocation-id>. DOI: <pub-id pub-id-type="doi">10.1016/j.wocn.2021.101060</pub-id></mixed-citation></ref>
<ref id="B54"><label>54</label><mixed-citation publication-type="journal"><string-name><surname>Mok</surname>, <given-names>P. P. K.</given-names></string-name>, <string-name><surname>Zuo</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Wong</surname>, <given-names>P. W. Y.</given-names></string-name> (<year>2013</year>). <article-title>Production and perception of a sound change in progress: Tone merging in Hong Kong Cantonese</article-title>. <source>Language Variation and Change</source>, <volume>25</volume>(<issue>03</issue>), <fpage>341</fpage>&#8211;<lpage>370</lpage>. DOI: <pub-id pub-id-type="doi">10.1017/S0954394513000161</pub-id></mixed-citation></ref>
<ref id="B55"><label>55</label><mixed-citation publication-type="webpage"><string-name><surname>Morrison</surname>, <given-names>G.</given-names></string-name> (<year>2007</year>). <chapter-title>Logistic regression modelling for first and second language perception data</chapter-title>. In <string-name><given-names>M.-J.</given-names> <surname>Sol&#233;</surname></string-name>, <string-name><given-names>P.</given-names> <surname>Prieto</surname></string-name>, &amp; <string-name><given-names>J.</given-names> <surname>Mascaro</surname></string-name> (Eds.), <source>Segmental and prosodic issues in Romance phonology</source> (pp. <fpage>219</fpage>&#8211;<lpage>236</lpage>). <publisher-name>John Benjamins</publisher-name>. <uri>https://research.aston.ac.uk/en/publications/logistic-regression-modelling-for-first-and-second-language-perce</uri>. DOI: <pub-id pub-id-type="doi">10.1075/cilt.282.15mor</pub-id></mixed-citation></ref>
<ref id="B56"><label>56</label><mixed-citation publication-type="journal"><string-name><surname>Narayan</surname>, <given-names>C.</given-names></string-name> (<year>2008</year>). <article-title>The acoustic-perceptual salience of nasal place contrasts</article-title>. <source>Journal of Phonetics</source>, <volume>36</volume>, <fpage>191</fpage>&#8211;<lpage>217</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.wocn.2007.10.001</pub-id></mixed-citation></ref>
<ref id="B57"><label>57</label><mixed-citation publication-type="journal"><string-name><surname>Ng</surname>, <given-names>M. L.</given-names></string-name>, &amp; <string-name><surname>Kwok</surname>, <given-names>I. C. L.</given-names></string-name> (<year>2004</year>). <article-title>Frequency of occurrence of Cantonese word initials, finals and tones</article-title>. <source>Asia Pacific Journal of Speech, Language and Hearing</source>, <volume>9</volume>(<issue>3</issue>), <fpage>189</fpage>&#8211;<lpage>199</lpage>. DOI: <pub-id pub-id-type="doi">10.1179/136132804805575868</pub-id></mixed-citation></ref>
<ref id="B58"><label>58</label><mixed-citation publication-type="journal"><string-name><surname>Ou</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Law</surname>, <given-names>S.-P.</given-names></string-name> (<year>2017</year>). <article-title>Cognitive basis of individual differences in speech perception, production and representations: The role of domain general attentional switching</article-title>. <source>Attention, Perception, &amp; Psychophysics</source>, <volume>79</volume>(<issue>3</issue>), <fpage>945</fpage>&#8211;<lpage>963</lpage>. DOI: <pub-id pub-id-type="doi">10.3758/s13414-017-1283-z</pub-id></mixed-citation></ref>
<ref id="B59"><label>59</label><mixed-citation publication-type="thesis"><string-name><surname>Pan</surname>, <given-names>P. G.</given-names></string-name> (<year>1981</year>). <source>Prestige forms and phonological variation in Hong Kong Cantonese speech</source> (Doctoral dissertation, <publisher-name>The University of Hong Kong</publisher-name>). DOI: <pub-id pub-id-type="doi">10.5353/th_b4389299</pub-id></mixed-citation></ref>
<ref id="B60"><label>60</label><mixed-citation publication-type="journal"><string-name><surname>Pinget</surname>, <given-names>A.-F.</given-names></string-name>, <string-name><surname>Kager</surname>, <given-names>R.</given-names></string-name>, &amp; <string-name><surname>Van de Velde</surname>, <given-names>H.</given-names></string-name> (<year>2020</year>). <article-title>Linking variation in perception and production in sound change: Evidence from Dutch obstruent devoicing</article-title>. <source>Language and Speech</source>, <volume>63</volume>(<issue>3</issue>), <fpage>660</fpage>&#8211;<lpage>685</lpage>. DOI: <pub-id pub-id-type="doi">10.1177/0023830919880206</pub-id></mixed-citation></ref>
<ref id="B61"><label>61</label><mixed-citation publication-type="webpage"><collab>R Core Team</collab>. (<year>2020</year>). <source>R: A language and environment for statistical computing</source>. <publisher-name>R Foundation for Statistical Computing</publisher-name>. <uri>https://www.R-project.org/</uri></mixed-citation></ref>
<ref id="B62"><label>62</label><mixed-citation publication-type="journal"><string-name><surname>Samuel</surname>, <given-names>A. G.</given-names></string-name>, &amp; <string-name><surname>Larraza</surname>, <given-names>S.</given-names></string-name> (<year>2015</year>). <article-title>Does listening to non-native speech impair speech perception?</article-title> <source>Journal of Memory and Language</source>, <volume>81</volume>, <fpage>51</fpage>&#8211;<lpage>71</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2015.01.003</pub-id></mixed-citation></ref>
<ref id="B63"><label>63</label><mixed-citation publication-type="journal"><string-name><surname>Schertz</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Cho</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Lotto</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>Warner</surname>, <given-names>N.</given-names></string-name> (<year>2015</year>). <article-title>Individual differences in phonetic cue use in production and perception of a non-native sound contrast</article-title>. <source>Journal of Phonetics</source>, <volume>52</volume>, <fpage>183</fpage>&#8211;<lpage>204</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.wocn.2015.07.003</pub-id></mixed-citation></ref>
<ref id="B64"><label>64</label><mixed-citation publication-type="book"><string-name><surname>Schneider</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Eschman</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>Zuccolotto</surname>, <given-names>A.</given-names></string-name> (<year>2007</year>). <source>E-Prime 2.0 professional</source>. <publisher-name>Psychology Software Tools</publisher-name>.</mixed-citation></ref>
<ref id="B65"><label>65</label><mixed-citation publication-type="journal"><string-name><surname>Schultz</surname>, <given-names>A. A.</given-names></string-name>, <string-name><surname>Francis</surname>, <given-names>A. L.</given-names></string-name>, &amp; <string-name><surname>Llanos</surname>, <given-names>F.</given-names></string-name> (<year>2012</year>). <article-title>Differential cue weighting in perception and production of consonant voicing</article-title>. <source>The Journal of the Acoustical Society of America</source>, <volume>132</volume>(<issue>2</issue>), <fpage>EL95</fpage>&#8211;<lpage>EL101</lpage>. DOI: <pub-id pub-id-type="doi">10.1121/1.4736711</pub-id></mixed-citation></ref>
<ref id="B66"><label>66</label><mixed-citation publication-type="journal"><string-name><surname>Sharma</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Sankaran</surname>, <given-names>L.</given-names></string-name> (<year>2011</year>). <article-title>Cognitive and social forces in dialect shift: Gradual change in London Asian speech</article-title>. <source>Language Variation and Change</source>, <volume>23</volume>(<issue>3</issue>), <fpage>399</fpage>&#8211;<lpage>428</lpage>. DOI: <pub-id pub-id-type="doi">10.1017/S0954394511000159</pub-id></mixed-citation></ref>
<ref id="B67"><label>67</label><mixed-citation publication-type="journal"><string-name><surname>Soo</surname>, <given-names>R.</given-names></string-name>, &amp; <string-name><surname>Babel</surname>, <given-names>M.</given-names></string-name> (<year>2020</year>, <month>November</month> <day>19</day>). <source>Recognition &amp; representation of Cantonese sound change variants in the lexicon</source>. 61st Annual Meeting of the Psychonomics Society.</mixed-citation></ref>
<ref id="B68"><label>68</label><mixed-citation publication-type="journal"><string-name><surname>Sumner</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Kim</surname>, <given-names>S. K.</given-names></string-name>, <string-name><surname>King</surname>, <given-names>E.</given-names></string-name>, &amp; <string-name><surname>McGowan</surname>, <given-names>K.</given-names></string-name> (<year>2014</year>). <article-title>The socially weighted encoding of spoken words: A dual-route approach to speech perception</article-title>. <source>Frontiers in Psychology</source>, <volume>4</volume>, <fpage>1015</fpage>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2013.01015</pub-id></mixed-citation></ref>
<ref id="B69"><label>69</label><mixed-citation publication-type="journal"><string-name><surname>Sumner</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name><surname>Samuel</surname>, <given-names>A. G.</given-names></string-name> (<year>2009</year>). <article-title>The effect of experience on the perception and representation of dialect variants</article-title>. <source>Journal of Memory and Language</source>, <volume>60</volume>(<issue>4</issue>), <fpage>487</fpage>&#8211;<lpage>501</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2009.01.001</pub-id></mixed-citation></ref>
<ref id="B70"><label>70</label><mixed-citation publication-type="journal"><string-name><surname>Tamminga</surname>, <given-names>M.</given-names></string-name> (<year>2019</year>). <article-title>Interspeaker covariation in Philadelphia vowel changes</article-title>. <source>Language Variation and Change</source>, <volume>31</volume>(<issue>2</issue>), <fpage>119</fpage>&#8211;<lpage>133</lpage>. DOI: <pub-id pub-id-type="doi">10.1017/S0954394519000139</pub-id></mixed-citation></ref>
<ref id="B71"><label>71</label><mixed-citation publication-type="journal"><string-name><surname>To</surname>, <given-names>C. K. S.</given-names></string-name>, <string-name><surname>Mcleod</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Cheung</surname>, <given-names>P. S. P.</given-names></string-name> (<year>2015</year>). <article-title>Phonetic variations and sound changes in Hong Kong Cantonese: Diachronic review, synchronic study and implications for speech sound assessment</article-title>. <source>Clinical Linguistics &amp; Phonetics</source>, <volume>29</volume>(<issue>5</issue>), <fpage>333</fpage>&#8211;<lpage>353</lpage>. DOI: <pub-id pub-id-type="doi">10.3109/02699206.2014.1003329</pub-id></mixed-citation></ref>
<ref id="B72"><label>72</label><mixed-citation publication-type="journal"><string-name><surname>Voeten</surname>, <given-names>C. C.</given-names></string-name> (<year>2020</year>). <article-title>Individual differences in the adoption of sound change</article-title>. <source>Language and Speech</source>, <fpage>1</fpage>&#8211;<lpage>37</lpage>. DOI: <pub-id pub-id-type="doi">10.1177/0023830920959753</pub-id></mixed-citation></ref>
<ref id="B73"><label>73</label><mixed-citation publication-type="journal"><string-name><surname>Wagner</surname>, <given-names>S. E.</given-names></string-name> (<year>2012</year>). <article-title>Age grading in sociolinguistic theory</article-title>. <source>Language and Linguistics Compass</source>, <volume>6</volume>(<issue>6</issue>), <fpage>371</fpage>&#8211;<lpage>382</lpage>. DOI: <pub-id pub-id-type="doi">10.1002/lnc3.343</pub-id></mixed-citation></ref>
<ref id="B74"><label>74</label><mixed-citation publication-type="journal"><string-name><surname>Wang</surname>, <given-names>L.</given-names></string-name>, &amp; <string-name><surname>Kirkpatrick</surname>, <given-names>A.</given-names></string-name> (<year>2015</year>). <article-title>Trilingual education in Hong Kong primary schools: An overview</article-title>. <source>Multilingual Education</source>, <volume>5</volume>(<issue>1</issue>), <fpage>3</fpage>. DOI: <pub-id pub-id-type="doi">10.1186/s13616-015-0023-8</pub-id></mixed-citation></ref>
<ref id="B75"><label>75</label><mixed-citation publication-type="journal"><string-name><surname>Whelpton</surname>, <given-names>J.</given-names></string-name> (<year>1999</year>). <article-title>The future of Cantonese: Current trends</article-title>. <source>Hong Kong Journal of Applied Linguistics</source>, <volume>4</volume>(<issue>1</issue>), <fpage>43</fpage>&#8211;<lpage>60</lpage>.</mixed-citation></ref>
<ref id="B76"><label>76</label><mixed-citation publication-type="journal"><string-name><surname>Wong</surname>, <given-names>A. D.</given-names></string-name> (<year>2019</year>). <article-title>Authenticity, belonging, and charter myths of Cantonese</article-title>. <source>Language &amp; Communication</source>, <volume>68</volume>, <fpage>37</fpage>&#8211;<lpage>45</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.langcom.2018.10.011</pub-id></mixed-citation></ref>
<ref id="B77"><label>77</label><mixed-citation publication-type="book"><string-name><surname>Wong</surname>, <given-names>S. L.</given-names></string-name> (<year>1941</year>). <source>A Chinese syllabary pronounced according to the dialect of Canton</source>. <publisher-name>Chung Hwa Book Co</publisher-name>.</mixed-citation></ref>
<ref id="B78"><label>78</label><mixed-citation publication-type="journal"><string-name><surname>Yao</surname>, <given-names>Y.</given-names></string-name>, &amp; <string-name><surname>Chang</surname>, <given-names>C. B.</given-names></string-name> (<year>2016</year>). <article-title>On the cognitive basis of contact-induced sound change: Vowel merger reversal in Shanghainese</article-title>. <source>Language</source>, <volume>92</volume>(<issue>2</issue>), <fpage>433</fpage>&#8211;<lpage>467</lpage>. DOI: <pub-id pub-id-type="doi">10.1353/lan.2016.0031</pub-id></mixed-citation></ref>
<ref id="B79"><label>79</label><mixed-citation publication-type="thesis"><string-name><surname>Yeung</surname>, <given-names>S.</given-names></string-name> (<year>1980</year>). <source>Some aspects of phonological variations in the Cantonese spoken in Hong Kong</source> [Master&#8217;s thesis]. <publisher-name>Hong Kong University</publisher-name>.</mixed-citation></ref>
<ref id="B80"><label>80</label><mixed-citation publication-type="journal"><string-name><surname>Yu</surname>, <given-names>A. C. L.</given-names></string-name> (<year>2010</year>). <article-title>Perceptual compensation is correlated with individuals&#8217; &#8220;autistic&#8221; traits: Implications for models of sound change</article-title>. <source>PLoS ONE</source>, <volume>5</volume>(<issue>8</issue>), <elocation-id>e11950</elocation-id>. DOI: <pub-id pub-id-type="doi">10.1371/journal.pone.0011950</pub-id></mixed-citation></ref>
<ref id="B81"><label>81</label><mixed-citation publication-type="journal"><string-name><surname>Yu</surname>, <given-names>A. C. L.</given-names></string-name>, <string-name><surname>Abrego-Collier</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Sonderegger</surname>, <given-names>M.</given-names></string-name> (<year>2013</year>). <article-title>Phonetic Imitation from an Individual-Difference Perspective: Subjective Attitude, Personality and &#8220;Autistic&#8221; Traits</article-title>. <source>PLoS ONE</source>, <volume>8</volume>(<issue>9</issue>), <elocation-id>e74746</elocation-id>. DOI: <pub-id pub-id-type="doi">10.1371/journal.pone.0074746</pub-id></mixed-citation></ref>
<ref id="B82"><label>82</label><mixed-citation publication-type="journal"><string-name><surname>Yu</surname>, <given-names>A. C. L.</given-names></string-name>, &amp; <string-name><surname>Zellou</surname>, <given-names>G.</given-names></string-name> (<year>2019</year>). <article-title>Individual differences in language processing: Phonology</article-title>. <source>Annual Review of Linguistics</source>, <volume>5</volume>(<issue>1</issue>), <fpage>131</fpage>&#8211;<lpage>150</lpage>. DOI: <pub-id pub-id-type="doi">10.1146/annurev-linguistics-011516-033815</pub-id></mixed-citation></ref>
<ref id="B83"><label>83</label><mixed-citation publication-type="webpage"><string-name><surname>Zee</surname>, <given-names>E.</given-names></string-name> (<year>1999a</year>). <chapter-title>Change and variation in the syllable-initial and syllable-final consonants in Hong Kong Cantonese / &#39321;&#28207;&#31908;&#35821;&#20013;&#22768;&#27597;&#21450;&#38901;&#23614;&#36741;&#38899;&#20043;&#21464;&#21270;&#19982;&#21464;&#24322;</chapter-title>. <source>Journal of Chinese Linguistics</source>, <volume>27</volume>(<issue>1</issue>), <fpage>120</fpage>&#8211;<lpage>167</lpage>. <publisher-name>JSTOR</publisher-name>. <uri>http://www.jstor.org/stable/23756746</uri></mixed-citation></ref>
<ref id="B84"><label>84</label><mixed-citation publication-type="book"><string-name><surname>Zee</surname>, <given-names>E.</given-names></string-name> (<year>1999b</year>). <chapter-title>Chinese (Hong Kong Cantonese)</chapter-title>. In <string-name><given-names>I. P.</given-names> <surname>Association</surname></string-name> (Ed.), <source>Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet</source> (pp. <fpage>58</fpage>&#8211;<lpage>60</lpage>). <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation></ref>
<ref id="B85"><label>85</label><mixed-citation publication-type="journal"><string-name><surname>Zellou</surname>, <given-names>G.</given-names></string-name> (<year>2017</year>). <article-title>Individual differences in the production of nasal coarticulation and perceptual compensation</article-title>. <source>Journal of Phonetics</source>, <volume>61</volume>, <fpage>13</fpage>&#8211;<lpage>29</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.wocn.2016.12.002</pub-id></mixed-citation></ref>
<ref id="B86"><label>86</label><mixed-citation publication-type="journal"><string-name><surname>Zhang</surname>, <given-names>J.</given-names></string-name> (<year>2019</year>). <article-title>Tone mergers in Cantonese: Evidence from Hong Kong, Macao, and Zhuhai</article-title>. <source>Asia-Pacific Language Variation</source>, <volume>5</volume>(<issue>1</issue>), <fpage>28</fpage>&#8211;<lpage>49</lpage>. DOI: <pub-id pub-id-type="doi">10.1075/aplv.18007.zha</pub-id></mixed-citation></ref>
</ref-list>
</back>
</article>