The Patterns that Escape Us | Oxford Protein Informatics Group

Part The First: An Outrageous Claim

Reproduced below is the introductory passage from a psycholinguistics paper, published in the mid-nineties. Riveted, as I’m sure you are, having just read that banger opening line to my blog post, humour me and read on; I promise it gets interesting.

The segmentation problem

The orthography of English has a very simple basis for establishing where words in written texts begin and end: both before and also after every word are empty spaces and this demarcation surely helps the reader comprehend. In a spoken text, however, as presented to a hearer, such explicit segmentation cues are rarely to be found; little pauses after every single word might make things clearer, but the input is continuous – a running stream of sound. This implies that part of listening involves an operation whereby input is segmented, to be processed word by word, for we cannot hold in memory each total collocation, as most sentences we come across are previously unheard. Yet we listeners experience no sense of some dramatic act of separating input into pieces that are known; as we listen to an utterance it seems unproblematic – words in sentences seem just as clear as words that stand alone. Just how listeners accomplish such an effortless division is a question that psychologists have now begun to solve, and this paper will describe (although with minimal precision) some experimental studies showing what it might involve. The findings, as this summary explains, at once can vindicate the order of the problem and the hearer’s sense of ease, for though speech must be segmented, yet the data plainly indicate that rhythm in the input makes segmenting speech a breeze.

Notice any stylistic choices that might be considered odd for a research paper? Any particular feature of the word usage that jumps out at you? Does the text seem strangely hard to read? (I promise this isn’t a post about ‘proper writing style’ – I’d hardly be qualified.)

OK, how about we try some re-formatting?

The segmentation problem

The orthography of English has a very simple basis
for establishing where words in written texts begin and end:
both before and also after every word are empty spaces
and this demarcation surely helps the reader comprehend.

In a spoken text, however, as presented to a hearer, 
such explicit segmentation cues are rarely to be found; 
little pauses after every single word might make things clearer, 
but the input is continuous - a running stream of sound. 

This implies that part of listening involves an operation 
whereby input is segmented, to be processed word by word, 
for we cannot hold in memory each total collocation,
as most sentences we come across are previously unheard.

Yet we listeners experience no sense of some dramatic 
act of separating input into pieces that are known; 
as we listen to an utterance it seems unproblematic - 
words in sentences seem just as clear as words that stand alone. 

Just how listeners accomplish such an effortless division 
is a question that psychologists have now begun to solve,  
and this paper will describe (although with minimal precision) 
some experimental studies showing what it might involve. 

The findings, as this summary explains, at once can vindicate 
the order of the problem and the hearer's sense of ease, 
for though speech must be segmented, yet the data plainly indicate 
that rhythm in the input makes segmenting speech a breeze.

If it isn’t clear yet, this is perfectly metered verse with rhyming lines; a poem with a very regular rhythm, typeset as prose. Once this rhythm was pointed out to me (or read out, rather), it was impossible to unsee. I can hear it in my head; when I re-read the original text, it jumps out at me. Yet, first time round, I completely failed to notice it, despite the stark, obvious, unfailing regularity. How did I miss such an obvious pattern? How could anyone?

The passage is from a 1994 Cognition paper by Anne Cutler, entitled ‘The perception of rhythm in language’. Yes, the entire three-page thing is written in verse (and is–in my opinion–a delight to read, once you pick up on the rhythm). No, the vast majority of its readers will not have noticed this fact until the text impishly hints at it in the very last paragraph.

Content-wise, the thrust of the paper is as follows:

1. To understand speech, human brains must somehow chop up continuous audio-stream input into discrete tokens (words) with defined meanings – otherwise they would have to somehow store a giant hash-table mapping every possible sequence of language sounds to its intended meaning, which seems implausible.
   - side-note: This should be quite intuitive, if you’ve ever studied a foreign language: Because. Native. Speakers. Don’t. Speak. With. Pauses. Like. This, pinpointing where exactly in their speech one unknown word ends and where another one begins is among the hardest challenges, when first starting out.
   - second side-note: NLP folks face the same problem, when building speech-recognition models. I don’t know much about how they go about solving it, but one assumes there are parallels there.
2. The paper contends that human brains solve this segmentation problem by attending closely to the rhythm in the audio stream. The exact rhythmic patterns they latch onto differ between languages, but in all cases the rhythm helps define syllable- and word-boundaries, which then allow the stream to be properly chopped up and processed.
3. However, when we read a text, the problem doesn’t arise, because words are separated from each other by spaces – it is already clear how the input should be segmented into semantic units or tokens. So people cease to attend to the rhythm of the language (presumably to save cognitive labour), unless the text is typeset to make rhymes and rhythm obvious.

Now, I don’t know whether humans actually solve the segmentation problem in this manner. I imagine linguists, cognitive scientists and NLP people will have a much more informed opinion on this than I do – in fact, I’m reliably informed this notion is quite outdated. That is not the point of this blog post.

What I took exception to is the third point only – because I just didn’t believe it. Surely, when the rhythm is so blindingly obvious when pointed out, most people will eventually notice on their own, unprompted? What if they’re asked to read it out loud, transforming the text back into speech? What if they do so fluently and proficiently, stressing the words in exactly the right pattern, and can hear themselves doing it – surely then they must notice? Maybe non-native speakers have an advantage, being perhaps less comfortable with the language and relying more on subvocalising (‘reading it out loud in their heads’). Or perhaps natives are more likely to pick up the pattern, fully at home in their language and confronted here with a very regularly patterned, high signal-to-noise ratio anomaly. Surely, surely someone must notice?

Part The Second: The Survey

I’m nothing if not an empiricist, so I conducted a survey among the OPIGlets, none of whom had previously encountered the text. (N=15, though, so take this with a grain of salt.) Participants self-identified as either native or non-native speakers of English, and as either speakers or non-speakers of another (non-English) language. They were then invited to read the above passage (presented as ‘an excerpt from a research article’) either silently or out loud, and to comment on any stylistic peculiarities they might have noticed.

The sample composition was as follows:

	natives	non-natives	TOTAL
speak other language	7	4	11
do not speak other language	4	0	4
TOTAL	11	4	15

Participants were allocated to treatment groups approximately evenly, stratified by native language,

	natives	non-natives	TOTAL
asked to read silently	6	2	8
asked to read out loud	5	2	7
TOTAL	11	4	15

but without regard for whether they spoke a non-English language:

	speak other languages	don’t speak other languages	TOTAL
asked to read silently	5	3	8
asked to read out loud	6	1	7
TOTAL	11	4	15

Part the Third: The Results

First of all, I was shocked –shocked– to find that it really is rare for people to pick up on this. It wasn’t just me. Only one person (6.7% of participants) could specifically point to the fact that the text had a well-defined rhythm, which was the cause of the unusual structure. That person was a non-native speaker, asked to read the text out loud. A native speaker (read silently, spoke no foreign languages) identified individual rhymes, but no overall rhythmic pattern, even when prompted. Another native speaker (read out loud, spoke a foreign language) commented on the use of uncharacteristically poetic language for a research article –noting a less clinical, emotionally evocative style– but did not specifically identify rhyme or metre as the underlying cause, even when prompted. Everybody else missed it entirely. This includes one person who read it out so fluently and with such perfect prosody that any listener would have immediately identified their performance as the recitation of a poem; but no, the reader themselves did not notice.

One person (native, reading silently, spoke a foreign language) offered that the text might be a word-for-word translation of some kind; foreign syntax imposed on English, impeding ‘flow’ in strange ways. A lack of flow was noted by almost all participants (which is ironic, given what is actually going on – I’d argue the text flows rather well). Many commented on the unusual sentence length and perceived the text as (overly) verbose, oddly punctuated, and hard to read.

There’s of course limited insight one can get from a sample this small, but overinterpreting results is fun – so I went ahead and checked for pairwise associations anyway. In order of increasing significance:

Speaks Foreign Language vs. Noticing Rhythm: NOT SIGNIFICANT

	noticed rhythm	did not notice rhythm	TOTAL
speaks foreign language	1	10	11
doesn’t speak foreign language	0	4	4
TOTAL	1	14	15

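import scipy.stats as stats

# Rows: speaks / doesn't speak a foreign language; columns: noticed / did not notice rhythm.
contingency_table = [[1, 10], [0, 4]]
# Fisher's exact test (two-sided by default) suits counts this small.
odds_ratio, p_value = stats.fisher_exact(contingency_table)
print(odds_ratio, p_value)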

> (inf, 1.0)

Reading Mode vs. Noticing Rhythm: VERY PROBABLY NOT SIGNIFICANT

	noticed rhythm	did not notice rhythm	TOTAL
asked to read silently	0	8	8
asked to read out loud	1	6	7
TOTAL	1	14	15

> (0.0, 0.4666666666666667)

Native Speaker vs. Noticing Rhythm: PROBABLY NOT SIGNIFICANT?

	noticed rhythm	did not notice rhythm	TOTAL
native	0	11	11
non-native	1	3	4
TOTAL	1	14	15

> (0.0, 0.26666666666666666)

I dunno, maybe this last one is something to look into with a larger sample… 😉
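
For anyone who wants to check the three p-values above without SciPy, here is a minimal sketch of a two-sided Fisher exact test using only the standard library. The function name `fisher_exact_two_sided` and the `tables` dictionary are my own illustrative choices, not part of any library. The idea is to hold the table margins fixed, compute the hypergeometric probability of every possible top-left cell, and add up the probabilities of all tables that are no more likely than the observed one.

```python
from math import comb


def fisher_exact_two_sided(table):
    """Two-sided Fisher exact p-value for a 2x2 table [[a, b], [c, d]].

    Illustrative helper (hypothetical name), not a library function.
    """
    (a, b), (c, d) = table
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2

    def prob(x):
        # Hypergeometric probability of seeing x in the top-left cell,
        # given fixed row and column totals.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    p_observed = prob(a)
    # Sum over all tables at most as probable as the observed one; the small
    # relative tolerance guards against floating-point ties.
    return sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-7)
    )


# The three contingency tables from the survey, rows and columns as above.
tables = {
    "speaks foreign language": [[1, 10], [0, 4]],
    "reading mode": [[0, 8], [1, 6]],
    "native speaker": [[0, 11], [1, 3]],
}

p_values = {name: fisher_exact_two_sided(t) for name, t in tables.items()}
for name, p in p_values.items():
    print(f"{name}: p = {p:.4f}")
```

With only one person noticing the rhythm, each test boils down to a single fraction: 4/15 for the native-speaker table and 7/15 for reading mode, while the foreign-language table sums to 1. That is also a good reminder of how little a sample of 15 can tell us.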

Part The Fourth: Get to The Point Already

Without, of course, taking my very professional, highly scientific, perfectly engineered study too seriously, I find it fascinating that most people, including myself, seem to consistently miss this pattern, even though it seems blindingly obvious in hindsight (at least to me). For what it’s worth, Claude (sonnet-3.5) also failed to identify the rhythm, though, to be fair, its architecture is not really laid out to notice things in pronunciations it has never heard. I’m half-certain someone out there has done this study properly and I’d be really interested to know if non-natives tend to notice more or less frequently, for example. I also highly recommend the original paper – it’s just a joy to read out loud, and quite short, too.

More importantly, however, there’s a point I want to make about the patterns right in front of us we simply fail to see. A simple change in your perspective’s often all that’s really needed for the pieces to fall into place – whatever they may be.

Oh, look, that last bit rhymed.

Author

Odysseas Vavourakis
