Separated by a Common Language: computers

Showing posts with label computers. Show all posts

Book Week 2019: Gretchen McCulloch's Because Internet

Dec 2, 2019

Welcome to the final review post of Book Week 2019. In the intro to Book Week 2019, I explain what I'm doing. The 'week' has turned out to be eight days. If you're perturbed about that, I'm happy to offer you a full refund on your subscription fees for this blog.

On with the show. Today's book is:

Because Internet

Understanding the new rules of language (US subtitle)
Understanding how language is changing (UK subtitle)

by Gretchen McCulloch
Riverhead, 2019 (N America)
Harvill Secker, 2019 (UK)

Gretchen McCulloch describes herself as an internet linguist: writing about internet language for people on the internet. She actually does a lot more than that, with daily blogging at All Things Linguistic for years and being one half of the Lingthusiasm podcast team and writing on all sorts of linguistic themes for all sorts of publications. So, I expect many readers of this blog will already know her and have heard about this book.

US Cover

I expected Because Internet to be good, knowing Gretchen's work, but I also probably (in my grumpy, middle-aged, oh-do-we-have-to-talk-about-emojis-again? way) expected it to be faddish. There have been too many just-plain-bad, (orig. AmE) jumping-on-the-bandwagon books about emojis, and I've got(ten) a bit sour on the topic.

This book is so much more than I expected it to be.

I should have known better. Having read and heard much of her work, I should have expected that this would be a truly sophisticated approach to language and to general-audience linguistics writing. So far in Book Week 2019, I've recommended the books as gifts for A-level students/teachers, science lovers, and language curmudgeons. This book is good for all those groups and more.

UK cover

The key is in the subtitle(s).* This is not just a book about emojis and autocomplete (and, actually, autocomplete isn't even in the index). This is a book about the relationship between speech and writing and how that's changed with technology. It seamlessly introduces theories of why language changes, how change spreads and how communication works in a time when the potential for change is high and the potential for changes to spread is unprecedented.

That seamless introduction of linguistic concepts is the reason I've started this book from the beginning and not skipped around (unlike for other books in Book Week—where the rule is that I don't have to read the whole book before I start writing about it). In most books about language for non-linguists, I'm able to skim or skip the bit where they talk about the basics of how language works and the classic studies on the topic and the ideas springing from them. McCulloch covers those issues and those studies (the Labovs, the Milroys, the Eckerts), but since this is intertwined with looking at how language is changing in the 21st century—because (of the) internet—it was worth my while to read straight through. The great thing about the language of the internet is: even when it looks really different from non-internet language, it's still illustrating general principles about how language, communication, and society work. But it also shows how society is changing because of technology, particularly in changing who we are likely to interact with or hear from, In the process, it gives a history of the internet that's enlightening even for those of us who've lived through it all. (I've just flipped open to a section about PLATO at the University of Illinois. One of my student jobs was working in a PLATO lab, playing Bugs-n-Drugs [aka Medcenter] while signing people in and out. That game was not good for my hypochondria, but I have awfully fond memories of PLATO.)

Another thing to appreciate about McCulloch's book is how unreactionary it is. She doesn't set up her discussion as "You've heard people say these stupid things about the internet, but here's the TRUTH." (A style of writing that I can be very, very guilty of.) She mostly just makes her case gracefully, based on what the language is doing, rather than reacting to what other people say the language is doing. Rather than 'This, that and the other person say emoji are a new language, but they're not', she just gets on with explaining how emoji fulfil(l) our communicative need to gesture. It's a positive approach that academic linguists will have had trained out of them by the requirements of academic publishing.

This is a bit of a nerdview 'review'. Usually reviews tell you some fun facts from the book they're reviewing, whereas I'm telling you what I've noticed about its information structure. That's because that's what I really look for in books as I prepare to write a new one. In terms of information, in this book you'll learn, among other things:

which "internet generation" you belong to and how your language is likely to be different from other generations'.
what punctuation communicates in texting/chat and how that differs from formal writing
how language change can be traced through studying strong and weak social links and geographic tagging on Twitter

Inevitably, the book is mainly about English, in no small part because English rules the internet. But it does make its way to other languages and cultures—for instance, how Arabic chat users adapted their spelling to the roman alphabet and how emojis are interpreted differently around the world. In the end, she briefly considers whether space is being made for other languages on the internet.

It's a galloping read and you'll learn all sorts of things.

So, on that happy review, I declare Book Week 2019 FINISHED.

* I love the transatlantic change in subtitles, since it completely illustrates the point of chapter 8 of The Prodigal Tongue: that Americans like to talk about language in terms of rules, and Britons in terms of history/tradition. I've also written a shorter piece about my personal experience of it for Zócalo Public Square.

Labels: books , computers | 3 comments

Descriptions in Twitter profiles

Oct 26, 2014

When Twitter tells me I have new followers, I can see their name and self-description before I can see their location (if they've given any). So I play a little game of 'guess which country they're from' before I click through to see it. I seem to be good at picking out the Americans (or at least North Americans--the Canada/US distinction is hard to make here--sorry Canadians), based on the style of the name and self-description. To be fair, I'm probably guided by the photos too. (Pick out the Americans at the airport is another fun and not-too-difficult game. There, you can see the red maple leaf patches on all the Canadians' bags, which save them from the lumping-together.) But I'm a linguist, so I like to think it's the language I'm sensitive to.

(A postscript on 27 Oct 2014: In the comments, Dorothy Bishop reminds me of a post she did three years ago that was in the back of my head when I started this, but I failed to find it in my preliminary search. I didn't want to take the chance of you missing that if you don't read the comments. So, if you like this post, you'll love this one.)

Because I probably should have been doing something else, I decided to try to test 'what marks an American (versus British) Twitter profile'. Here's my method:

I worked backwards from recent followers using the 'Who Unfollowed Me' (Pro) list of followers whom I don't follow back. I used this because it does the opposite of Twitter: it shows me the location, and I have to click through for the description.

For each follower who (a) was a person, not a company, and (b) unambiguously listed their location as being in the US or UK, I recorded:

country: I only included people whose locations were unambiguous, so no London-Islamabad-Hong Kong multiple locations and no indications of internationality in the descriptor, such as An American in London

gender: by name/photo/description (female, male, unknown/other)

Twitter handle: does it reflect their name? Three possible values:

Yes/name: the handle is some version of their name or their name + numbers, e.g. @lynne_murphy, @LynneM34, @Lynney, etc.
Mixed: part name/part descriptor, e.g. @LynneLinguist, @LynneEdits, @LordLynne, @CrankyLynne
Not name: e.g. @poltroonish, @LinguistYay, @subjunctiverobot

Number of self-descriptors: this is the tricky one. Basically, I counted nouns or verbs that constituted separate descriptions of the person, so:

Writer, teacher, blogger, linguist, parent, feminist, Scrabble player: counts as 7.
Loves cooking, dreaming, whittling, singing: counts as 4.
Teacher of ESL, EFL and Intercultural Communication counts as 1--the main identity is 'teacher'.
Dreams are sometimes songs: counts as 0, since if it's label(l)ing the person, it's very indirect.
An empty description also counts as 0, but I had a separate sub-category in which I distinguished the content-ful and content-less zero scores.

Caveats: I also kept track of who said things like "RTs are not endorsements" or "All views are my own", but there were some in each nationality and not enough of these to warrant further analysis.

I did this for two notebook pages for each nationality, which totals 64 from each country. More would have been better, but I can only justify a certain amount of procrastination per day.

Then I did some math(s).

A big caveat here is that I'm only sampling my own followers, which is to say it's probably people with an interest in language or US/UK issues, possibly a bit older and educated than average. So this might not be generali{s/z}able to US/UK Twitter users generally. The other caveat, of course, is that I'm equating location with culture. For all I know, half of the people who listed their location as 'London' are exchange students who aren't mentioning that they're from Iceland. But I'm working with what I've got, and we can only hope that the rate of 'false positives' in one country's data is matched in the other country's data.

All examples in this post are made up, mimicking profiles I read. I don't want people to feel like I'm giving any individuals a hard time. Or to [orig. AmE] out them to their friends and family as someone who follows me. If it so happens that I've made up a handle or a profile that actually exists, that's accidental.

So this is what I found:

Gender
Overall the sample had more women than men, which is to be expected because 62% of Twitter users are female (according to one study). (For this sample it's 56% female. The table below has raw numbers.) The gender breakdown was similar across nationalities, so whatever we see here is more likely to be a national effect rather than a gender effect.


		gender
		F	M	O
	US	37	26	1
	UK	34	25	5
Total		71	51	6

Handle = name?
I was interested in the name versus pseudonym issue because, as we've seen before, Americans introduce themselves by name earlier in conversation than Brits do. So, I wondered, are the British more careful about giving out names on Twitter? The caveat for this result is that I have no way of knowing whether the handles people use are their real names. If someone's name was presented as 'Gemma Thornton-Baker' and her handle is @gemmatbkr, then the fact that the handle matched the name meant that I counted this as a name, rather than a pseudonym. If their name was presented as 'Hunky Cloud' or their handle was @rottenweather, then I took it for granted that their name wasn't really Rotten Weather.

But after all this preamble, I'm not even going to bother to put together an HTML table of results because the numbers were exactly the same for US and UK. So, hypothesis that Brits would be less apt to use their name as a handle was not supported.

Self-descriptions
On to the meat of what I wanted to look at. Remember, I'm not testing word-count of the self-descriptions, but the number of separate descriptions given--a single description may be one or ten words long. So, this isn't about how much one says, but how many different things one says. My hypothesis was that Americans list more different things, divulging more about themselves.

The result favo(u)red the hypothesis, in that Americans listed, on average, 3.58 descriptors and the British 2.78. The range was exactly the same: 0 to 14 descriptors. The US median was 3 and the UK median was 2.

But although the numbers were in the right direction for the hypothesis, they are only significant at p=.100, which means, basically, that there's a 10% likelihood that the difference is down to chance. We'd probably have a better answer if I'd looked at more than 64 people per country. Which is why I'm going to point this out to our students who are currently looking for research projects to do...

Labels: communication , computers , epithets , names , project ideas | 17 comments

-og and -ogue

Mar 4, 2013

Rachel Ward aka @FwdTranslations just asked me via Twitter:

Trying to check usage of epilog(ue) and prolog(ue) in US spelling. Seen suggestion that "ue" forms still more widely used. True?

And I felt the need to blog this immediately, since this is something that niggles me about British understanding of US spelling sometimes. I am often being told that Americans don't write catalogue, they write catalog. The same for dialogue/dialog. But, the thing is, I've always (or at least since I was a grown-up) used the -ue in all of them. Because the shorter forms are only American, from the British perspective, the shorter forms are "the American spelling". But from the American perspective, most wouldn't consider the longer forms to be "the British spelling" in the same way that we'd consider colour or centre as British spellings. They're just alternative spellings, listed in American dictionaries without any dialect marking. Noah Webster is generally credited/blamed for these kinds of 'shortenings' in AmE, but he used dialogue in at least the earliest edition of his Blue-Backed Speller. The move for this change seems to have come later, in the period when Melvil(le) Dewey (he of the Dewey Decimal System) was a leading spelling-reform advocate. In an article in Verbatim on The American Spelling Reform Movement, Richard Whelan writes:

During the 1890s, a few state legislatures passed bills calling for simplified spelling to be taught in public schools, and the prestigious American dictionaries began to acknowledge the call for reform, first by listing simplifications in appendices, and eventually transferring some to the main entries as acceptable alternatives.

The turning point came in February 1897, when the National Education Association (NEA) resolved that all of its official correspondence and publications would thenceforth use simplified spellings for twelve words: catalog, decalog, demagog, pedagog, prolog, program, tho, altho, thoro, thorofare, thru, and thruout. This move brought the issue of spelling reform to wide public attention and forced even many conservatives to take seriously what they had previously dismissed as the folly of cranks

But note that the ue-less forms have pre-American precedent. For instance, the OED notes that from Middle English to the 16th century dialogue was mostly dialoge (as it was in the French of the time), and sometimes dialog. The spelling dialogue is really only seen after this, following a spelling change in French.

So, for fun, here's how some of these spellings fare in the Corpus of Contemporary American English and Noah Webster's namesake, the Merriam-Webster (online) Dictionary. The middle two columns give the raw numbers of how many of each spelling is found for the singular noun form of each of these words. The last column says which spelling is given first by Merriam-Webster.

	-ogue	-og	M-W
catalog(ue)	2559	4955	catalog
dialog(ue)	12657	702*	dialogue
epilog(ue)	490	8	epilogue
monolog(ue)	1098	7	monologue
pedagog(ue)	56	0**	pedagogue
prolog(ue)	890	3	prologue
analog(ue)	1554	306	analogue***

HTML Tables

So THE ONLY ONE that is more frequently used in the shorter form in AmE is catalog(ue), and even then, the longer form is well represented. In other words, the most commercial term is the most likely to use the shorter form. [And, afterthought: also the one that is closest to Dewey's heart, as a library term.] Despite the National Education Association's example, this spelling reform has not been wholly successful.

Some footnotes to the table:

*The case of dialogue is interesting because of dialog box, which is spelled/spelt without the -ue in computer jargon in both countries. This is like the case of program, which is longer (programme) in most senses in BrE, but which uses the shorter (AmE) form for the computer sense. (And color in html and so forth. One could say that America runs computing jargon, or one could say that programmers prefer shorter and consistent forms. Or one could say it's a bit of both.) Anyhow, 375 (53%) of the 702 cases of dialog here are in the phrase dialog box and its variants (dialog boxes, dialog box-in). (There are also 18 cases of dialogue box[es].) So, this means that outside this two-word compound, dialogue outnumbers dialog in AmE by 38:1.

** There was one case of pedagogs in COCA. There were 0 cases of demagog or demagogs. So, while M-W lists these as variants, they don't seem to have made deep inroads into the written language.

*** I meant to do analog(ue) too, and was reminded of it when commenters started asking for it, so here it is, several hours later. ~~This one is noteworthy because~~ M-W says for the adjective that analogue is a 'chiefly British variant' of analog, rather than just listing it as an alternative spelling, but for the noun sense it has analogue as the preferred spelling for the noun--which is in contrast with the numbers from COCA [thanks to @empty in the comments for pointing out my error]. Like catalog and dialog box, its "technological" senses are more common. So we have a general pattern here of literary words keeping the -ue and more techie stuff dropping it.

My to-do list says that I'm (BrE) marking/(AmE) grading this morning. Please don't tell my to-do list that I was here.

Labels: computers , more complicated than you might think , shopping , spelling | 104 comments

Subscribe to: Comments ( Atom )

Separated by a Common Language

Observations on British and American English by an American linguist in the UK

Book Week 2019: Gretchen McCulloch's Because Internet

Because Internet

Understanding the new rules of language (US subtitle)
Understanding how language is changing (UK subtitle)

by Gretchen McCulloch
Riverhead, 2019 (N America)
Harvill Secker, 2019 (UK)

Descriptions in Twitter profiles

-og and -ogue

The book!

View by topic

Find me elsewhere on the internet!

Linguistic Toolbox

Abbr.

Book Week 2019: Gretchen McCulloch's Because Internet

Because Internet

Understanding the new rules of language (US subtitle)Understanding how language is changing (UK subtitle)

by Gretchen McCullochRiverhead, 2019 (N America)Harvill Secker, 2019 (UK)

Descriptions in Twitter profiles

-og and -ogue

The book!

Subscribe to

View by topic

Find me elsewhere on the internet!

Linguistic Toolbox

Abbr.

Understanding the new rules of language (US subtitle)
Understanding how language is changing (UK subtitle)

by Gretchen McCulloch
Riverhead, 2019 (N America)
Harvill Secker, 2019 (UK)