The Invention of Chinese

Believing language would unify their struggling nation, Chinese officials began a project to create a national language and define what it meant to speak Chinese.

Gina Anne Tam | Published in History Today Volume 71 Issue 1 January 2021

A street in Guangzhou/Canton, 1870s © Bridgeman Images.

The Chinese language is deceptively difficult to define. To speak ‘Chinese’ today usually means Mandarin, the national language of the People’s Republic of China (PRC) and Taiwan. Called Putonghua, or the ‘common language’, in China, or Guoyu, the ‘national language’, in Taiwan, Mandarin is what citizens of Taiwan and the PRC learn in schools and hear on TV and in films and from their political leaders. It is taught in Chinese-language programmes around the world and represents ‘China’ at the UN or WHO.

Yet Mandarin is by no means the only language that could be called ‘Chinese’. Leaving aside the Chinese script, itself having a long, complex history, ‘Chinese’ also refers to a large number of spoken languages that are mutually unintelligible with Mandarin: from the ‘ancient’ Chinese of the Tang dynasty (AD 618-907) to a multitude of local languages. These include well-known ones, like Cantonese or Hokkien, with millions of speakers, and dozens, if not hundreds, of others. The Chinese government officially recognises these languages, from the widely spoken to the smaller variants, as fangyan – a word that dates at least to the second century BC, most frequently translated as ‘dialect’. But such a translation is inherently problematic. Dialects are subordinate – a dialect only makes sense if it is a dialect of something – and are mutually intelligible with the language they are a dialect of and with one another. These criteria do not apply to all, or even most, Chinese fangyan. Cantonese is not mutually intelligible with Shanghainese, nor is Qingdao-ese mutually intelligible with Hakka. And though the Chinese government calls Mandarin a ‘language’, a title not afforded to other fangyan, fangyan are not derived from Mandarin.

Despite this, the Chinese government insists that fangyan ought to be translated using the English word dialect, with all its connotations. Educational materials and popular media call fangyan ‘variants’ or ‘branches’ of the Chinese language and high-profile government figures are often found directly refuting the suggestion that fangyan could be considered independent languages. Mandarin, on the other hand, is officially called the ‘common language of the Han people’ without which society can neither ‘be preserved, develop, or progress’.

Why, then, does the Chinese government insist not only that Chinese fangyan ought to be translated as dialects, but, more importantly, that they should be treated as such in academic research, education and public policy? The presumption that fangyan are subordinate was neither natural nor pre-ordained. Rather, it was a historical process, one that fundamentally shifted the relationship between language, nation and collective identity.

The politics of translation

The word fangyan is composed of fang, ‘place’, and yan, ‘language’; taken together, a literal rendering might be ‘language of place’. For millennia, fangyan was a common term for referring to local languages of particular regions. It rarely carried any hierarchical implications.

When thought of as ‘languages of place’, other English terms might have been more logical equivalents and were, in fact, used for centuries. Matteo Ricci, the first European Jesuit to live in China, in the late 16th century, used the Latin word vernacula, frequently translated as ‘colloquial’, to describe local languages. Later missionaries added ‘vernacular’ and ‘dialect’ to the growing list of terms. By the mid-19th century, the rapidly expanding population of Protestant missionaries, given imperialistic licence to settle in villages throughout the Qing Empire, used these English terms interchangeably. The first written source to directly translate ‘dialect’ as fangyan was probably A Chinese-English Dictionary of 1892, by the British diplomat Herbert Giles.

Matteo Ricci and another Christian missionary to China, from China Illustrata, by Athanasius Kircher, 1667 © Bridgeman Images.

We can attribute some of this uncertainty to the fact that a translation is only as good as the translator’s knowledge and, before the 19th century, Europeans knew very little about China. Faced with the unknown, they drew parallels between their scattered observations and the phenomena they knew at home. Take, for instance, the 19th-century Scottish missionary Carstairs Douglas. Stationed in Amoy (now Xiamen), Douglas looked at the languages around him and noted that they were mutually unintelligible with the language spoken in the imperial capital. He also observed that the local Xiamen language was spoken by members of all classes, not simply the lower classes. As such, in his Dictionary of the Spoken Language of Amoy, he argued that because ‘colloquial’ implied that fangyan were only spoken by lower classes and ‘dialect’ implied they were branches or auxiliaries, neither translation fitted. Douglas ultimately settled on ‘vernacular of’ and ‘spoken language of’ to describe fangyan. But not all missionaries had the same experiences as Douglas.

At their core, disagreements over translations stemmed from the fact that Europeans struggled to define what China’s language was. Was it the formal language that appeared in elite written texts, a tradition that they saw as akin to Latin? Was it the oral language spoken in the imperial capital, what 19th-century Europeans called the ‘language of the Mandarins’? Because European and American observers disagreed on what constituted the ‘Chinese language’ – or if such a thing even existed – there was no clear consensus on how fangyan related to it; whether they were dialects, or something else.

That was, until the early 20th century. The final years of the Qing dynasty marked a watershed, as domestic turmoil encouraged unprecedented creativity in imagining what a modern Chinese nation might look like. This intense experimentation ultimately shattered many well-established sociocultural frameworks and norms, including ideas about language. The meaning of spoken language in China – an integral part of what ‘China’ was – would never be the same.

A ‘Chinese language’

It is frequently taken for granted that the history of Chinese language in the 20th century is first and foremost a history of Mandarin. The commonly told narrative is straightforward. At the turn of the 20th century, China had just endured half a century of rebellion, war and semicolonialism. Convinced that the best way to strengthen and save their nation was to mould the subjects of their fallen empire into a culturally united citizenry, reformers and revolutionaries, supported by the new Republican government, placed the creation and promulgation of a national language at the forefront. Men came from north and south to gather in the capital, where they chose, by vote, a fangyan to serve as the national language. Though the vote was contentious, Beijing was ultimately victorious.

The actual story of how Beijing ‘won’ is far more complex than this retelling may suggest. To imagine the process of creating a Chinese national language as a close vote and a regional power struggle is to ignore how these men actually conceived of a ‘Chinese language’: not as one language among many, but a linguistic representative of the nation’s soul. The question these reformers were asking was not ‘which fangyan do we choose?’ but ‘how do we encapsulate what it means to be Chinese in a spoken language?’

Yuen Ren Chao being awarded an honorary degree at the University of California, 1962 © Jon Brenneis/LIFE/Getty Images.

Indeed, before the 20th century the idea of a singular, spoken Chinese language was a foreign concept. That is not to deny that China has a long history of thinking about its own language. Since at least the second century, dictionaries, rime tables and treatises analysing the Chinese script, grammar, phonology and oral diversity have been written. But while Chinese thinkers frequently mentioned ‘official languages’, these were by no means synonymous with a ‘national language’ – a language unified in its sound and script, used by and representative of a Chinese nation.

This changed in the early 20th century, when the ‘Chinese language’ suddenly transformed from a foreign construct to an almost existential problem. A sequence of military defeats beginning with the First Opium War in 1842 made many elites question where their empire had gone wrong. They began with the obvious suspects. They noted that China’s military technology lagged behind that of their foes and that soldiers lacked the training and discipline necessary for modern warfare. But nearly half a century of lost wars left the Qing beleaguered, forcing many to wonder if the source of these losses was much more fundamental. Some began to argue that the Qing’s weakness lay not in something concrete, such as infrastructure, but in the country’s very cultural anatomy, of which language was a central part.

Unity and disunity

The idea that China’s national weakness stemmed in part from its lack of linguistic unity was something that Western missionaries and old China hands frequently repeated. Some wrote that China’s local patois were ‘so numerous that persons of neighbouring provinces … are frequently unable to carry on a conversation of any length without having recourse to writing’. Others criticised the Chinese script as being ‘hieroglyphics’ that facilitated disunity, arguing that the Roman alphabet would be the ‘readiest gradus ad Parnassum [steps to Parnassus]’ to mutual linguistic intelligibility. Their diagnosis was simple: China could never be a modern nation if it did not have a national, unified tongue.

After the overthrow of the Qing dynasty in 1911, there were few bureaucratic barriers to aggressive administrative action, allowing the founders of the new Republic of China to make language reform a top legislative priority.

Yet the question of how to unify the ‘Chinese language’ had few easy answers. Some, looking at France and Japan, thought that the national language should simply be the language of the seat of government, meaning the national language would be based on the language of Beijing. This proposal, however, was met with fierce opposition. As the former capital of the fallen Qing dynasty and a region that had, for large swathes of Chinese history, been ruled by non-Han Chinese ethnic groups, Beijing was too ‘foreign’ to be an acceptable representative of China. Instead, it was argued that any language that represented the Chinese nation should not simply be chosen from existing fangyan, but constructed as an ideal: a new language that represented the historic unity of the Han Chinese people and the language they spoke at the dawn of Chinese civilisation thousands of years before. After a contentious meeting in 1913, the Committee on the Unification of Pronunciation agreed on the first Chinese national language, adopting an amalgamated, invented construction that was largely based upon the language of Beijing, but included key elements from other fangyan.

A classroom in Beijing, 1959 © Hulton Getty Images.

Inevitably, this national language seemingly born of compromise pleased few people. Those who had been tasked with the practical work of spreading it to the population quickly saw its shortcomings, casting doubt on the government’s ability to teach the nation a language with no native speakers. Even those who initially advocated a bold, experimental language began to question its feasibility. The linguist Yuen Ren Chao, for example, graduate of Cornell and Harvard, was so committed to this hybridised national language that he had made a recording of it to be used in classrooms across China in 1921. But by 1924 he admitted to friends and family that its artificial nature created too high a barrier. Reflecting on his experience, Chao laughed: ‘For 13 years I was the sole speaker of this idiolect, meant to be the national language of four, five, or 600 million speakers.’

And so this dream of an idealised, constructed national language fizzled out. In 1925 the government officially declared that China’s Guoyu (or national language) would be based on the language of Beijing. But, although the reformers abandoned their idealised language in favour of simply making Beijing’s language their national tongue, the unrealised dreams did not simply disappear. The idea that a Chinese language had to represent all of China, not just part of it, continued to imbue language policy.

The constructed language also, in part, laid the foundation for imagining fangyan as subsidiary branches of a Chinese language. In conceiving of the Chinese language as something rooted in and connected to all of China’s fangyan, it made logical sense to think of fangyan as pieces or branches of it, subsidiary to the national language. The project’s failure did not erase the hierarchical implications it seeded.

Making dialects

Calling the language of Beijing the national language of China when, linguistically, it was just another fangyan, did not change the linguistic landscape; the other fangyan did not become its variants or subsidiaries. Thus, in order for the hierarchy between language and fangyan to make sense, the general public had to be convinced that their new national language could represent the whole nation. This meant, too, that the connotations inherent in the word ‘dialect’ – hierarchy and dependency – had to become integral to what a fangyan was.

The transformation of Beijing fangyan into a national language was partly done through public policy. The government decreed that the national language should be taught in schools, encouraged its use in radio and cinema and supported magazines, such as National Language Weekly, which defended government policy and offered short, accessible language lessons. By the 1930s these encouragements became threats, as Chiang Kai-shek’s government attempted to censor cinema in other Chinese fangyan, targeting in particular the thriving Cantonese film industry in Guangzhou.

These policies were not very effective at getting people to speak the national language. Most children did not attend schools in 1930s China and, for those who did, the central government did not have the reach to regulate what language teachers used in the classroom. And, while the ban on Cantonese films sparked fierce debates among film-makers and cultural critics, it was difficult to enforce. Many film-makers moved their production to the British colony of Hong Kong or simply aired the films without sending them in for government approval.

A scene from rural China, where few people spoke the national language during the Maoist years, Hunan Province, 1958 © Ullstein Bild/Getty Images.

But, despite the fact that the government could not enforce these policies, their mere existence still reinforced a hierarchy. Whether people spoke it or not, Beijing’s fangyan was no longer just Beijing’s fangyan; it was now the basis of the language of the nation. Guoyu, ‘National Language’, became a household term, peppered throughout newspapers, discussed on radio and repeated in children’s textbooks. Regardless of who spoke it, its position as the national standard was normalised by implicitly defining everything else as non-standard subsidiaries.

Language families

Sometimes the hierarchy between national language and fangyan was reified in more subtle ways. In the early 1920s a group of Chinese linguists began to advocate the introduction of a more ‘scientific’ study of languages at Chinese universities. These men, many of whom had received their doctorates in the United States or Europe, claimed that it was a simple, objective truth that human languages existed upon a taxonomic tree, all connected to one singular root. As Beijing University Professor of English Lin Yutang explained: ‘There should be no confusion as to the definition of fangyan. The world’s languages are connected in one system, called a yuyanxi [family of languages]. Language families are then divided into yuyan [languages], and within each language there are divisions of fangyan [dialects].’

Here Lin makes a direct argument for the equivalence of dialect and fangyan. But it was not just that he thought ‘dialect’ was the best translation for fangyan; he believed the term fangyan ought to carry all the implications of the English term ‘dialect’, regardless of local context. Most Chinese linguists agreed and designed their research methods around that presumption. Yuen Ren Chao published the first full-length fangyan survey, Study on the Modern Wu Dialect, in 1928, in which he sought out speakers in towns throughout the Yangzi River Delta (near Shanghai), asking them to read aloud a list of nearly 2,700 Chinese characters. After recording their pronunciation, Chao then arranged the data into charts that compared the pronunciation of each of the characters from one area of the region to another.

Chao claimed to be a scientist and believed that any good scientific comparative study should include a ‘constant’ – a set of pronunciations that would ground the data for his reader and give them a point of comparison. While his first study had three ‘constants’, after 1930 it became the norm for both Chao and other scholars who followed him to only use the pronunciation of Beijing fangyan, which he called ‘national phonetics’. On charts of surveys from Zhongxiang county and Nanjing city, he juxtaposed the local fangyan he was surveying with the national language.

It is important to reiterate that the national language was not necessarily more justifiable as a scientific constant than any other. But, because of this choice, Chao and his colleagues granted the national language a status that was more than just another fangyan and even more than just a national representative. They normalised the hierarchy between national language and fangyan by encasing it in the veneer of objective science.

Chao’s hierarchical models informed national language policy even after the Communist revolution of 1949. The Chinese Communist Party renamed Guoyu as Putonghua. Like their predecessors, they viewed fangyan as hierarchically subordinate to the national language. The way Putonghua was promulgated was different, though.

In 1956, the central government called for scholars to descend on the countryside to conduct a standardised nationwide fangyan survey. They recorded the local language of each township, village and district and used their data to create new, locally specific textbooks for teaching children Putonghua.

In this survey not only was the national language taken as a scientific constant, its entire purpose was to define all fangyan in relation to the national language. Researchers were required to publish their results in handbooks, designed solely to help ‘correct pronunciation’ of the national language by ‘fixing’ pronunciation problems particular to speakers of that fangyan.

Within decades, the fact that fangyan were mutually unintelligible languages, and the fact that the national language was simply one Chinese language among many, no longer mattered to how they were conceived. Fangyan had, in the eyes of many, simply become dialects.

‘I love Cantonese’

But why does this matter? That one term was translated into another does not seem to be of vital importance. Translations do not cause governments to crumble or start wars. But the words we use bind observable things to a series of assumptions, ideas and cultural touchstones and, as such, they frame our thinking and guide our actions. To presume that fangyan had all of the same connotations as dialects is to force China’s diverse linguistic landscape into a hierarchy in which all but one are subordinate. To give the title of ‘national language’ to something nearly synonymous with Beijing’s fangyan is to elevate that fangyan to a national and global significance that cannot be afforded to any other Chinese language. The history of this translation shows that these presumptions were not predetermined: they were the product of complex, historical processes.

The presumption that fangyan are dialects is also significant because it shows that fangyan’s subordination is not limited to linguistic structure alone. Calling one language a national ‘standard’ and everything else ‘variants’ implies that the national language alone can represent a unified sense of identity and citizenship. Ultimately, to speak a language is to own a particular kind of cultural power. To speak a dialect is to settle for an expression of identity that is limited in its scope and diminished in its significance.

Perhaps because of the inherent power imbued in the term ‘language’, the framing of fangyan as ‘dialects’ has always had critics and has sometimes elicited outright protest. In 2010, for example, protestors wielding signs that declared ‘I love Cantonese’ gathered in Guangzhou’s People’s Park to protest against a decrease in television offerings in the local Cantonese. And in 2016, an announcement by a consultant of Mandarin promulgation in Hong Kong that Cantonese ‘cannot be a mother tongue’ elicited a fierce online backlash, with critics arguing that Cantonese is not a fangyan but a language with a rich history that far predates Mandarin. In each of these instances of dissent, some overt, others subtle, the emotional stakes of the protests are palpable. They make clear that policies that reject the status of language simultaneously reject expressions of identities, that policies that diminish the value of native tongues diminish the value of their speakers.

In this way, the question of what counts as a ‘language’ is a battle over a place in a cultural hierarchy. It is a battle over whose language is more important to the nation and who has the power to define it. The language we speak carries enormous significance for how we see ourselves as people. It is no wonder, then, that the history of how fangyan became dialects is about much more than simply translation. It is a story about what it means to be Chinese and who has the right to represent it.

Gina Anne Tam is assistant professor in History at Trinity University, Texas and the author of Dialect and Nationalism in China, 1860-1960 (Cambridge, 2020).
