Name: Building a National Corpus
ISBN: 978-3-030-81858-6

Overview

Authors:

Dawn Knight ⁰,
Steve Morris ¹,
Laura Arman ²,
Jennifer Needs ³,
…
Mair Rees ⁴

Dawn Knight
1. School of English, Communication & Philosophy, Cardiff University, Cardiff, UK
View author publications

You can also search for this author in PubMed Google Scholar
Steve Morris
1. School of Culture and Communication, Swansea University, Swansea, UK
View author publications

You can also search for this author in PubMed Google Scholar
Laura Arman
1. Wales Institute of Social and Economic Research and Data, Cardiff University, Cardiff, UK
View author publications

You can also search for this author in PubMed Google Scholar
Jennifer Needs
1. Cardiff, UK
View author publications

You can also search for this author in PubMed Google Scholar
Mair Rees
1. Newport, UK
View author publications

You can also search for this author in PubMed Google Scholar

Provides a working model which can be used for creating other linguistic corpora in any language
Designed to be of value and relevance to those interested in critically engaging with corpus methodology
Uses as a case study a unique, large-scale, real-world corpus of Welsh

638 Accesses
5 Citations
12 Altmetric

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 49.99

Price excludes VAT (USA)

Hardcover Book USD 64.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (5 chapters)

Front Matter

Pages i-xiv

Download chapter PDF
Understanding the Language Context
- Dawn Knight, Steve Morris, Laura Arman, Jennifer Needs, Mair Rees
Pages 1-22
Design Frames
- Dawn Knight, Steve Morris, Laura Arman, Jennifer Needs, Mair Rees
Pages 23-74
(Meta)Data Collection
- Dawn Knight, Steve Morris, Laura Arman, Jennifer Needs, Mair Rees
Pages 75-103
Processing and (Re)presenting Corpora
- Dawn Knight, Steve Morris, Laura Arman, Jennifer Needs, Mair Rees
Pages 105-135
Results, Reflections and Lessons Learnt
- Dawn Knight, Steve Morris, Laura Arman, Jennifer Needs, Mair Rees
Pages 137-169
Back Matter

Pages 171-183

Download chapter PDF

Keywords

About this book

This book aims to provide a micro-level, working model of a methodological approach and practical guidelines for building a corpus, informed by the work on the CorCenCC project (Corpws Cenedlaethol Cymraeg Cyfoes - the National Corpus of Contemporary Welsh). It focuses specifically on the development of detailed design frames for corpora across communicative modes (spoken, written and e-language), and the practical processes involved in the planning, collection, transcription, collation and (re)presentation of language data. The book is designed to be of significant value and relevance to those interested in critically engaging with corpus methodology. Although Welsh is the language under discussion, the processes and approaches discussed in the building of CorCenCC can be applied to a lesser or greater extent to other language contexts. This book provides a working model, and an account of how to build a corpus dataset from which step by step guidelines for creating other linguistic corpora in any language can be easily extrapolated. It will be of value to students and scholars of minority languages and corpus linguistics.

Reviews

"The Welsh National Corpus is an important state-of-the-art of corpus. Written with great precision and honesty, this book brings the reader inside the anatomy of the corpus and in doing so creates both an important archive of decision-making and protocols. Cutting-edge insights are also presented through case studies on spoken, written and e-language data form the corpus. For anyone attempting to build a corpus, this book is essential reading."-Anne O’Keeffe, University of Limerick, Ireland

Authors and Affiliations

School of English, Communication & Philosophy, Cardiff University, Cardiff, UK

Dawn Knight
School of Culture and Communication, Swansea University, Swansea, UK

Steve Morris
Wales Institute of Social and Economic Research and Data, Cardiff University, Cardiff, UK

Laura Arman
Cardiff, UK

Jennifer Needs
Newport, UK

Mair Rees

About the authors

Dawn Knight is a Reader in Applied Linguistics at Cardiff University, UK, and Chair of the British Association for Applied Linguistics (BAAL). She was the Principal Investigator (PI) of the CorCenCC (National Corpus of Contemporary Welsh) project.

Steve Morris is an Honorary Research Fellow in Applied Linguistics at Swansea University, UK. He was a co-investigator on the CorCenCC (National Corpus of Contemporary Welsh) project.

Laura Arman is a Research Associate at Cardiff University, UK. Her research is centred on the linguistics of minoritised languages, with a focus on her native language of Welsh.

Jennifer Needs works for the UK Civil Service as a Welsh language translator. She previously worked at Swansea University and Cardiff University as a Welsh-medium researcher.

Mair Rees gained her PhD in Welsh literature from Cardiff University, UK in 2012. She was a creative editor for amajor Welsh-language publisher for four years before joining the CorCenCC team as a research assistant. She currently works part-time as a translator in addition to running her own small business.

Bibliographic Information

Book Title: Building a National Corpus
Book Subtitle: A Welsh Language Case Study
Authors: Dawn Knight, Steve Morris, Laura Arman, Jennifer Needs, Mair Rees
DOI: https://doi.org/10.1007/978-3-030-81858-6
Publisher: Palgrave Macmillan Cham
eBook Packages: Social Sciences, Social Sciences (R0)
Copyright Information: The Editor(s) (if applicable) and The Author(s), under exclusive licence to Springer Nature Switzerland AG 2021
Hardcover ISBN: 978-3-030-81857-9Published: 09 October 2021
eBook ISBN: 978-3-030-81858-6Published: 08 October 2021
Edition Number: 1
Number of Pages: XIV, 183
Number of Illustrations: 8 b/w illustrations, 14 illustrations in colour
Topics: Research Methods in Language and Linguistics, Applied Linguistics, Digital Humanities, Language Policy and Planning

Publish with us

Policies and ethics

Building a National Corpus

Overview

Access this book

Other ways to access

Table of contents (5 chapters)

Front Matter

Understanding the Language Context

Design Frames

(Meta)Data Collection

Processing and (Re)presenting Corpora

Results, Reflections and Lessons Learnt

Back Matter

Keywords

About this book

Reviews

Authors and Affiliations

School of English, Communication & Philosophy, Cardiff University, Cardiff, UK

School of Culture and Communication, Swansea University, Swansea, UK

Wales Institute of Social and Economic Research and Data, Cardiff University, Cardiff, UK

Cardiff, UK

Newport, UK

About the authors

Bibliographic Information

Publish with us

Search

Navigation