Overview
- Provides a working model which can be used for creating other linguistic corpora in any language
- Designed to be of value and relevance to those interested in critically engaging with corpus methodology
- Uses as a case study a unique, large-scale, real-world corpus of Welsh
Access this book
Tax calculation will be finalised at checkout
Other ways to access
Table of contents (5 chapters)
Keywords
About this book
This book aims to provide a micro-level, working model of a methodological approach and practical guidelines for building a corpus, informed by the work on the CorCenCC project (Corpws Cenedlaethol Cymraeg Cyfoes - the National Corpus of Contemporary Welsh). It focuses specifically on the development of detailed design frames for corpora across communicative modes (spoken, written and e-language), and the practical processes involved in the planning, collection, transcription, collation and (re)presentation of language data. The book is designed to be of significant value and relevance to those interested in critically engaging with corpus methodology. Although Welsh is the language under discussion, the processes and approaches discussed in the building of CorCenCC can be applied to a lesser or greater extent to other language contexts. This book provides a working model, and an account of how to build a corpus dataset from which step by step guidelines for creating other linguistic corpora in any language can be easily extrapolated. It will be of value to students and scholars of minority languages and corpus linguistics.
Reviews
Authors and Affiliations
About the authors
Dawn Knight is a Reader in Applied Linguistics at Cardiff University, UK, and Chair of the British Association for Applied Linguistics (BAAL). She was the Principal Investigator (PI) of the CorCenCC (National Corpus of Contemporary Welsh) project.
Steve Morris is an Honorary Research Fellow in Applied Linguistics at Swansea University, UK. He was a co-investigator on the CorCenCC (National Corpus of Contemporary Welsh) project.
Laura Arman is a Research Associate at Cardiff University, UK. Her research is centred on the linguistics of minoritised languages, with a focus on her native language of Welsh.
Jennifer Needs works for the UK Civil Service as a Welsh language translator. She previously worked at Swansea University and Cardiff University as a Welsh-medium researcher.
Mair Rees gained her PhD in Welsh literature from Cardiff University, UK in 2012. She was a creative editor for amajor Welsh-language publisher for four years before joining the CorCenCC team as a research assistant. She currently works part-time as a translator in addition to running her own small business.
Bibliographic Information
Book Title: Building a National Corpus
Book Subtitle: A Welsh Language Case Study
Authors: Dawn Knight, Steve Morris, Laura Arman, Jennifer Needs, Mair Rees
DOI: https://doi.org/10.1007/978-3-030-81858-6
Publisher: Palgrave Macmillan Cham
eBook Packages: Social Sciences, Social Sciences (R0)
Copyright Information: The Editor(s) (if applicable) and The Author(s), under exclusive licence to Springer Nature Switzerland AG 2021
Hardcover ISBN: 978-3-030-81857-9Published: 09 October 2021
eBook ISBN: 978-3-030-81858-6Published: 08 October 2021
Edition Number: 1
Number of Pages: XIV, 183
Number of Illustrations: 8 b/w illustrations, 14 illustrations in colour
Topics: Research Methods in Language and Linguistics, Applied Linguistics, Digital Humanities, Language Policy and Planning