LDC Catalog
WFU is a member of the Linguistic Data Consortium (LDC), starting with the 2012 membership year. You can request access to a 2012 (or later) corpus by emailing Carol.
WFU has already downloaded some corpora. They are grouped in the list below by the language of the corpus:
You need to be on campus -OR- on VPN to download the files. Contact Carol for more information on how to get access to these data sets.
When searching, it can be tricky to formulate your search to distinguish articles that discuss corpus linguistics as a methodology vs. applications of corpus linguistics. LLBA provides some subject headings that can help with this.
First, make sure you're using the Advanced Search screen.
Choose Subject Heading (all) — SU in the first drop-down menu
In the search box type:
Make sure you include the quotation marks.
In addition, a keyword search for corpus approach (no quotes) tends to return applied articles, and corpus methodology (no quotes) tends to return methodological articles.