Text corpus: Academic texts – Humanities

SND-ID: ext0100-1.

Access to data via

Creator/Principal investigator(s)

Markus Forsberg - University of Gothenburg, Swedish Language Bank

University of Gothenburg, Swedish Language Bank

Research principal

University of Gothenburg - Swedish Language Bank


A corpus of academic texts in the humanities (14,471,177 tokens, 673,820 sentences). The corpus can be searched through the Språkbanken Korp interface: http://spraakbanken.gu.se/korp/#lang=eng.

License: CC-BY

Data contains personal data


Method and outcome

Unit of analysis

Data format / data structure

Data collection

Geographic coverage

Geographic spread

Geographic location: Sweden

Administrative information

Responsible department/unit

Swedish Language Bank

Topic and keywords

Research area

General language studies and linguistics (Standard för svensk indelning av forskningsämnen 2011)

Language and linguistics (CESSDA Topic Classification)