Texts from Swedish Public Employment Service

SND-ID: ext0338-1.

Is part of collection at SND: Parallel Texts from Public Agencies

Citation

Creator/Principal investigator(s)

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Institute for Language and Folklore, Language Council of Sweden

Research principal

Institute for Language and Folklore - Language Council of Sweden rorId

Description

Parallel texts downloaded from the websites of the Swedish Public Employment Agency.

Parallel texts downloaded from the website of Swedish Public Employment Service.
What was actually downloaded were pdf files. The txt files that are available are the result of running the pdf files through the pdftotext command from an ubuntu shell.

Data contains personal data

No

Method and outcome

Data format / data structure

Data collection
Language resources

Resource type

Corpus

Foreseen use

NLP application

Text corpus

  • Linguality

    Multilingual
  • Language

    • Swedish (swe)

    • English (eng)

    • Finnish (fin)

    • Spanish (spa)

    • French (fra)

    • German (deu)

    • Romanian (ron)

    More..
  • Modality

    Written Language
  • Size

    Words: 43207 (swe)

    Texts: 39 (swe)

    Words: 152928 (TOT)

    Texts: 152 (TOT)

  • Original source

    arbetsförmedlingen
    www.arbetsformedlingen.se
Geographic coverage

Geographic spread

Geographic location: Sweden

Administrative information

Responsible department/unit

Language Council of Sweden

Contributor(s)

Institute for Language and Folklore, Language Council of Sweden

Topic and keywords

Research area

Public administration studies (Standard för svensk indelning av forskningsämnen 2011)

Other social sciences not elsewhere specified (Standard för svensk indelning av forskningsämnen 2011)

General language studies and linguistics (Standard för svensk indelning av forskningsämnen 2011)

Keywords

Labour policy

Publications