Skip to main content

Mink

What is Mink?

Språkbanken supports research by providing data and developing language technology analyses and research platforms. Mink lets you use those analyses and platforms for data resources that you create yourself.

Use Mink at spraakbanken.gu.se/mink

Who can use Mink?

Anyone with an eduGAIN account can use that to login to Mink. This includes most people associated with a university or other academic institution. Other users can create an account at eduID which, itself, is connected to eduGAIN.

We are working toward offering a "demo version" that can be used without login.

What can Mink do?

Mink can manage Corpora and Lexicons.

Basic workflow

  1. Create resources and upload source files
  2. Configure and run analysis
  3. Download result files
  4. Install results in other platforms
  5. Invite selected peers to explore or edit resources

With corpora

  • File format for source files:
    • Text: plain text, XML, Microsoft Word (.docx), Open Document (.odt), PDF or CoNLL-U
    • Audio (using automatic speech recognition): WAV, MP3 or OGG
  • Analyses:
    • Part-of-speech tags (POS)
    • Base form (lemma)
    • Morphosyntactic tags (MSD)
    • Dependencies
    • Named entity recognition
    • See full list in the sbx-swe-mink_analyses analysis collection
  • File format for result files: extracted/transcribed plain text, XML, CoNLL-U, CSV, frequency list
  • Install results in Korp and Strix

With lexicons