

Språkbanken Text is a research infrastructure for language data. We create and maintain language data analysed with state-of-the-art language technology methods for researchers everywhere, and take pride in providing free and open access where possible.

Mink is our effort to put Språkbanken Text’s research infrastructure into the hands of researchers. You can use Mink to apply our language technology methods to texts that you have collected yourself. The resulting data can be downloaded or made available, behind login, through our research tools such as Korp and Strix.

Your source data and the results are private (see Privacy and data policy). Forthcoming versions will include features for sharing datasets with teams and publishing them publicly.

Read more in the Mink documentation

Sign in
Most university users can sign in directly.
Most non-university users need to set up an account first.

Configurable language technology analysis

Your language data are analysed by Sparv, our language technology platform, according to your settings.


Deposit your text

Upload your word-processor documents, PDFs, plain text files or XML data.

When you upload XML, the output of the analyses is added while the input structure is kept intact. See the blog entry "Using Mink with custom annotations in source" (in Swedish).


Explore in Korp and Strix

Export the resulting data to Korp and Strix for search and statistical analysis.


Share and collaborate

Invite fellow researchers to view your data in our research tools, protected by login. If you want to share your data with the research community, contact us to discuss publication.

These features are scheduled for upcoming versions of Mink.

Mink or no Mink? Before you start, get acquainted with any existing language datasets related to your interests. Språkbanken's growing collection of research data can be browsed in the Data section of our website. On a larger scale, Språkbanken is part of CLARIN ERIC, whose collected data can be browsed in the Virtual Language Observatory.

Contact: sb-info@svenska.gu.se