Sparv – Språkbanken's Analysis Platform

Sparv is a powerful command-line tool for enriching text with a wide range of linguistic annotations. Developed by Språkbanken Text, Sparv provides a comprehensive solution for text analysis, supporting various source formats, annotations, and export formats.

Sparv comes equipped with numerous modules that cover the entire workflow: importing text data, annotating it with linguistic features, exporting the results in different formats, and deploying the annotated corpus. Its modular architecture allows for easy extension through plugins, enabling support for additional external tools and custom annotations as needed.

While Sparv is primarily designed for Swedish text, it can also be used for other languages. Writing plugins to integrate third-party tools that support other languages is straightforward, making Sparv a flexible solution for multilingual text analysis.

The source code is available under the MIT license. If you have any questions, problems, or suggestions, please contact sb-sparv@svenska.gu.se.

This documentation is also available in PDF format. You can download the user manual and the developer's guide from the latest Sparv release on GitHub.

Tip

Stay updated on new Sparv releases by subscribing to our GitHub repository:

  1. Log in to GitHub
  2. Go to https://github.com/spraakbanken/sparv
  3. Click "Watch" in the upper right corner
  4. Select "Custom"
  5. Check the "Releases" box
  6. Click "Apply"

Depending on your notification settings, you will now receive notifications about new Sparv releases on GitHub's website, via email, or on your phone.