Sparv – Språkbanken's Analysis Platform¶
Sparv is a powerful command line tool designed for annotating text with a wide range of linguistic
annotations. Developed by Språkbanken Text, Sparv provides a comprehensive solution for
text analysis, supporting various source formats, annotations, and export formats.
Sparv comes equipped with numerous modules that facilitate the entire workflow from importing text data, annotating it with linguistic features, exporting the results in different formats, to deploying the annotated corpus. Its modular architecture allows for easy extension through plugins, enabling support for additional external tools and custom annotations as needed.
While Sparv is primarily designed for Swedish text, it can also be used for other languages. Writing plugins to integrate third-party tools that support other languages is straightforward, making Sparv a flexible solution for multilingual text analysis.
The source code is available under the MIT license. If you have any questions, problems or suggestions please contact sb-sparv@svenska.gu.se.
This documentation is also available in PDF format. You can download the user manual and the developer's guide from the latest Sparv release on GitHub.
Cite Sparv
Martin Hammarstedt, Anne Schumacher, Lars Borin, Markus Forsberg (2022): Sparv 5 User
Manual
Tip
Stay updated on new Sparv releases by
subscribing to our GitHub repository:
- Log in to GitHub
- Go to https://github.com/spraakbanken/sparv
- Click "Watch" in the upper right corner
- Select "Custom"
- Check the "Releases" box
- Click "Apply"
Depending on your notification settings you will now receive notifications about new Sparv releases on GitHub's website, via email, or on your phone.