XML corpus export with scrambled contents
Citation
Språkbanken Text (2023). export-xml-scrambled (updated: 2023-11-06). [Analysis]. Språkbanken Text.Example
This exporter is used with Sparv. Check out Sparv's quick start guide to get started!
To use this exporter, run Sparv with the argument 'xml_export:scrambled':
sparv run xml_export:scrambled
You need to tell Sparv which annotation to use for scrambling (typically <sentence>
or <paragraph>
). You can do
this with the xml_export.scramble_on
setting. E.g. if you want to have a sentence-scrambled export you will need to
add the following setting to your Sparv corpus configuration file:
xml_export:
scramble_on: <sentence>
For more info on how to use Sparv, check out the Sparv documentation.
Intended uses
This export is typically used when publishing the original corpus contents (with added annotations) would infringe on copyrights. Scrambling the contents ensures that one cannot reconstruct the original material.