Purpose
This program parses text files downloaded from large text banks. These downloads typically come with lots of articles one after the other in one big text and the idea is to split them up into stories and sort them by author, publication, date etc. It also helps you build corpora by creating a corpus built using only specific authors or publications, creating text files containing a month's texts at a time, etc.
The program can also handle languages like Chinese.
Mike Scott 2020
>