Text segmenter.

This is a very simple grammar to demonstrate how to recognize sentence delimiters, in order to split texts into sentences, e.g.:

It will split the following text in two:
"It is raining. Joe is sleeping."

It will not split the following text:
"Dr Joe is sleeping."

Use segment to segment raw .txt files. The resulting file will be an annotated segmented text with an empty Text Annotation Structure (TAS).

For a much more detailed grammar for sentences segmentation, see:

Silberztein, M. Finite state descriptions of various levels of linguistic phenomena. In Language Research, 28/4 (1992).

See www.nooj4nlp.org for more information on NooJ, its community and its full-size linguistic resources.
