SxPipe

Shallow language pipeline

Main website

Description

SxPipe is a modular and customizable processing chain that applies a cascade of surface processing steps to raw corpora (tokenisation, wordform detection, non-deterministic spelling correction, etc.). It is used as a preliminary step before ALMAnaCH's parsers (e.g., FRMG) and for surface processing tasks in their own right (named entity recognition, text normalization, unknown word extraction and processing, etc.).
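To illustrate the general idea of a modular cascade with a non-deterministic step, here is a minimal Python sketch. It is not SxPipe's actual code or interface: the names `tokenise`, `correct_spelling`, `run_pipeline`, and the `CORRECTIONS` table are all hypothetical and chosen only for illustration. The sketch assumes each token position carries a list of alternatives, so a correction step can add candidates without discarding the original form.

```python
import re
from typing import Callable, List

# Each token position holds a list of alternative forms, so ambiguity is
# preserved for later steps instead of being resolved early.
Lattice = List[List[str]]
Step = Callable[[Lattice], Lattice]

# Hypothetical toy correction table, for illustration only.
CORRECTIONS = {"teh": ["the", "ten"]}


def tokenise(text: str) -> Lattice:
    """Split text into word and punctuation tokens, one alternative each."""
    return [[tok] for tok in re.findall(r"\w+|[^\w\s]", text)]


def correct_spelling(lattice: Lattice) -> Lattice:
    """Append candidate corrections next to the original form."""
    out: Lattice = []
    for alternatives in lattice:
        extended = list(alternatives)
        for form in alternatives:
            for candidate in CORRECTIONS.get(form.lower(), []):
                if candidate not in extended:
                    extended.append(candidate)
        out.append(extended)
    return out


def run_pipeline(text: str, steps: List[Step]) -> Lattice:
    """Tokenise the text, then apply each step in order."""
    lattice = tokenise(text)
    for step in steps:
        lattice = step(lattice)
    return lattice


result = run_pipeline("Teh cat sleeps.", [correct_spelling])
print(result)
```

Because every step takes and returns the same structure, steps can be added, removed, or reordered independently, which is the sense of "modular and customizable" assumed in this sketch.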

Contact

For more information, or if you have any questions, please contact Benoît Sagot:

Benoit.Sagot[at]inria.fr