[Morfessor-announce] Morfessor FlatCat released

Mailing list for Morfessor Announcements morfessor-announce at list.aalto.fi
Wed Aug 20 15:56:14 EEST 2014


Hi,

We are excited to reveal Morfessor FlatCat, a new morphological
segmentation method in the Morfessor family.

Morfessor FlatCat has now been officially published at
http://www.cis.hut.fi/projects/morpho/
The source code is available at
https://github.com/aalto-speech/flatcat

Morfessor FlatCat combines the semi-supervised training previously used in
Morfessor Baseline, with the morph categories from Morfessor Categories-MAP
and Categories-ML. There are four morph categories: three proper categories
(stem, prefix and suffix) and a non-morpheme category (marked ZZZ), for
segments that are not proper morphs because they are fragments of a larger
morph. The morph categories allow Morfessor FlatCat to be used as a
stemmer.

The name Morfessor FlatCat comes from the use of a lexicon without 
hierarchical structure (a flat lexicon), together with the morph
categories.

For a detailed description of the method, see the paper:
Grönroos, S.-A., Virpioja, S., Smit, P., and Kurimo, M. (2014).
Morfessor FlatCat: An HMM-based method for unsupervised and semi-supervised
learning of morphology.  In proceedings of the 25th International
Conference on Computational Linguistics.  Pages 1177-1185, Dublin, Ireland,
August 2014, Association for Computational Linguistics.
http://www.aclweb.org/anthology/C/C14/C14-1111.pdf

Morfessor FlatCat will be presented at the COLING conference in Dublin,
Ireland next week. Find us at Poster session II (Tuesday 26th August
14:00-15:15) in the 2nd Floor Lobby.

http://www.coling-2014.org/schedule/tuesday26august.php

--

Stig-Arne Grönroos
Doctoral student
Aalto University


More information about the Morfessor-announce mailing list