libucto1-dev - Unicode Tokenizer - development

Distribution Debian 7 (Wheezy)
Repository Debian Main i386
Package name libucto1-dev
Package version 0.5.2
Package release 2
Package architecture i386
Package type deb
Installed size 265 B
Download size 89.67 KB
Ucto can tokenize UTF-8 encoded text files (i.e. separate words from
punctuation, split sentences, generate n-grams), and offers several other
basic preprocessing steps (change case, count words/characters and reverse
lines) that make your text suited for further processing such as indexing,
part-of-speech tagging, or machine translation.

Ucto is a product of the ILK Research Group, Tilburg University (The
Netherlands).

This package provides the ucto header files required to compile C++ programs
that use ucto.
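
To make the terms in the description concrete, here is a minimal standalone C++ sketch of what "separating words from punctuation" and "generating n-grams" mean. It does not use the ucto headers or API. The function names simple_tokenize and make_ngrams are hypothetical, and the sketch assumes ASCII input, whereas ucto itself works on UTF-8 text and applies language-specific rules.

```cpp
#include <cctype>
#include <cstddef>
#include <string>
#include <vector>

// Illustrative only: this is NOT the ucto API, just a toy version of the idea.
// Splits ASCII text into word tokens and single-character punctuation tokens.
std::vector<std::string> simple_tokenize(const std::string& text) {
    std::vector<std::string> tokens;
    std::string current;
    for (unsigned char ch : text) {
        if (std::isspace(ch)) {
            // Whitespace ends the current word, if any.
            if (!current.empty()) { tokens.push_back(current); current.clear(); }
        } else if (std::ispunct(ch)) {
            // Punctuation ends the current word and becomes its own token.
            if (!current.empty()) { tokens.push_back(current); current.clear(); }
            tokens.push_back(std::string(1, static_cast<char>(ch)));
        } else {
            current.push_back(static_cast<char>(ch));
        }
    }
    if (!current.empty()) tokens.push_back(current);
    return tokens;
}

// Builds n-grams by joining n consecutive tokens with a single space.
// Returns an empty list when n is 0 or there are fewer than n tokens.
std::vector<std::string> make_ngrams(const std::vector<std::string>& tokens,
                                     std::size_t n) {
    std::vector<std::string> ngrams;
    if (n == 0 || tokens.size() < n) return ngrams;
    for (std::size_t i = 0; i + n <= tokens.size(); ++i) {
        std::string gram = tokens[i];
        for (std::size_t j = 1; j < n; ++j) gram += " " + tokens[i + j];
        ngrams.push_back(gram);
    }
    return ngrams;
}
```

Under these assumptions, simple_tokenize("Hello, world!") yields the four tokens Hello , world ! and make_ngrams on that result with n = 2 yields three bigrams. A real tokenizer such as ucto also has to handle abbreviations, numbers, and multi-byte characters, which this sketch ignores.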


libucto1-dev_0.5.2-2_amd64.deb 0.5.2 amd64 Debian Main
Binary Package libucto1-dev_0.5.2-2_i386.deb
Source Package ucto

