– Create a WordTabulator index of the source texts.
– Produce N-grams and phrases from the source texts.
– Produce an index of word elements in the source texts.
– Compare the word-element indexes of two sets of source texts, word by word.
– Generate a list of each word element in the two sets.
– Compute the Jaccard index between the word-element indexes of two sets of source texts.
– Compute the Levenshtein distance between the word-element indexes of two sets of source texts.
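
The operations listed above can be illustrated with a minimal Python sketch. It is not WordTabulator's own code or API: the function names (word_index, ngrams, jaccard, levenshtein) are hypothetical, words are assumed to be runs of word characters compared case-insensitively, and the Jaccard index is assumed to be taken over the two vocabularies. How the program applies the Levenshtein distance to an index is not stated, so the sketch computes it over any two sequences (strings or word lists).

```python
from collections import Counter
import re


def word_index(text):
    """Map each lowercased word to its number of occurrences."""
    return Counter(re.findall(r"\w+", text.lower()))


def ngrams(text, n):
    """Return all runs of n consecutive words as tuples."""
    words = re.findall(r"\w+", text.lower())
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


def jaccard(index_a, index_b):
    """Jaccard index of the two vocabularies: |A & B| / |A | B|."""
    a, b = set(index_a), set(index_b)
    if not a and not b:
        return 1.0  # convention chosen here for two empty inputs
    return len(a & b) / len(a | b)


def levenshtein(s, t):
    """Edit distance between two sequences (strings or word lists)."""
    prev = list(range(len(t) + 1))
    for i, x in enumerate(s, start=1):
        curr = [i]
        for j, y in enumerate(t, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]
```

For example, jaccard(word_index("the cat sat on the mat"), word_index("the dog sat on the log")) compares two five-word vocabularies that share three words out of seven distinct words overall.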

This release improves some functionality and fixes several bugs.

For more information about the program, please visit the link at the end of the article.

Version 1.0.3

February 17, 2005

Added support for Greek texts with standard encoding.

Version 1.0.2

February 14, 2005

Added support for function definitions with a suffix, e.g. function0().

Bug fixes for the LineCount and NonWordCount functions.

Version 1.0.1

February 13, 2005

Added support for English text with the standard encoding.

Version 1.0

February 6, 2005

Initial release.


– Optimization of some functions and modules:
the word count, NonWordCount, LineCount, and HTML count functions.

– Support for the following encoding standard:

– Support for wide-character strings.

– Added several new functions:
– GetFileText – Get all the text in a text file or in an HTML/XML/SGML document.
– EncodeText – Encode a file, string, HTML/XML/SGML document, or text to a binary format,
useful for the analysis of web pages.
– SetEncoding – Set an encoding for a text file, string, HTML/XML/SGML document, or text.
– GetEncoding – Get the current encoding of a text file, string, HTML/XML/SGML document, or text.
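
As a rough illustration of what extracting all the text of a markup document with a chosen encoding can involve, here is a minimal Python sketch. It does not reproduce GetFileText or SetEncoding: the names get_markup_text and _TextExtractor are hypothetical, the input is assumed to be raw bytes in a known encoding, and skipping script and style contents is a choice made for this example only.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect text nodes, skipping script and style contents."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0  # depth of enclosing script/style elements

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip and data.strip():
            self.parts.append(data.strip())


def get_markup_text(raw, encoding="utf-8"):
    """Decode raw bytes with the given encoding and return the visible text."""
    parser = _TextExtractor()
    parser.feed(raw.decode(encoding))
    parser.close()
    return " ".join(parser.parts)
```

Passing encoding="iso-8859-7" would decode a Greek page stored in that encoding; the default is UTF-8.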

– Added module tt_lib.php, function tt_lib_end().

– Added module tt_lib.php, function tt_lib_word

