NAME

    zeilen

SYNOPSIS

    zeilen document.txt
    cat *.txt | zeilen

DESCRIPTION

Calculate the number of lines in multilingual text, based on a detailed report of the number of various kinds of characters and an rules for weighing these characters, such that the result can be used for calculating translation fees.

Produce a report such as

    2661 chars, 1010 common, 455 space, 1651 cased, 39 numerals, 324 punctuation
    1 line = 53 single-width chars = 28 kana = 12 han
    38

where 38 would be the resulting number of lines.

We assume one line to consist of 53 alphanumeric+blank+interpunctuation characters or 28 japanese kana or 12 han characters. These parameters should be finetuneable from the command line or configuration files.

PREREQUISITES

    Getopt::Std(3)
    perl(1)

SEE ALSO

    perldoc(1)

LIMITATIONS

Needs to be implemented and documented in more detail.

Currently no distinction is made between text and meta-text (such as html markup), and no non-text formats are supported.

BUGS

Not known.

IMPLEMENTATION

Functions

mycount

Process

read

calculate

report

The End