NAME

    zeilen


SYNOPSIS

    zeilen document.txt
    cat *.txt | zeilen


DESCRIPTION

Calculate the number of lines in multilingual text, based on a detailed report of the number of various kinds of characters and an rules for weighing these characters, such that the result can be used for calculating translation fees.

Produce a report such as

    2661 chars, 1010 common, 455 space, 1651 cased, 39 numerals, 324 punctuation
    1 line = 53 single-width chars = 28 kana = 12 han
    38

where 38 would be the resulting number of lines.

We assume one line to consist of 53 alphanumeric+blank+interpunctuation characters or 28 japanese kana or 12 han characters. These parameters should be finetuneable from the command line or configuration files.


PREREQUISITES

    Getopt::Std(3)
    perl(1)


SEE ALSO

    perldoc(1)


LIMITATIONS

Needs to be implemented and documented in more detail.

Currently no distinction is made between text and meta-text (such as html markup), and no non-text formats are supported.


BUGS

Not known.


IMPLEMENTATION

Header

Functions

mycount

Process

read

calculate

report

The End