zeilen
zeilen document.txt
cat *.txt | zeilen
Calculate the number of lines in multilingual text from a detailed count of the various kinds of characters and a set of rules for weighting them, so that the result can be used to calculate translation fees.
Produce a report such as
2661 chars, 1010 common, 455 space, 1651 cased, 39 numerals, 324 punctuation
1 line = 53 single-width chars = 28 kana = 12 han
38
where 38 would be the resulting number of lines.
One line is assumed to consist of 53 alphanumeric, blank, and punctuation characters, or 28 Japanese kana, or 12 han characters. These parameters should be tunable from the command line or from configuration files.
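The weighting described above can be illustrated with a minimal sketch in Python. It is not the implementation of zeilen: the names classify and count_lines, the use of Unicode character names to tell han and kana apart, the skipping of line terminators, and the rounding up to whole lines are all assumptions made for the sake of the example. Only the three per-line figures (53, 28, 12) come from the text.

```python
import math
import unicodedata
from fractions import Fraction

# Characters per line for each class, taken from the report line
# "1 line = 53 single-width chars = 28 kana = 12 han".
CHARS_PER_LINE = {"single": 53, "kana": 28, "han": 12}


def classify(ch):
    """Return 'han', 'kana' or 'single' for one character.

    Assumption: the Unicode character name is a good enough heuristic.
    """
    name = unicodedata.name(ch, "")
    if name.startswith("CJK UNIFIED IDEOGRAPH"):
        return "han"
    if "HIRAGANA" in name or "KATAKANA" in name:
        return "kana"
    return "single"


def count_lines(text, chars_per_line=CHARS_PER_LINE):
    """Return the weighted number of lines, rounded up (assumed rounding)."""
    counts = {key: 0 for key in chars_per_line}
    for ch in text:
        if ch in "\r\n":
            continue  # assumption: line terminators are not counted
        counts[classify(ch)] += 1
    # Each class contributes count / chars-per-line; Fraction keeps this exact.
    total = sum(Fraction(counts[k], chars_per_line[k]) for k in counts)
    return math.ceil(total)


if __name__ == "__main__":
    sample = "a" * 53 + "あ" * 28 + "漢" * 12
    print(count_lines(sample))  # one line per class, prints 3
```

Passing a different dictionary as chars_per_line is one way the per-line figures could be tuned; how zeilen itself exposes them is left open by the text.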
Getopt::Std(3)
perl(1)
perldoc(1)
Needs to be implemented and documented in more detail.
Currently, no distinction is made between text and meta-text (such as HTML markup), and no non-text formats are supported.
Not known.