.TH WC 1
.SH NAME
wc \- word count
.SH SYNOPSIS
.B wc
[
.B -lwrbc
]
[
.I file ...
]
.SH DESCRIPTION
.I Wc
counts lines, words, runes, syntactically invalid
.SM UTF
codes and bytes in the named
.IR files ,
or in the standard input if no file is named.
A word is a maximal string of characters
delimited by spaces, tabs or newlines.
The count of runes includes invalid codes.
.PP
If the optional argument is present,
just the specified counts (lines, words, runes, broken
.SM UTF
codes or bytes)
are selected by the letters
.BR l ,
.BR w ,
.BR r ,
.BR b ,
or
.BR c .
Otherwise, lines, words and bytes
.RB ( -lwc )
are reported.
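.SH EXAMPLES
A brief sketch of typical use; the file names are hypothetical.
Count only the lines in two source files:
.IP
.EX
wc -l main.c util.c
.EE
.PP
Count runes and bytes in the standard input:
.IP
.EX
wc -rc <notes.txt
.EE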
.SH SOURCE
.B /sys/src/cmd/wc.c
.SH BUGS
The Unicode Standard has many blank characters scattered through it,
but
.I wc
looks for only
.SM ASCII
space, tab and newline.
.PP
.I Wc
should have options to count suboptimal
.SM UTF
codes
and bytes that cannot occur in any
.SM UTF
code.