This is a WWW representation of the work of Jim Breen who produced KANJIDIC. Lambert Schomaker wrote a little C program and Unix scripts, using kterm and the netpbm package, to transform KANJIDIC into a Web-readable form. The font was "-jis-fixed-medium-r-normal--24-230-75-75-c-240-jisx0208.1983-0" as provided with Linux/X11.
This was done, on the basis of discussions in the UNIPEN project concerning the coding of Kanji in the databases of on-line handwriting. The UNIPEN project is intended to create a common basis for the comparison of handwriting recognition algorithms as used in pen computers. This WWW version of KANJIDIC allows for a comparison of different coding schemes as compiled by Breen. Note: the top-left character is Kanji, then there is a sequence of kana/katakana/hiragana characters describing it (--> top right), and several ASCII codings are on the bottom of each panel. Look for details in the kanjidic.doc.
There is also a crude English to Kanji index.
AUSTRALIA: ftp.cc.monash.edu.au [130.194.1.106],
files /pub/nihongo/edict.* and kanjidic.*
A list with ftp nodes in your vicinity is shown when you ftp there.
As required by the Copyright statement at the end, the file kanjidic.doc is included.
The file contains a mixture of ASCII characters and kana/kanji encoded using the EUC (Extended Unix Code) coding.
Attention is drawn to the KANJIDIC LICENCE STATEMENT AND COPYRIGHT NOTICE included below in this document.
A similar file, KANJD212, is available for the 5,801 supplementary kanji in the JIS X 0212-1990 set. (not in this WWW format, yet, LS))
Also look at Jeffrey Friedl's CGI database approach to KANJIDIC.
COPYING AND DISTRIBUTION
Permission is granted to make and distribute verbatim copies of these files
provided this KANJIDIC.DOC file, the copyright notice and permission notice
is distributed with all copies. Any distribution of the files must take
place without a financial return, except a charge to cover the cost of the
distribution medium.
Permission is granted to make and distribute extracts or subsets of the
KANJIDIC files under the same conditions applying to verbatim copies.
Permission is granted to translate the English elements of the KANJIDIC file
into other languages, and to make and distribute copies of those translations
under the same conditions applying to verbatim copies.
KANJIDIC USAGE
These files may be freely used by individuals and small groups for reference
and research purposes, and may be accessed by software belonging to, or
operated by, such individuals and small groups.
The files, extracts from the files, and translations of the files must not be
sold as part of any commercial software package, nor must they be
incorporated in any published dictionary or other printed document without
the specific permission of the copyright holders.
COPYRIGHT
Copyright over the documents covered by this statement is held by James
William BREEN, subject to the exceptions outlined below.
The following people have granted permission for material for which they hold
copyright to be included in the files, and distributed under the above
conditions, while retaining their copyright over that material:
Jack HALPERN: The SKIP codes and Frequency codes in the KANJIDIC file.
With regard to the SKIP and Frequency codes, Mr Halpern stated as follows:
"The commercial utilization of the frequency numbers is prohibited
without written permission from Jack Halpern. Use by individuals and
small groups for reference and research purposes is permitted, on
condition that acknowledgement of the source and this notice are
included."
"SKIP is protected by copyright, copyleft and patent laws. The
commercial utilization of SKIP in any form is strictly forbidden
without the written permission of Jack Halpern, the copyright
holder."
Christian WITTERN and Koichi YASUOKA: The Pinyin information in the KANJIDIC
file.
Urs APP: the Four Corner codes and the Morohashi information in the KANJIDIC
file.
Mark SPAHN and Wolfgang HADAMITSKY: the kanji descriptors from their
dictionary.
since Sept '96