[CHISE-en:103] Re: CHISE IDS in IDSgrep, syntactically invalid IDSes

mskala at ansuz.sooke.bc.ca mskala at ansuz.sooke.bc.ca
Sat Jul 21 23:16:36 JST 2012


On Sat, 21 Jul 2012, 守岡知彦 / MORIOKA Tomohiko wrote:
> I would like to try it, so I downloaded and installed it, but it seems
> that data file(s) are missing.  Does it require the ``Tsukurimashou''
> package?

The packaged 0.2 version of IDSgrep requires either Tsukurimashou or
KanjiVG to generate a database.  If you check out the latest development
version from the SVN or Git repositories, that version can build using
CHISE IDS (but is still alpha quality).  I'm also attaching to this
message a copy of the KanjiVG-based compiled dictionary, which should help
you get started.  I don't distribute this file as part of the IDSgrep
package because it is under a Creative Commons license that isn't
GPL-compatible.

> CHISE IDS 0.25 is too old.  It is better to use the latest version in
> the Git repository:

Thanks for the pointer.  With my current code the Git version actually
gives an even longer list of errors.  However, many of those are in the
IDS-HZK files and maybe it's safe to ignore them, and many of the
remaining ones seem to be because the Git version uses variation
sequences, which my code doesn't understand yet.  I should be able to
extend it to handle variation sequences, and when I do, the error count
may be reduced a lot.

-- 
Matthew Skala
mskala at ansuz.sooke.bc.ca                 People before principles.
http://ansuz.sooke.bc.ca/
-------------- next part --------------
A non-text attachment was scrubbed...
Name: kanjivg.eids
Type: application/octet-stream
Size: 175127 bytes
Desc: 
URL: <http://lists.chise.org/pipermail/chise-en/attachments/20120721/4b404df7/attachment-0001.obj>


More information about the CHISE-en mailing list