Lindera Dictionary

Lindera Dictionary is the base library for morphological analysis dictionaries. It provides dictionary loading, building, Viterbi-based segmentation, and CRF-based training functionality.

Key Features

  • Dictionary loading from filesystem or embedded data
  • Dictionary building from MeCab-format CSV source files
  • Viterbi algorithm for optimal segmentation
  • N-best path generation (Forward-DP Backward-A*)
  • Memory-mapped file support
  • CRF-based dictionary training (with train feature)

Contents