Dictionaries

Lindera supports various dictionaries for Japanese, Korean, and Chinese morphological analysis. Each dictionary is provided as a separate crate.

DictionaryLanguageCrateDescription
IPADICJapaneselindera-ipadicThe most common dictionary for Japanese
IPADIC NEologdJapaneselindera-ipadic-neologdIPADIC with neologisms (new words)
UniDicJapaneselindera-unidicUniform word unit definitions
ko-dicKoreanlindera-ko-dicKorean morphological analysis
CC-CEDICTChineselindera-cc-cedictChinese-English dictionary
JiebaChineselindera-jiebaJieba-based Chinese dictionary

Obtaining Dictionaries

Pre-built dictionaries are available for download from GitHub Releases. Download the dictionary archive for your target language and extract it to a local directory.

#![allow(unused)]
fn main() {
// Load an external dictionary from a local path
let dictionary = load_dictionary("/path/to/ipadic")?;
}

[!TIP] If you need a self-contained binary without external dictionary files, you can embed dictionaries using the embed-* feature flags and load them using the embedded:// scheme:

#![allow(unused)]
fn main() {
let dictionary = load_dictionary("embedded://ipadic")?;
}

See Feature Flags for details.

See each dictionary crate's documentation for format details, build instructions, and usage examples.