[CAPSULE:0.1.15] ⎯ 2025-04-04
Performance improvements, private crate support, and prompt tuning.
Added
- Support for custom private Rust crates.
- Token generation speed: 97.63% parity with llama.cpp.
- Performance metrics logged in debug output.
Changed
- Codebase refactor for modularity and readability.
- Updated ranking prompt and generation parameters.
- Updated papaya crate to v0.2.1.
- Separated token-to-text from generation worker.