

BitLlama
Pure Rust LLM inference engine with 1.58-bit ternary support and Test-Time Training
winget install --id=imonoonoko.BitLlama -e Description
BitLlama is a Pure Rust LLM inference engine featuring 1.58-bit ternary quantization, Test-Time Training (TTT), Soul learning system, MCP server/client, and private RAG. Supports Llama, Gemma, Mistral, Qwen, and BitNet models. OpenAI-compatible API server included.
BitLlama is available through winget as package ID imonoonoko.BitLlama, with version 1.0.0. Use the install command above to set it up on Windows without downloading a separate installer manually.
imonoonoko.BitLlama
Related Apps
Desktop GUI for BitLlama LLM inference engine with Soul learning and model management
Provides a language-agnostic way to express coding assumptions in .NET programs.
.NET Reactor is a powerful code protection and software licensing system for software written for the .NET Framework, and supports all languages that generate .NET assemblies.
02Engine — A faster, stronger, richer creative engine for the hardcore.
Micropatching security vulnerabilities for many programs and in real time