jundot/omlx
15,666
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
适用于Apple Silicon的LLM推理服务器,支持连续批处理和SSD缓存,通过macOS菜单栏管理