Pinned Loading
Repositories
Showing 10 of 31 repositories
- speculators Public
A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM
vllm-project/speculators’s past year of commit activity - semantic-router Public
System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge
vllm-project/semantic-router’s past year of commit activity