llama.cpp
Demonstration of speculative decoding and tree-based speculative decoding techniques
More info: