Nexus
Speculative Decoding Visualizer
Draft model generates 3 candidate sequences and target picks the best match
Prompt
Explain how speculative decoding works.
Draft Node
Qwen3 1.5B · RTX 3060
Token Transfer
Draft Batches →
← Best Sequence
Target Node
Qwen3 70B · H100