Thanks for your work! I would like to ask how fair the comparisons shown by Spec-Bench are. For example, REST can control the size of the datastore it maintains, and Lookahead can control the N-gram length and the size of the pool. Given these tunable hyperparameters, how do you ensure that the reported results constitute a fair comparison? I'm not quite sure, and it would be greatly appreciated if you could provide further explanation.
Thank you so much for your work in establishing this benchmark!
I wanted to ask whether you would consider adding PaSS (https://arxiv.org/pdf/2311.13581.pdf) to your benchmark analysis, since it is included in your references. It supports both nucleus sampling and greedy sampling, though it does require training extra token embeddings.
I noticed that in Table 3 you present a summary of different decoding methodologies.
In particular, you present REST as a methodology amenable to nucleus sampling, but after reading the REST paper, it is not clear how the authors establish that their method preserves the original output distribution of the LLM.