A high-throughput and memory-efficient inference and serving engine for LLMs
algorithm score: 40.5 | community score: N/A | stars: 72.3k