Pull requests: mlc-ai/mlc-llm

Pull requests list

[Serving] Remove draft tokens after finishing request
#2953 opened Sep 30, 2024 by vinx13
[Model] Add use_qk_norm option for Cohere model
#2877 opened Sep 2, 2024 by tlopex
[Serving] PagedKVCache Quantization
#2663 opened Jul 16, 2024 by davidpissarra
[Bench] Add bench for GSM8K eval
#2585 opened Jun 16, 2024 by Hzfengsy
[Bench] Add bench for MMLU eval
#2584 opened Jun 16, 2024 by Hzfengsy
Add docker container support
#1271 opened Nov 15, 2023 by Sing-Li
Implement Whisper in new concise nn.Module API
#868 opened Sep 5, 2023 by LeshengJin