-
Notifications
You must be signed in to change notification settings - Fork 59
Issues
is:issue state:open
is:issue state:open
Issue creation is restricted in this repository
Search results
[REQUEST]
enhancementNew feature or requestNew feature or requestStatus: Open.#34 In cli99/llm-analysis;[REQUEST] Implement modern attention schemes such as GQA or MLA
enhancementNew feature or requestNew feature or requestStatus: Open.#33 In cli99/llm-analysis;- Status: Open.#32 In cli99/llm-analysis;
[REQUEST]DeepSeek analysis
enhancementNew feature or requestNew feature or requestStatus: Open.#31 In cli99/llm-analysis;[BUG] NUM_GPUS_PER_NODE not respected in inference
bugSomething isn't workingSomething isn't workingStatus: Open.#30 In cli99/llm-analysis;- Status: Open.#29 In cli99/llm-analysis;
[REQUEST]some question about memory and latency analysis
enhancementNew feature or requestNew feature or requestStatus: Open.#27 In cli99/llm-analysis;- Status: Open.#24 In cli99/llm-analysis;
latency [BUG]
bugSomething isn't workingSomething isn't workingStatus: Open.#21 In cli99/llm-analysis;mistral and mixtral inference[BUG]
bugSomething isn't workingSomething isn't workingStatus: Open.#20 In cli99/llm-analysis;