[request] add 4060 Ti 16 GB support
thanks!
We are working on this now!
{ "model": "hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4", "draft_model": "hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4", "offload": true, "max_length": 8192, "num_cache_layers": 0, "generation_length": 256, "max_turns": 16, "topk": 32, "temperature": 0.6, "topp": 0.9, "repetition_penalty": 1.05, "width": 16, "num_beams": 24, "depth": 24, "engine": "dynamic", "template": "meta-llama3", "dtype": "float16" }
How about this configuration? Or which model do you want to run? We currently support Llama, Qwen, QwQ, and Mistral.
Looking forward to your feedback! @cerulliber
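
If it helps, here is a minimal sketch of sanity-checking that config before launching, assuming it is saved as `config_4060ti.json` (a hypothetical filename; this loader is illustrative, not the project's actual entry point):

```python
import json

# Load the suggested offloading config (the filename is hypothetical).
with open("config_4060ti.json") as f:
    cfg = json.load(f)

# On a 16 GB card the AWQ-INT4 70B target model cannot fit entirely
# in VRAM, so CPU offloading must stay enabled.
assert cfg["offload"] is True
assert cfg["dtype"] == "float16"

print(f"target model: {cfg['model']}")
print(f"draft model:  {cfg['draft_model']}")
print(f"speculation tree: width={cfg['width']}, "
      f"depth={cfg['depth']}, beams={cfg['num_beams']}")
```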