llama-cpp

2 articles
0 5/10

A step-by-step guide to running open-source LLMs locally with Claude Code via llama.cpp. It demonstrates deploying models such as Qwen3.5 and GLM-4.7-Flash, with quantization and GPU optimization, for coding tasks.

Unsloth Claude Code Qwen3.5 GLM-4.7-Flash llama.cpp DeepSeek Gemma Qwen3-Coder-Next OpenAI
unsloth.ai · armcat · 1 day ago · details · hn
0 2/10

A guide to running coding agents entirely locally with llama.cpp and Pi, providing Claude-like code assistance on your own hardware using compatible open-source models from Hugging Face.

llama.cpp Pi Hugging Face OpenClaw llama-agent Qwen3.5-122B-A10B-GGUF GLM-4.7-Flash-GGUF Unsloth
huggingface.co · kristianpaul · 1 day ago · details · hn