[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
A local and uncensored AI entity.
An LLM RAG application with cross-encoder re-ranking for YouTube videos 🎥 (see the re-ranking sketch after this list)
Use your open-source local model from the terminal
A local AI search assistant (web or CLI) for Ollama and llama.cpp. Lightweight and easy to run, providing a Perplexity-like experience.
A lightweight Python tool that uses Optuna to tune llama.cpp flags toward optimal tok/s on your machine (see the Optuna sketch after this list)
A powerful shell driven by a locally running LLM (ideally Llama 3.x or Qwen 2.5)
Run your own local Large Language Model (LLM) chatbot through a local API server such as Ollama (see the chat-loop sketch after this list).
A Thunderbird mail client extension that summarizes received emails via a locally run LLM. In early development.
A narrative/roleplay engine with TCOD levels driven by unreliable narrators, both in the literal and the literary sense. Currently hooks into LM Studio and Gemini for responses, and allows overriding tasks to player control.
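
For the cross-encoder re-ranking entry above, here is a minimal, illustrative Python sketch. It assumes the sentence-transformers package; the model name, query, and candidate chunks are placeholders chosen for the example, not taken from the project itself.

    # Minimal cross-encoder re-ranking sketch (assumes the sentence-transformers
    # package; the model name and candidate chunks below are illustrative).
    from sentence_transformers import CrossEncoder

    query = "How does the video explain KV cache quantization?"
    candidates = [  # chunks from a first-stage retriever (e.g. BM25 or embeddings)
        "The speaker introduces per-channel quantization of keys...",
        "Sponsors of this episode include...",
        "Quantizing the KV cache reduces memory per token...",
    ]

    # A cross-encoder scores each (query, chunk) pair jointly, which is slower
    # than comparing precomputed embeddings but usually more accurate, so it is
    # typically applied only to the top handful of first-stage results.
    reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")
    scores = reranker.predict([(query, c) for c in candidates])

    # Keep the highest-scoring chunks for the LLM prompt.
    ranked = sorted(zip(scores, candidates), reverse=True)
    for score, chunk in ranked[:2]:
        print(f"{score:.3f}  {chunk}")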
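For the Optuna flag-tuning entry, a rough sketch under stated assumptions: a llama-cli binary and a GGUF model on disk, and an illustrative flag set (-t threads, -b batch size). Timing the whole process includes model load time, so a real tool would parse llama.cpp's own timing output instead.

    # Minimal Optuna sketch for tuning llama.cpp flags toward tokens/second.
    # Assumptions: paths to a llama-cli binary and a GGUF model on your machine;
    # the flags tried here are illustrative, not the project's actual search space.
    import subprocess
    import time

    import optuna

    LLAMA_CLI = "./llama-cli"   # adjust to your build
    MODEL = "model.gguf"        # adjust to your model
    N_TOKENS = 64

    def objective(trial):
        threads = trial.suggest_int("threads", 1, 16)
        batch = trial.suggest_categorical("batch", [128, 256, 512, 1024])
        start = time.perf_counter()
        subprocess.run(
            [LLAMA_CLI, "-m", MODEL, "-p", "Hello", "-n", str(N_TOKENS),
             "-t", str(threads), "-b", str(batch)],
            check=True, capture_output=True,
        )
        elapsed = time.perf_counter() - start
        # Rough tok/s proxy: includes model load, so it understates true speed.
        return N_TOKENS / elapsed

    study = optuna.create_study(direction="maximize")
    study.optimize(objective, n_trials=20)
    print(study.best_params, study.best_value)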
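And for the Ollama-backed chatbot entry, a minimal chat loop against Ollama's HTTP API, assuming the server is running on its default port 11434 and a model such as llama3 has been pulled locally; the loop structure is a sketch, not the project's actual code.

    # Minimal local chatbot loop against Ollama's HTTP chat API (assumes Ollama
    # is running on the default port 11434 and llama3 has been pulled).
    import requests

    OLLAMA_URL = "http://localhost:11434/api/chat"
    history = []

    while True:
        user = input("you> ")
        if user in ("exit", "quit"):
            break
        history.append({"role": "user", "content": user})
        resp = requests.post(OLLAMA_URL, json={
            "model": "llama3",      # any locally pulled model tag
            "messages": history,    # full history gives the model context
            "stream": False,        # one JSON response instead of a stream
        })
        resp.raise_for_status()
        reply = resp.json()["message"]["content"]
        history.append({"role": "assistant", "content": reply})
        print("bot>", reply)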