#

ggml

Here are 107 public repositories matching this topic...

llama.cpp

ggml-org / llama.cpp

LLM inference in C/C++

llama ggml

Updated Apr 10, 2025
C++

xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Updated Apr 9, 2025
Python

LostRuins / koboldcpp

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

llama language-model gemma mistral koboldai llm llamacpp ggml koboldcpp gguf

Updated Apr 9, 2025
C++

rustformers / llm

[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

rust ai ml llm ggml

Updated Jun 24, 2024
Rust

leejet / stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

flux ai cplusplus image-generation diffusion text2image image2image img2img txt2img latent-diffusion stable-diffusion ggml flux-dev flux-schnell

Updated Mar 9, 2025
C++

guinmoon / LLMFarm

llama and other large language models on iOS and MacOS offline using GGML library.

macos swift ios ai llama gpt-2 rwkv ggml gptneox starcoder

Updated Mar 11, 2025
Swift

RWKV / rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

machine-learning deep-learning quantization language-model llm rwkv ggml

Updated Mar 23, 2025
C++

RahulSChand / gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

gpu pytorch llama quantization language-model huggingface llm llamacpp ggml llama2

Updated Dec 3, 2024
JavaScript

gollama

sammcj / gollama

Go manage your Ollama models

macos linux ai models tui llm ggml ollama gguf

Updated Apr 4, 2025
Go

PABannier / bark.cpp

Suno AI's Bark model in C/C++ for fast text-to-speech generation

machine-learning text-to-speech inference tts ggml

Updated Nov 16, 2024
C++

the-crypt-keeper / can-ai-code

Self-evaluating interview for AI coders

ai transformers humaneval llm langchain llama-cpp ggml

Updated Feb 21, 2025
Python

abacaj / mpt-30B-inference

Run inference on MPT-30B using CPU

ggml ctransformers mpt-30b

Updated Jun 30, 2023
Python

Maknee / minigpt4.cpp

Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)

c machine-learning deep-learning cpp quantization multimodal ggml minigpt4

Updated Aug 8, 2023
C++

monatis / clip.cpp

CLIP inference in plain C/C++ with no extra dependencies

c cpp image-search clip multimodal ggml

Updated Aug 18, 2024
C++

LLaMA-Cult-and-More

shm007g / LLaMA-Cult-and-More

Large Language Models for All, 🦙 Cult and More, Stay in touch !

tensorflow transformers pytorch llama gpt alpaca loralib vicuna deepspeed gpt4 llm chatgpt ggml gptq

Updated Jun 1, 2023
HTML

staghado / vit.cpp

Inference Vision Transformer (ViT) in plain C/C++ with ggml

c cpu ai computer-vision cpp image-classification edge-computing vision-transformer whisper-cpp llamacpp ggml

Updated Apr 11, 2024
C++

shubham0204 / SmolChat-Android

Running any GGUF SLMs/LLMs locally, on-device in Android

android kotlin cpp llamacpp ggml small-language-models

Updated Apr 7, 2025
Kotlin

pollockjj / ComfyUI-MultiGPU

This custom_node for ComfyUI adds one-click "Virtual VRAM" for any GGUF UNet and CLIP loader, managing the offload of layers to DRAM or VRAM to maximize the latent space of your card. Also includes nodes for directly loading entire components (UNet, CLIP, VAE) onto the device you choose. Includes 16 examples covering common use cases.

pytorch unet-pytorch stable-diffusion comfyui ggml comfyui-workflow comfyui-nodes gguf-models

Updated Apr 4, 2025
Python

mgonzs13 / llama_ros

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

cpp embeddings llama gpt ros2 vlm reranking llm langchain llava llamacpp ggml gguf rerank llavacpp

Updated Apr 5, 2025
C++

mayooear / private-chatbot-mpt30b-langchain

Chat with your data privately using MPT-30b

gpt llm langchain ggml

Updated Jun 29, 2023
Python

Improve this page

Add a description, image, and links to the ggml topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ggml topic, visit your repo's landing page and select "manage topics."