Stars
Toolkit for linearizing PDFs for LLM datasets/training
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
SGLang is a fast serving framework for large language models and vision language models.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.
UltraGrid low-latency audio and video network transmission system
Deskflow lets you share one mouse and keyboard between multiple computers on Windows, macOS and Linux. It's like a software KVM (but without video).
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket-FLV/HTTP-TS/HTTP-fMP4/WebSocket-TS/WebSocket-fMP4/GB28181/SRT server and client framework based on C++11
Ultimate camera streaming application with support RTSP, RTMP, HTTP-FLV, WebRTC, MSE, HLS, MP4, MJPEG, HomeKit, FFmpeg, etc.
A self-paced course to learn Rust, one exercise at a time.
Next Generation Server Toolkit. Create web servers with everything you need and deploy them wherever you prefer.
A list of Free Software network services and web applications which can be hosted on your own servers
Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS media server and media proxy that allows to read, publish, proxy, record and playback video and audio streams.
BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more
Stable Diffusion web UI
Draw perfect pressure-sensitive freehand lines.
This app demonstrates the controls available in WinUI and the Fluent Design System.
The swiss army knife of lossless video/audio editing
🇨🇳 功能全面的汉字工具库 (拼音 笔画 偏旁 成语 语音 可视化等) (Chinese character util)
FastFlix is a free GUI for H.264, HEVC and AV1 hardware and software encoding!
html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式,支持pc和Android、iOS部分浏览器、Hybrid App(提供Android iOS App源码)、微信,提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码