Hand-picked awesome Python libraries and frameworks, organised by category 🐍
Interactive version: www.awesomepython.org
Updated 19 Dec 2024
- Newly Created Repositories - Awesome Python is regularly updated, and this category lists the most recently created GitHub repositories from all the other repositories here (10 repos)
- Code Quality - Code quality tooling: linters, formatters, pre-commit hooks, unused code removal (17 repos)
- Crypto and Blockchain - Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity (13 repos)
- Data - General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks (109 repos)
- Debugging - Debugging and tracing tools (10 repos)
- Diffusion Text to Image - Text-to-image diffusion model libraries, tools and apps for generating images from natural language (42 repos)
- Finance - Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives (33 repos)
- Game Development - Game development tools, engines and libraries (7 repos)
- GIS - Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections (29 repos)
- Graph - Graphs and network libraries: network analysis, graph machine learning, visualisation (6 repos)
- GUI - Graphical user interface libraries and toolkits (8 repos)
- Jupyter - Jupyter, JupyterLab and Notebook tools, libraries and plugins (27 repos)
- LLMs and ChatGPT - Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integrations. Also see the Natural Language Processing category for crossover (306 repos)
- Math and Science - Mathematical, numerical and scientific libraries (26 repos)
- Machine Learning - General - General and classical machine learning libraries. See below for other sections covering specialised ML areas (159 repos)
- Machine Learning - Deep Learning - Machine learning libraries that cross over with deep learning in some way (78 repos)
- Machine Learning - Interpretability - Machine learning interpretability libraries. Covers explainability, prediction explanations, dashboards, understanding knowledge development in training (22 repos)
- Machine Learning - Ops - MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models (44 repos)
- Machine Learning - Reinforcement - Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environments, RLHF (24 repos)
- Machine Learning - Time Series - Machine learning and classical timeseries libraries: forecasting, seasonality, anomaly detection, econometrics (19 repos)
- Natural Language Processing - Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover (86 repos)
- Packaging - Python packaging, dependency management and bundling (28 repos)
- Pandas - Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations (24 repos)
- Performance - Performance, parallelisation and low level libraries (28 repos)
- Profiling - Memory and CPU/GPU profiling tools and libraries (11 repos)
- Security - Security related libraries: vulnerability discovery, SQL injection, environment auditing (14 repos)
- Simulation - Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Math and Science category for crossover (34 repos)
- Study - Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials (58 repos)
- Template - Template tools and libraries: cookiecutter repos, generators, quick-starts (9 repos)
- Terminal - Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars (15 repos)
- Testing - Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins (24 repos)
- Typing - Typing libraries: static and run-time type checking, annotations (12 repos)
- Utility - General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools (207 repos)
- Visualisation - Visualisation tools and libraries: application frameworks, 2D/3D plotting, dashboards, WebGL (35 repos)
- Web - Web related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management (58 repos)
Awesome Python is regularly updated, and this category lists the most recently created GitHub repositories from all the other repositories here.
-
microsoft/markitdown ⭐ 22,522
A utility for converting files to Markdown, supports: PDF, PPT, Word, Excel, Images etc -
tencent/HunyuanVideo ⭐ 6,092
HunyuanVideo: A Systematic Framework For Large Video Generation Model
🔗 aivideo.hunyuan.tencent.com -
yangchris11/samurai ⭐ 5,721
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
🔗 yangchris11.github.io/samurai -
microsoft/TRELLIS ⭐ 4,521
A large 3D asset generation model. It takes in text or image prompts and generates high-quality 3D assets in various formats, such as Radiance Fields, 3D Gaussians, and meshes.
🔗 trellis3d.github.io -
bhavnicksm/chonkie ⭐ 1,806
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
🔗 docs.chonkie.ai -
modelscope/ClearerVoice-Studio ⭐ 1,799
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc. -
huggingface/smollm ⭐ 1,354
Everything about the SmolLM & SmolLM2 family of models
🔗 huggingface.co/huggingfacetb -
stackloklabs/promptwright ⭐ 319
Promptwright is a Python library designed for generating large synthetic datasets using LLMs -
cfahlgren1/observers ⭐ 215
A Lightweight Library for AI Observability -
neuml/annotateai ⭐ 167
Automatically annotates papers using Large Language Models (LLMs)
Code quality tooling: linters, formatters, pre-commit hooks, unused code removal.
-
psf/black ⭐ 39,277
The uncompromising Python code formatter
🔗 black.readthedocs.io/en/stable -
astral-sh/ruff ⭐ 33,448
An extremely fast Python linter and code formatter, written in Rust.
🔗 docs.astral.sh/ruff -
pre-commit/pre-commit ⭐ 13,048
A framework for managing and maintaining multi-language pre-commit hooks.
🔗 pre-commit.com -
sqlfluff/sqlfluff ⭐ 8,183
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
🔗 www.sqlfluff.com -
pycqa/isort ⭐ 6,536
A Python utility / library to sort imports.
🔗 pycqa.github.io/isort -
davidhalter/jedi ⭐ 5,832
Awesome autocompletion, static analysis and refactoring library for python
🔗 jedi.readthedocs.io -
pycqa/pylint ⭐ 5,337
It's not just a linter that annoys you!
🔗 pylint.readthedocs.io/en/latest -
asottile/pyupgrade ⭐ 3,627
A tool (and pre-commit hook) to automatically upgrade syntax for newer versions of the language. -
jendrikseipp/vulture ⭐ 3,545
Find dead Python code -
pycqa/flake8 ⭐ 3,473
flake8 is a python tool that glues together pycodestyle, pyflakes, mccabe, and third-party plugins to check the style and quality of some python code.
🔗 flake8.pycqa.org -
wemake-services/wemake-python-styleguide ⭐ 2,557
The strictest and most opinionated python linter ever!
🔗 wemake-python-styleguide.rtfd.io -
python-lsp/python-lsp-server ⭐ 1,972
Fork of the python-language-server project, maintained by the Spyder IDE team and the community -
codespell-project/codespell ⭐ 1,949
check code for common misspellings -
sourcery-ai/sourcery ⭐ 1,573
Instant AI code reviews
🔗 sourcery.ai -
akaihola/darker ⭐ 638
Apply black reformatting to Python files only in regions changed since a given commit. For a practical usage example, see the blog post at https://dev.to/akaihola/improving-python-code-incrementally-3f7a
🔗 pypi.org/project/darker -
tconbeer/sqlfmt ⭐ 418
sqlfmt formats your dbt SQL files so you don't have to
🔗 sqlfmt.com
Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity.
-
ccxt/ccxt ⭐ 33,481
A JavaScript / TypeScript / Python / C# / PHP cryptocurrency trading API with support for more than 100 bitcoin/altcoin exchanges
🔗 docs.ccxt.com -
freqtrade/freqtrade ⭐ 31,903
Free, open source crypto trading bot
🔗 www.freqtrade.io -
crytic/slither ⭐ 5,375
Static Analyzer for Solidity and Vyper
🔗 blog.trailofbits.com/2018/10/19/slither-a-solidity-static-analysis-framework -
ethereum/web3.py ⭐ 5,063
A python interface for interacting with the Ethereum blockchain and ecosystem.
🔗 web3py.readthedocs.io -
ethereum/consensus-specs ⭐ 3,591
Ethereum Proof-of-Stake Consensus Specifications -
cyberpunkmetalhead/Binance-volatility-trading-bot ⭐ 3,428
This is a fully functioning Binance trading bot that measures the volatility of every coin on Binance and places trades with the highest gaining coins. If you like this project, consider donating through the Brave browser to allow me to continuously improve the script. -
ethereum/py-evm ⭐ 2,275
A Python implementation of the Ethereum Virtual Machine
🔗 py-evm.readthedocs.io/en/latest -
bmoscon/cryptofeed ⭐ 2,271
Cryptocurrency Exchange Websocket Data Feed Handler -
binance/binance-public-data ⭐ 1,580
Details on how to get Binance public data -
ofek/bit ⭐ 1,252
Bitcoin made easy.
🔗 ofek.dev/bit -
man-c/pycoingecko ⭐ 1,052
Python wrapper for the CoinGecko API -
dylanhogg/awesome-crypto ⭐ 68
A list of awesome crypto and blockchain projects
🔗 www.awesomecrypto.xyz
General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks.
-
scrapy/scrapy ⭐ 53,416
Scrapy, a fast high-level web crawling & scraping framework for Python.
🔗 scrapy.org -
apache/spark ⭐ 40,125
Apache Spark - A unified analytics engine for large-scale data processing
🔗 spark.apache.org -
mindsdb/mindsdb ⭐ 26,892
Platform for building AI that can learn and answer questions over federated data.
🔗 mindsdb.com -
getredash/redash ⭐ 26,536
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
🔗 redash.io -
jaidedai/EasyOCR ⭐ 24,766
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
🔗 www.jaided.ai -
microsoft/markitdown ⭐ 22,522
A utility for converting files to Markdown, supports: PDF, PPT, Word, Excel, Images etc -
qdrant/qdrant ⭐ 20,873
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
🔗 qdrant.tech -
humansignal/label-studio ⭐ 19,658
Label Studio is an open source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats.
🔗 labelstud.io -
joke2k/faker ⭐ 17,833
Faker is a Python package that generates fake data for you.
🔗 faker.readthedocs.io -
avaiga/taipy ⭐ 17,153
Turns Data and AI algorithms into production-ready web applications in no time.
🔗 www.taipy.io -
binux/pyspider ⭐ 16,512
A Powerful Spider(Web Crawler) System in Python.
🔗 docs.pyspider.org -
airbytehq/airbyte ⭐ 16,389
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
🔗 airbyte.com -
twintproject/twint ⭐ 15,866
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations. -
chroma-core/chroma ⭐ 15,728
the AI-native open-source embedding database
🔗 www.trychroma.com -
ds4sd/docling ⭐ 15,431
Docling parses documents and exports them to the desired format with ease and speed.
🔗 ds4sd.github.io/docling -
tiangolo/sqlmodel ⭐ 14,739
SQL databases in Python, designed for simplicity, compatibility, and robustness.
🔗 sqlmodel.tiangolo.com -
apache/arrow ⭐ 14,702
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
🔗 arrow.apache.org -
coleifer/peewee ⭐ 11,243
a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
🔗 docs.peewee-orm.com -
s0md3v/Photon ⭐ 11,110
Incredibly fast crawler designed for OSINT. -
sqlalchemy/sqlalchemy ⭐ 9,760
The Database Toolkit for Python
🔗 www.sqlalchemy.org -
simonw/datasette ⭐ 9,616
An open source multi-tool for exploring and publishing data
🔗 datasette.io -
bigscience-workshop/petals ⭐ 9,272
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
🔗 petals.dev -
voxel51/fiftyone ⭐ 8,942
Refine high-quality datasets and visual AI models
🔗 fiftyone.ai -
yzhao062/pyod ⭐ 8,633
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
🔗 pyod.readthedocs.io -
gristlabs/grist-core ⭐ 7,342
Grist is the evolution of spreadsheets.
🔗 www.getgrist.com -
tobymao/sqlglot ⭐ 6,836
Python SQL Parser and Transpiler
🔗 sqlglot.com -
alirezamika/autoscraper ⭐ 6,510
A Smart, Automatic, Fast and Lightweight Web Scraper for Python -
kaggle/kaggle-api ⭐ 6,323
Official Kaggle API -
madmaze/pytesseract ⭐ 5,899
A Python wrapper for Google Tesseract -
vi3k6i5/flashtext ⭐ 5,598
Extract Keywords from sentence or Replace keywords in sentences. -
airbnb/knowledge-repo ⭐ 5,487
A next-generation curated knowledge sharing platform for data scientists and other technical professions. -
pathwaycom/pathway ⭐ 5,433
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
🔗 pathway.com -
ibis-project/ibis ⭐ 5,369
Ibis is a Python library that provides a lightweight, universal interface for data wrangling. It helps Python users explore and transform data of any size, stored anywhere.
🔗 ibis-project.org -
facebookresearch/AugLy ⭐ 4,971
A data augmentations library for audio, image, text, and video.
🔗 ai.facebook.com/blog/augly-a-new-data-augmentation-library-to-help-build-more-robust-ai-models -
lancedb/lancedb ⭐ 4,904
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
🔗 lancedb.github.io/lancedb -
superduperdb/superduper ⭐ 4,848
Superduper: Build end-to-end AI applications and agent workflows on your existing data infrastructure and preferred tools - without migrating your data.
🔗 superduper.io -
jazzband/tablib ⭐ 4,634
Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.
🔗 tablib.readthedocs.io -
amundsen-io/amundsen ⭐ 4,456
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
🔗 www.amundsen.io/amundsen -
lk-geimfari/mimesis ⭐ 4,453
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
🔗 mimesis.name -
mongodb/mongo-python-driver ⭐ 4,168
PyMongo - the Official MongoDB Python driver
🔗 www.mongodb.com/docs/languages/python/pymongo-driver/current -
giskard-ai/giskard ⭐ 4,109
🐢 Open-Source Evaluation & Testing for AI & LLM systems
🔗 docs.giskard.ai -
andialbrecht/sqlparse ⭐ 3,771
A non-validating SQL parser module for Python -
rom1504/img2dataset ⭐ 3,768
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine. -
jmcnamara/XlsxWriter ⭐ 3,668
A Python module for creating Excel XLSX files.
🔗 xlsxwriter.readthedocs.io -
deepchecks/deepchecks ⭐ 3,639
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling you to thoroughly test your data and models from research to production.
🔗 docs.deepchecks.com/stable -
praw-dev/praw ⭐ 3,538
PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.
🔗 praw.readthedocs.io -
run-llama/llama-hub ⭐ 3,456
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
🔗 llamahub.ai -
pyeve/cerberus ⭐ 3,177
Lightweight, extensible data validation library for Python
🔗 python-cerberus.org -
rapidai/RapidOCR ⭐ 3,163
📄 Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO and PaddlePaddle.
🔗 rapidai.github.io/rapidocrdocs -
zoomeranalytics/xlwings ⭐ 3,009
xlwings is a Python library that makes it easy to call Python from Excel and vice versa. It works with Excel on Windows and macOS as well as with Google Sheets and Excel on the web.
🔗 www.xlwings.org -
docarray/docarray ⭐ 2,992
Represent, send, store and search multimodal data
🔗 docs.docarray.org -
datafold/data-diff ⭐ 2,949
Compare tables within or across databases
🔗 docs.datafold.com -
pallets/itsdangerous ⭐ 2,940
Safely pass trusted data to untrusted environments and back.
🔗 itsdangerous.palletsprojects.com -
sqlalchemy/alembic ⭐ 2,921
A database migrations tool for SQLAlchemy. -
goldsmith/Wikipedia ⭐ 2,891
A Pythonic wrapper for the Wikipedia API
🔗 wikipedia.readthedocs.org -
dlt-hub/dlt ⭐ 2,775
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🔗 dlthub.com/docs -
awslabs/amazon-redshift-utils ⭐ 2,775
Amazon Redshift Utils contains utilities, scripts and views which are useful in a Redshift environment -
kayak/pypika ⭐ 2,552
PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.
🔗 pypika.readthedocs.io/en/latest -
pynamodb/PynamoDB ⭐ 2,460
A pythonic interface to Amazon's DynamoDB
🔗 pynamodb.readthedocs.io -
sdv-dev/SDV ⭐ 2,406
Synthetic data generation for tabular data
🔗 docs.sdv.dev/sdv -
uqfoundation/dill ⭐ 2,285
serialize all of Python
🔗 dill.rtfd.io -
emirozer/fake2db ⭐ 2,283
Generate fake but valid data filled databases for test purposes using most popular patterns(AFAIK). Current support is sqlite, mysql, postgresql, mongodb, redis, couchdb. -
samuelcolvin/arq ⭐ 2,195
Fast job queuing and RPC in python with asyncio and redis.
🔗 arq-docs.helpmanual.io -
graphistry/pygraphistry ⭐ 2,174
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer -
accenture/AmpliGraph ⭐ 2,161
Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org -
mlabonne/llm-datasets ⭐ 2,089
High-quality datasets, tools, and concepts for LLM fine-tuning. -
huggingface/datatrove ⭐ 2,077
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. -
sfu-db/connector-x ⭐ 2,032
Fastest library to load data from DB to DataFrames in Rust and Python
🔗 sfu-db.github.io/connector-x -
aminalaee/sqladmin ⭐ 1,947
SQLAlchemy Admin for FastAPI and Starlette
🔗 aminalaee.dev/sqladmin -
agronholm/sqlacodegen ⭐ 1,931
Automatic model code generator for SQLAlchemy -
milvus-io/bootcamp ⭐ 1,922
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
🔗 milvus.io -
uber/petastorm ⭐ 1,803
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code. -
aio-libs/aiomysql ⭐ 1,768
aiomysql is a library for accessing a MySQL database from the asyncio
🔗 aiomysql.rtfd.io -
simple-salesforce/simple-salesforce ⭐ 1,706
A very simple Salesforce.com REST API client for Python -
simonw/sqlite-utils ⭐ 1,700
Python CLI utility and library for manipulating SQLite databases
🔗 sqlite-utils.datasette.io -
collerek/ormar ⭐ 1,683
python async orm with fastapi in mind and pydantic validation
🔗 collerek.github.io/ormar -
zarr-developers/zarr-python ⭐ 1,540
An implementation of chunked, compressed, N-dimensional arrays for Python.
🔗 zarr.readthedocs.io -
eleutherai/the-pile ⭐ 1,509
The Pile is a large, diverse, open source language modelling data set that consists of many smaller datasets combined together. -
ydataai/ydata-synthetic ⭐ 1,455
Synthetic data generators for tabular and time-series data
🔗 docs.synthetic.ydata.ai -
scholarly-python-package/scholarly ⭐ 1,435
Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
🔗 scholarly.readthedocs.io -
sdispater/orator ⭐ 1,425
The Orator ORM provides a simple yet beautiful ActiveRecord implementation.
🔗 orator-orm.com -
mchong6/JoJoGAN ⭐ 1,420
Official PyTorch repo for JoJoGAN: One Shot Face Stylization -
aio-libs/aiopg ⭐ 1,401
aiopg is a library for accessing a PostgreSQL database from the asyncio
🔗 aiopg.readthedocs.io -
google/tensorstore ⭐ 1,358
Library for reading and writing large multi-dimensional arrays.
🔗 google.github.io/tensorstore -
quixio/quix-streams ⭐ 1,232
Python stream processing for Kafka
🔗 docs.quix.io -
eliasdabbas/advertools ⭐ 1,160
advertools - online marketing productivity and analysis tools
🔗 advertools.readthedocs.io -
aio-libs/aiocache ⭐ 1,160
Asyncio cache manager for redis, memcached and memory
🔗 aiocache.readthedocs.io -
pytorch/data ⭐ 1,137
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries. -
brettkromkamp/contextualise ⭐ 1,057
Contextualise is an effective tool particularly suited for organising information-heavy projects and activities consisting of unstructured and widely diverse data and information resources
🔗 contextualise.dev -
uber/fiber ⭐ 1,042
Distributed Computing for AI Made Simple
🔗 uber.github.io/fiber -
intake/intake ⭐ 1,014
Intake is a lightweight package for finding, investigating, loading and disseminating data.
🔗 intake.readthedocs.io -
duckdb/dbt-duckdb ⭐ 940
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org) -
scikit-hep/awkward ⭐ 849
Manipulate JSON-like data with NumPy-like idioms.
🔗 awkward-array.org -
goccy/bigquery-emulator ⭐ 848
BigQuery emulator provides a way to launch a BigQuery server on your local machine for testing and development. -
macbre/sql-metadata ⭐ 816
Uses tokenized query returned by python-sqlparse and generates query metadata
🔗 pypi.python.org/pypi/sql-metadata -
koaning/human-learn ⭐ 800
Natural Intelligence is still a pretty good idea.
🔗 koaning.github.io/human-learn -
googleapis/python-bigquery ⭐ 749
Python Client for Google BigQuery -
hyperqueryhq/whale ⭐ 725
🐳 The stupidly simple CLI workspace for your data warehouse.
🔗 rsyi.gitbook.io/whale -
igorbenav/fastcrud ⭐ 698
FastCRUD is a Python package for FastAPI, offering robust async CRUD operations and flexible endpoint creation utilities. -
dgarnitz/vectorflow ⭐ 677
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
🔗 www.getvectorflow.com -
kagisearch/vectordb ⭐ 654
A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search.
🔗 vectordb.com -
unstructured-io/unstructured-api ⭐ 581
API for Open-Source Pre-Processing Tools for Unstructured Data -
jina-ai/vectordb ⭐ 568
A Python vector database you just need - no more, no less. -
koaning/bulk ⭐ 554
Bulk is a quick UI developer tool to apply some bulk labels. -
koaning/doubtlab ⭐ 505
Doubt your data, find bad labels.
🔗 koaning.github.io/doubtlab -
apache/iceberg-python ⭐ 501
PyIceberg is a Python library for programmatic access to Iceberg table metadata as well as to table data in Iceberg format.
🔗 py.iceberg.apache.org -
titan-systems/titan ⭐ 437
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. -
stackloklabs/promptwright ⭐ 319
Promptwright is a Python library designed for generating large synthetic datasets using LLMs
Debugging and tracing tools.
-
cool-rr/PySnooper ⭐ 16,411
Never use print for debugging again -
shobrook/rebound ⭐ 4,109
Instantly load StackOverflow results in your terminal when an exception is thrown -
inducer/pudb ⭐ 3,000
Full-screen console debugger for Python
🔗 documen.tician.de/pudb -
alexmojaki/heartrate ⭐ 1,764
Simple real time visualisation of the execution of a Python program. -
alexmojaki/birdseye ⭐ 1,658
Graphical Python debugger which lets you easily view the values of all evaluated expressions
🔗 birdseye.readthedocs.io -
pdbpp/pdbpp ⭐ 1,313
pdb++, a drop-in replacement for pdb (the Python debugger) -
alexmojaki/snoop ⭐ 1,284
A powerful set of Python debugging tools, based on PySnooper -
samuelcolvin/python-devtools ⭐ 990
Dev tools for python
🔗 python-devtools.helpmanual.io
Text-to-image diffusion model libraries, tools and apps for generating images from natural language.
-
automatic1111/stable-diffusion-webui ⭐ 144,029
Stable Diffusion web UI -
compvis/stable-diffusion ⭐ 68,649
A latent text-to-image diffusion model
🔗 ommer-lab.com/research/latent-diffusion-models -
comfyanonymous/ComfyUI ⭐ 59,256
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
🔗 www.comfy.org -
stability-ai/stablediffusion ⭐ 39,395
High-Resolution Image Synthesis with Latent Diffusion Models -
lllyasviel/ControlNet ⭐ 30,808
Let us control diffusion models! -
huggingface/diffusers ⭐ 26,505
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
🔗 huggingface.co/docs/diffusers -
invoke-ai/InvokeAI ⭐ 23,853
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
🔗 invoke-ai.github.io/invokeai -
apple/ml-stable-diffusion ⭐ 16,973
Stable Diffusion with Core ML on Apple Silicon -
borisdayma/dalle-mini ⭐ 14,756
DALL·E Mini - Generate images from a text prompt
🔗 www.craiyon.com -
openbmb/MiniCPM-V ⭐ 12,796
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone -
divamgupta/diffusionbee-stable-diffusion-ui ⭐ 12,714
Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.
🔗 diffusionbee.com -
compvis/latent-diffusion ⭐ 12,020
High-Resolution Image Synthesis with Latent Diffusion Models -
instantid/InstantID ⭐ 11,181
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
🔗 instantid.github.io -
lucidrains/DALLE2-pytorch ⭐ 11,176
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch -
facebookresearch/dinov2 ⭐ 9,373
PyTorch code and models for the DINOv2 self-supervised learning method. -
ashawkey/stable-dreamfusion ⭐ 8,338
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion. -
carson-katri/dream-textures ⭐ 7,864
Stable Diffusion built-in to Blender -
xavierxiao/Dreambooth-Stable-Diffusion ⭐ 7,620
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion -
idea-research/GroundingDINO ⭐ 6,949
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
🔗 arxiv.org/abs/2303.05499 -
timothybrooks/instruct-pix2pix ⭐ 6,430
PyTorch implementation of InstructPix2Pix, an instruction-based image editing model, based on the original CompVis/stable_diffusion repo. -
opengvlab/InternVL ⭐ 6,207
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o
🔗 internvl.readthedocs.io/en/latest -
openai/consistency_models ⭐ 6,194
Official repo for consistency models. -
salesforce/BLIP ⭐ 4,871
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation -
nateraw/stable-diffusion-videos ⭐ 4,468
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts -
lkwq007/stablediffusion-infinity ⭐ 3,857
Outpainting with Stable Diffusion on an infinite canvas -
jina-ai/discoart ⭐ 3,842
🪩 Create Disco Diffusion artworks in one line -
mlc-ai/web-stable-diffusion ⭐ 3,609
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
🔗 mlc.ai/web-stable-diffusion -
openai/glide-text2im ⭐ 3,558
GLIDE: a diffusion-based text-conditional image synthesis model -
openai/improved-diffusion ⭐ 3,320
Release for Improved Denoising Diffusion Probabilistic Models -
saharmor/dalle-playground ⭐ 2,769
A playground to generate images from any text prompt using Stable Diffusion (past: using DALL-E Mini) -
stability-ai/stability-sdk ⭐ 2,427
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
🔗 platform.stability.ai -
google-research/big_vision ⭐ 2,420
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. -
thudm/CogVLM2 ⭐ 2,156
GPT4V-level open-source multi-modal model based on Llama3-8B -
coyote-a/ultimate-upscale-for-automatic1111 ⭐ 1,667
Ultimate SD Upscale extension for AUTOMATIC1111 Stable Diffusion web UI -
divamgupta/stable-diffusion-tensorflow ⭐ 1,590
Stable Diffusion in TensorFlow / Keras -
open-compass/VLMEvalKit ⭐ 1,464
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
🔗 huggingface.co/spaces/opencompass/open_vlm_leaderboard -
nvlabs/prismer ⭐ 1,299
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
🔗 shikun.io/projects/prismer -
chenyangqiqi/FateZero ⭐ 1,119
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
🔗 fate-zero-edit.github.io -
thereforegames/unprompted ⭐ 782
Templating language written for Stable Diffusion workflows. Available as an extension for the Automatic1111 WebUI. -
sharonzhou/long_stable_diffusion ⭐ 682
Long-form text-to-images generation, using a pipeline of deep generative models (GPT-3 and Stable Diffusion) -
tanelp/tiny-diffusion ⭐ 680
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets. -
laion-ai/dalle2-laion ⭐ 500
Pretrained Dalle2 from laion
Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives.
-
openbb-finance/OpenBB ⭐ 34,283
Investment Research for Everyone, Everywhere.
🔗 openbb.co -
quantopian/zipline ⭐ 17,778
Zipline, a Pythonic Algorithmic Trading Library
🔗 www.zipline.io -
microsoft/qlib ⭐ 15,687
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms, including supervised learning, ...
🔗 qlib.readthedocs.io/en/latest -
mementum/backtrader ⭐ 15,037
Python Backtesting library for trading strategies
🔗 www.backtrader.com -
ranaroussi/yfinance ⭐ 15,028
Download market data from Yahoo! Finance's API
🔗 aroussi.com/post/python-yahoo-finance -
ai4finance-foundation/FinGPT ⭐ 14,154
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
🔗 ai4finance.org -
ai4finance-foundation/FinRL ⭐ 10,215
FinRL: Financial Reinforcement Learning. 🔥
🔗 ai4finance.org -
quantconnect/Lean ⭐ 10,023
Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
🔗 lean.io -
ta-lib/ta-lib-python ⭐ 9,825
Python wrapper for TA-Lib (http://ta-lib.org/).
🔗 ta-lib.github.io/ta-lib-python -
goldmansachs/gs-quant ⭐ 7,894
Python toolkit for quantitative finance
🔗 developer.gs.com/discover/products/gs-quant -
quantopian/pyfolio ⭐ 5,736
Portfolio and risk analytics in Python
🔗 quantopian.github.io/pyfolio -
kernc/backtesting.py ⭐ 5,605
🔎 📈 🐍 💰 Backtest trading strategies in Python.
🔗 kernc.github.io/backtesting.py -
twopirllc/pandas-ta ⭐ 5,510
Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 150+ Indicators
🔗 twopirllc.github.io/pandas-ta -
ranaroussi/quantstats ⭐ 5,047
Portfolio analytics for quants, written in Python -
google/tf-quant-finance ⭐ 4,584
High-performance TensorFlow library for quantitative finance. -
polakowo/vectorbt ⭐ 4,481
Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
🔗 vectorbt.dev -
gbeced/pyalgotrade ⭐ 4,446
Python Algorithmic Trading Library
🔗 gbeced.github.io/pyalgotrade -
borisbanushev/stockpredictionai ⭐ 4,344
In this notebook I will create a complete process for predicting stock price movements. Follow along and we will achieve some pretty good results. For that purpose we will use a Generative Adversarial Network (GAN) with LSTM, a type of Recurrent Neural Network, as generator, and a Convolutional Neural Networ... -
matplotlib/mplfinance ⭐ 3,743
Financial Markets Data Visualization using Matplotlib
🔗 pypi.org/project/mplfinance -
cuemacro/finmarketpy ⭐ 3,478
Python library for backtesting trading strategies & analyzing financial markets (formerly pythalesians)
🔗 www.cuemacro.com -
quantopian/alphalens ⭐ 3,415
Performance analysis of predictive (alpha) stock factors
🔗 quantopian.github.io/alphalens -
zvtvz/zvt ⭐ 3,286
modular quant framework.
🔗 zvt.readthedocs.io/en/latest -
robcarver17/pysystemtrade ⭐ 2,684
Systematic Trading in python -
quantopian/research_public ⭐ 2,447
Quantitative research and educational materials
🔗 www.quantopian.com/lectures -
pmorissette/bt ⭐ 2,308
bt - flexible backtesting for Python
🔗 pmorissette.github.io/bt -
domokane/FinancePy ⭐ 2,168
A Python Finance Library that focuses on the pricing and risk-management of Financial Derivatives, including fixed-income, equity, FX and credit derivatives. -
blankly-finance/blankly ⭐ 2,163
🚀 💸 Easily build, backtest and deploy your algo in just a few lines of code. Trade stocks, cryptos, and forex across exchanges w/ one package.
🔗 package.blankly.finance -
pmorissette/ffn ⭐ 2,051
ffn - a financial function library for Python
🔗 pmorissette.github.io/ffn -
cuemacro/findatapy ⭐ 1,710
Python library to download market data via Bloomberg, Eikon, Quandl, Yahoo etc. -
quantopian/empyrical ⭐ 1,319
Common financial risk and performance metrics. Used by zipline and pyfolio.
🔗 quantopian.github.io/empyrical -
idanya/algo-trader ⭐ 797
Trading bot with support for realtime trading, backtesting, custom strategies and much more. -
gbeced/basana ⭐ 601
A Python async and event driven framework for algorithmic trading, with a focus on crypto currencies. -
chancefocus/PIXIU ⭐ 563
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
Game development tools, engines and libraries.
-
pygame/pygame ⭐ 7,562
🐍🎮 pygame (the library) is a Free and Open Source python programming language library for making multimedia applications like games built on top of the excellent SDL library. C, Python, Native, OpenGL.
🔗 www.pygame.org -
panda3d/panda3d ⭐ 4,562
Powerful, mature open-source cross-platform game engine for Python and C++, developed by Disney and CMU
🔗 www.panda3d.org -
microsoft/TRELLIS ⭐ 4,521
A large 3D asset generation model. It takes in text or image prompts and generates high-quality 3D assets in various formats, such as Radiance Fields, 3D Gaussians, and meshes.
🔗 trellis3d.github.io -
pokepetter/ursina ⭐ 2,230
A game engine powered by python and panda3d.
🔗 pokepetter.github.io/ursina -
pyglet/pyglet ⭐ 1,905
pyglet is a cross-platform windowing and multimedia library for Python, for developing games and other visually rich applications.
🔗 pyglet.org -
pythonarcade/arcade ⭐ 1,716
Easy to use Python library for creating 2D arcade games.
🔗 arcade.academy
Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections.
-
domlysz/BlenderGIS ⭐ 7,844
Blender addons to make the bridge between Blender and geographic data -
python-visualization/folium ⭐ 6,951
Python Data. Leaflet.js Maps.
🔗 python-visualization.github.io/folium -
osgeo/gdal ⭐ 4,956
GDAL is an open source MIT licensed translator library for raster and vector geospatial data formats.
🔗 gdal.org -
gboeing/osmnx ⭐ 4,924
OSMnx is a Python package to easily download, model, analyze, and visualize street networks and other geospatial features from OpenStreetMap.
🔗 osmnx.readthedocs.io -
geopandas/geopandas ⭐ 4,550
Python tools for geographic data
🔗 geopandas.org -
shapely/shapely ⭐ 3,935
Manipulation and analysis of geometric objects
🔗 shapely.readthedocs.io/en/stable -
giswqs/geemap ⭐ 3,504
A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
🔗 geemap.org -
holoviz/datashader ⭐ 3,330
Quickly and accurately render even the largest data.
🔗 datashader.org -
opengeos/leafmap ⭐ 3,233
A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment
🔗 leafmap.org -
opengeos/segment-geospatial ⭐ 3,089
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
🔗 samgeo.gishub.org -
microsoft/torchgeo ⭐ 2,802
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
🔗 www.osgeo.org/projects/torchgeo -
google/earthengine-api ⭐ 2,709
Python and JavaScript bindings for calling the Earth Engine API. -
rasterio/rasterio ⭐ 2,285
Rasterio reads and writes geospatial raster datasets
🔗 rasterio.readthedocs.io -
mcordts/cityscapesScripts ⭐ 2,190
README and scripts for the Cityscapes Dataset -
azavea/raster-vision ⭐ 2,092
An open source library and framework for deep learning on satellite and aerial imagery.
🔗 docs.rastervision.io -
apache/sedona ⭐ 1,968
A cluster computing framework for processing large-scale geospatial data
🔗 sedona.apache.org -
gboeing/osmnx-examples ⭐ 1,581
Gallery of OSMnx tutorials, usage examples, and feature demonstrations.
🔗 osmnx.readthedocs.io -
jupyter-widgets/ipyleaflet ⭐ 1,494
A Jupyter - Leaflet.js bridge
🔗 ipyleaflet.readthedocs.io -
microsoft/GlobalMLBuildingFootprints ⭐ 1,443
Worldwide building footprints derived from satellite imagery -
pysal/pysal ⭐ 1,342
PySAL: Python Spatial Analysis Library Meta-Package
🔗 pysal.org/pysal -
anitagraser/movingpandas ⭐ 1,247
Movement trajectory classes and functions built on top of GeoPandas
🔗 movingpandas.org -
residentmario/geoplot ⭐ 1,160
High-level geospatial data visualization library for Python.
🔗 residentmario.github.io/geoplot/index.html -
sentinel-hub/eo-learn ⭐ 1,133
Earth observation processing framework for machine learning in Python
🔗 eo-learn.readthedocs.io/en/latest -
opengeos/streamlit-geospatial ⭐ 891
A multi-page streamlit app for geospatial
🔗 huggingface.co/spaces/giswqs/streamlit -
osgeo/grass ⭐ 857
GRASS GIS - free and open-source geospatial processing engine
🔗 grass.osgeo.org -
makepath/xarray-spatial ⭐ 847
Raster-based Spatial Analytics for Python
🔗 xarray-spatial.readthedocs.io -
developmentseed/titiler ⭐ 799
Build your own Raster dynamic map tile services
🔗 developmentseed.org/titiler -
scikit-mobility/scikit-mobility ⭐ 736
scikit-mobility: mobility analysis in Python
🔗 scikit-mobility.github.io/scikit-mobility
Graphs and network libraries: network analysis, graph machine learning, visualisation.
-
networkx/networkx ⭐ 15,035
Network Analysis in Python
🔗 networkx.org -
stellargraph/stellargraph ⭐ 2,956
StellarGraph - Machine Learning on Graphs
🔗 stellargraph.readthedocs.io -
westhealth/pyvis ⭐ 1,016
Python package for creating and visualizing interactive network graphs.
🔗 pyvis.readthedocs.io/en/latest -
microsoft/graspologic ⭐ 830
graspologic is a package for graph statistical algorithms
🔗 graspologic-org.github.io/graspologic -
rampasek/GraphGPS ⭐ 678
Recipe for a General, Powerful, Scalable Graph Transformer -
dylanhogg/llmgraph ⭐ 340
Create knowledge graphs with LLMs
Graphical user interface libraries and toolkits.
-
pysimplegui/PySimpleGUI ⭐ 13,467
Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.
🔗 www.pysimplegui.com -
hoffstadt/DearPyGui ⭐ 13,406
Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies
🔗 dearpygui.readthedocs.io/en/latest -
parthjadhav/Tkinter-Designer ⭐ 9,316
An easy and fast way to create a Python GUI 🐍 -
samuelcolvin/FastUI ⭐ 8,330
FastUI is a new way to build web application user interfaces defined by declarative Python code.
🔗 fastui-demo.onrender.com -
r0x0r/pywebview ⭐ 4,816
Build GUI for your Python program with JavaScript, HTML, and CSS
🔗 pywebview.flowrl.com -
beeware/toga ⭐ 4,400
A Python native, OS native GUI toolkit.
🔗 toga.readthedocs.io/en/latest -
dddomodossola/remi ⭐ 3,535
Python REMote Interface library. Platform independent. In about 100 Kbytes, perfect for your diet. -
wxwidgets/Phoenix ⭐ 2,342
wxPython's Project Phoenix. A new implementation of wxPython, better, stronger, faster than he was before.
🔗 wxpython.org
Jupyter and JupyterLab and Notebook tools, libraries and plugins.
-
jupyterlab/jupyterlab ⭐ 14,250
JupyterLab computational environment.
🔗 jupyterlab.readthedocs.io -
jupyter/notebook ⭐ 11,808
Jupyter Interactive Notebook
🔗 jupyter-notebook.readthedocs.io -
marimo-team/marimo ⭐ 8,209
A reactive Python notebook: run a cell or interact with a UI element, and marimo automatically runs dependent cells, keeping code and outputs consistent. marimo notebooks are stored as pure Python, executable as scripts, and deployable as apps.
🔗 marimo.io -
mwouts/jupytext ⭐ 6,664
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
🔗 jupytext.readthedocs.io -
nteract/papermill ⭐ 6,015
📚 Parameterize, execute, and analyze notebooks
🔗 papermill.readthedocs.io/en/latest -
connorferster/handcalcs ⭐ 5,691
Python library for converting Python calculations into rendered latex. -
voila-dashboards/voila ⭐ 5,489
Voilà turns Jupyter notebooks into standalone web applications
🔗 voila.readthedocs.io -
jupyterlite/jupyterlite ⭐ 3,930
Wasm powered Jupyter running in the browser 💡
🔗 jupyterlite.rtfd.io/en/stable/try/lab -
executablebooks/jupyter-book ⭐ 3,884
Create beautiful, publication-quality books and documents from computational content.
🔗 jupyterbook.org -
jupyterlab/jupyterlab-desktop ⭐ 3,764
JupyterLab desktop application, based on Electron. -
jupyterlab/jupyter-ai ⭐ 3,268
A generative AI extension for JupyterLab
🔗 jupyter-ai.readthedocs.io -
jupyter-widgets/ipywidgets ⭐ 3,171
Interactive Widgets for the Jupyter Notebook
🔗 ipywidgets.readthedocs.io -
quantopian/qgrid ⭐ 3,050
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks -
jupyter/nbdime ⭐ 2,682
Tools for diffing and merging of Jupyter notebooks.
🔗 nbdime.readthedocs.io -
mito-ds/mito ⭐ 2,310
The mitosheet package, trymito.io, and other public Mito code.
🔗 trymito.io -
jupyter/nbviewer ⭐ 2,218
nbconvert as a web service: Render Jupyter Notebooks as static web pages
🔗 nbviewer.jupyter.org -
maartenbreddels/ipyvolume ⭐ 1,948
3d plotting for Python in the Jupyter notebook based on IPython widgets using WebGL -
jupyter-lsp/jupyterlab-lsp ⭐ 1,824
Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename) using Language Server Protocol
🔗 jupyterlab-lsp.readthedocs.io -
jupyter/nbconvert ⭐ 1,758
Jupyter Notebook Conversion
🔗 nbconvert.readthedocs.io -
8080labs/pyforest ⭐ 1,107
With pyforest you can use all your favorite Python libraries without importing them before. If you use a package that is not imported yet, pyforest imports the package for you and adds the code to the first Jupyter cell.
🔗 8080labs.com -
nbqa-dev/nbQA ⭐ 1,054
Run ruff, isort, pyupgrade, mypy, pylint, flake8, and more on Jupyter Notebooks
🔗 nbqa.readthedocs.io/en/latest/index.html -
vizzuhq/ipyvizzu ⭐ 955
Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax.
🔗 ipyvizzu.vizzuhq.com -
koaning/drawdata ⭐ 802
Draw datasets from within Jupyter.
🔗 calmcode.io/labs/drawdata.html -
aws/graph-notebook ⭐ 744
Library extending Jupyter notebooks to integrate with Apache TinkerPop, openCypher, and RDF SPARQL.
🔗 github.com/aws/graph-notebook -
linealabs/lineapy ⭐ 662
Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
🔗 lineapy.org -
xiaohk/stickyland ⭐ 535
Break the linear presentation of Jupyter Notebooks with sticky cells!
🔗 xiaohk.github.io/stickyland -
infuseai/colab-xterm ⭐ 396
Open a terminal in colab, including the free tier.
Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integrations. Also see the Natural Language Processing category for crossover.
-
significant-gravitas/AutoGPT ⭐ 168,948
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🔗 agpt.co -
hwchase17/langchain ⭐ 95,847
🦜🔗 Build context-aware reasoning applications
🔗 python.langchain.com -
nomic-ai/gpt4all ⭐ 71,034
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
🔗 nomic.ai/gpt4all -
ggerganov/llama.cpp ⭐ 68,885
LLM inference in C/C++ -
xtekky/gpt4free ⭐ 62,588
The official gpt4free repository | various collection of powerful language models
🔗 g4f.ai -
killianlucas/open-interpreter ⭐ 57,296
A natural language interface for computers
🔗 openinterpreter.com -
facebookresearch/llama ⭐ 56,688
Inference code for Llama models -
imartinez/private-gpt ⭐ 54,369
Interact with your documents using the power of GPT, 100% privately, no data leaks
🔗 privategpt.dev -
gpt-engineer-org/gpt-engineer ⭐ 52,583
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app -
xai-org/grok-1 ⭐ 49,668
This repository contains JAX example code for loading and running the Grok-1 open-weights model. -
geekan/MetaGPT ⭐ 45,759
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
🔗 deepwisdom.ai -
oobabooga/text-generation-webui ⭐ 40,982
A Gradio web UI for Large Language Models. -
thudm/ChatGLM-6B ⭐ 40,791
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型 -
hpcaitech/ColossalAI ⭐ 38,871
Making large AI models cheaper, faster and more accessible
🔗 www.colossalai.org -
karpathy/nanoGPT ⭐ 37,729
The simplest, fastest repository for training/finetuning medium-sized GPTs. -
logspace-ai/langflow ⭐ 37,253
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
🔗 www.langflow.org -
jerryjliu/llama_index ⭐ 37,179
LlamaIndex is a data framework for your LLM applications
🔗 docs.llamaindex.ai -
lm-sys/FastChat ⭐ 37,171
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. -
laion-ai/Open-Assistant ⭐ 37,108
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
🔗 open-assistant.io -
quivrhq/quivr ⭐ 36,833
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
🔗 core.quivr.com -
hiyouga/LLaMA-Factory ⭐ 35,671
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
🔗 arxiv.org/abs/2403.13372 -
microsoft/autogen ⭐ 35,507
A programming framework for agentic AI 🤖 (PyPi: autogen-agentchat)
🔗 microsoft.github.io/autogen -
moymix/TaskMatrix ⭐ 34,560
Connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting. -
pythagora-io/gpt-pilot ⭐ 32,002
The first real AI developer -
vllm-project/vllm ⭐ 31,438
A high-throughput and memory-efficient inference and serving engine for LLMs
🔗 docs.vllm.ai -
tatsu-lab/stanford_alpaca ⭐ 29,623
Code and documentation to train Stanford's Alpaca models, and generate the data.
🔗 crfm.stanford.edu/2023/03/13/alpaca.html -
meta-llama/llama3 ⭐ 27,403
The official Meta Llama 3 GitHub site -
danielmiessler/fabric ⭐ 25,807
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
🔗 danielmiessler.com/p/fabric-origin-story -
vision-cair/MiniGPT-4 ⭐ 25,472
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
🔗 minigpt-4.github.io -
infiniflow/ragflow ⭐ 24,795
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🔗 ragflow.io -
karpathy/llm.c ⭐ 24,646
LLM training in simple, pure C/CUDA. There is no need for 245MB of PyTorch or 107MB of cPython -
microsoft/JARVIS ⭐ 23,766
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf -
embedchain/mem0 ⭐ 23,182
The Memory layer for your AI apps
🔗 mem0.ai -
openai/gpt-2 ⭐ 22,620
Code for the paper "Language Models are Unsupervised Multitask Learners"
🔗 openai.com/blog/better-language-models -
microsoft/semantic-kernel ⭐ 22,204
Integrate cutting-edge LLM technology quickly and easily into your apps
🔗 aka.ms/semantic-kernel -
joaomdmoura/crewAI ⭐ 22,146
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
🔗 crewai.com -
openai/chatgpt-retrieval-plugin ⭐ 21,087
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language. -
haotian-liu/LLaVA ⭐ 20,594
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
🔗 llava.hliu.cc -
yoheinakajima/babyagi ⭐ 20,542
GPT-4 powered task-driven autonomous agent
🔗 babyagi.org -
karpathy/minGPT ⭐ 20,340
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training -
microsoft/graphrag ⭐ 20,262
A modular graph-based Retrieval-Augmented Generation (RAG) system
🔗 microsoft.github.io/graphrag -
stanfordnlp/dspy ⭐ 19,718
DSPy: The framework for programming—not prompting—language models
🔗 dspy.ai -
mlc-ai/mlc-llm ⭐ 19,333
Universal LLM Deployment Engine with ML Compilation
🔗 llm.mlc.ai -
guidance-ai/guidance ⭐ 19,214
A guidance language for controlling large language models. -
rasahq/rasa ⭐ 19,034
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
🔗 rasa.com/docs/rasa -
unslothai/unsloth ⭐ 18,922
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
🔗 unsloth.ai -
tloen/alpaca-lora ⭐ 18,688
Instruct-tune LLaMA on consumer hardware -
stitionai/devika ⭐ 18,561
Devika is an advanced AI software engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. -
deepset-ai/haystack ⭐ 18,017
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversatio...
🔗 haystack.deepset.ai -
cinnamon/kotaemon ⭐ 17,798
An open-source RAG UI for chatting with your documents. Built with both end users and developers in mind
🔗 cinnamon.github.io/kotaemon -
karpathy/llama2.c ⭐ 17,523
Inference Llama 2 in one file of pure C -
openai/swarm ⭐ 16,668
A framework exploring ergonomic, lightweight multi-agent orchestration. -
huggingface/peft ⭐ 16,639
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
🔗 huggingface.co/docs/peft -
facebookresearch/codellama ⭐ 16,074
Inference code for CodeLlama models -
phidatahq/phidata ⭐ 15,801
Phidata is a toolkit for building AI Assistants using function calling.
🔗 docs.phidata.com -
thudm/ChatGLM2-6B ⭐ 15,732
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型 -
transformeroptimus/SuperAGI ⭐ 15,558
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
🔗 superagi.com -
facebookresearch/llama-recipes ⭐ 15,500
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta L... -
idea-research/Grounded-Segment-Anything ⭐ 15,336
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
🔗 arxiv.org/abs/2401.14159 -
assafelovic/gpt-researcher ⭐ 15,171
LLM based autonomous agent that conducts local and web research on any topic and generates a comprehensive report with citations.
🔗 gptr.dev -
openai/evals ⭐ 15,140
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks. -
mayooear/gpt4-pdf-chatbot-langchain ⭐ 14,955
GPT4 & LangChain Chatbot for large PDF docs
🔗 www.youtube.com/watch?v=ih9pbgvvoo4 -
berriai/litellm ⭐ 14,767
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
🔗 docs.litellm.ai/docs -
fauxpilot/fauxpilot ⭐ 14,621
FauxPilot - an open-source alternative to GitHub Copilot server -
qwenlm/Qwen ⭐ 14,598
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. -
dao-ailab/flash-attention ⭐ 14,524
Fast and memory-efficient exact attention -
mlc-ai/web-llm ⭐ 13,934
High-performance In-browser LLM Inference Engine
🔗 webllm.mlc.ai -
letta-ai/letta ⭐ 13,346
Letta (formerly MemGPT) is a framework for creating LLM services with memory.
🔗 letta.com -
blinkdl/RWKV-LM ⭐ 12,738
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding. -
paddlepaddle/PaddleNLP ⭐ 12,202
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
🔗 paddlenlp.readthedocs.io -
vanna-ai/vanna ⭐ 12,201
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
🔗 vanna.ai/docs -
microsoft/BitNet ⭐ 12,190
Official inference framework for 1-bit LLMs (e.g., BitNet b1.58). It offers a suite of optimized kernels that support fast and lossless inference of 1.58-bit models -
openlmlab/MOSS ⭐ 11,972
An open-source tool-augmented conversational language model from Fudan University
🔗 txsun1997.github.io/blogs/moss.html -
smol-ai/developer ⭐ 11,836
the first library to let you embed a developer agent in your own app!
🔗 twitter.com/smolmodels -
shishirpatil/gorilla ⭐ 11,544
Enables LLMs to use tools by invoking APIs. Given a query, Gorilla comes up with the semantically and syntactically correct API.
🔗 gorilla.cs.berkeley.edu -
h2oai/h2ogpt ⭐ 11,478
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
🔗 h2o.ai -
skyvern-ai/skyvern ⭐ 10,948
Skyvern automates browser-based workflows using LLMs and computer vision. It provides a simple API endpoint to fully automate manual workflows, replacing brittle or unreliable automation solutions.
🔗 www.skyvern.com -
microsoft/LoRA ⭐ 10,890
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
🔗 arxiv.org/abs/2106.09685 -
lightning-ai/litgpt ⭐ 10,870
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
🔗 lightning.ai -
databrickslabs/dolly ⭐ 10,821
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
🔗 www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html -
nvidia/Megatron-LM ⭐ 10,737
Ongoing research training transformer models at scale
🔗 docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start -
google-research/vision_transformer ⭐ 10,593
Vision Transformer and MLP-Mixer Architectures -
lvwerra/trl ⭐ 10,265
Train transformer language models with reinforcement learning.
🔗 hf.co/docs/trl -
artidoro/qlora ⭐ 10,093
QLoRA: Efficient Finetuning of Quantized LLMs
🔗 arxiv.org/abs/2305.14314 -
outlines-dev/outlines ⭐ 9,949
Structured Text Generation from LLMs
🔗 dottxt-ai.github.io/outlines -
mistralai/mistral-inference ⭐ 9,780
Official inference library for Mistral models
🔗 mistral.ai -
microsoft/promptflow ⭐ 9,600
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
🔗 microsoft.github.io/promptflow -
blinkdl/ChatRWKV ⭐ 9,433
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. -
mshumer/gpt-prompt-engineer ⭐ 9,401
Simply input a description of your task and some test cases, and the system will generate, test, and rank a multitude of prompts to find the ones that perform the best. -
karpathy/minbpe ⭐ 9,236
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. -
apple/ml-ferret ⭐ 8,500
Ferret: Refer and Ground Anything Anywhere at Any Granularity -
jxnl/instructor ⭐ 8,464
Instructor is a Python library that makes it a breeze to work with structured outputs from large language models (LLMs). Built on top of Pydantic, it provides a simple, transparent, and user-friendly API to manage validation, retries, and streaming responses.
🔗 python.useinstructor.com -
optimalscale/LMFlow ⭐ 8,304
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
🔗 optimalscale.github.io/lmflow -
thudm/CodeGeeX ⭐ 8,283
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
🔗 codegeex.cn -
llmware-ai/llmware ⭐ 8,260
Unified framework for building enterprise RAG pipelines with small, specialized models
🔗 llmware-ai.github.io/llmware -
abetlen/llama-cpp-python ⭐ 8,256
Simple Python bindings for @ggerganov's llama.cpp library.
🔗 llama-cpp-python.readthedocs.io -
eleutherai/gpt-neo ⭐ 8,240
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
🔗 www.eleuther.ai -
axolotl-ai-cloud/axolotl ⭐ 8,046
Go ahead and axolotl questions
🔗 axolotl-ai-cloud.github.io/axolotl -
sjtu-ipads/PowerInfer ⭐ 7,997
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs -
jzhang38/TinyLlama ⭐ 7,990
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. -
lianjiatech/BELLE ⭐ 7,963
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型) -
vaibhavs10/insanely-fast-whisper ⭐ 7,820
An opinionated CLI to transcribe Audio files w/ Whisper on-device! Powered by 🤗 Transformers, Optimum & flash-attn -
swivid/F5-TTS ⭐ 7,785
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
🔗 arxiv.org/abs/2410.06885 -
plachtaa/VALL-E-X ⭐ 7,714
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/ -
thudm/GLM-130B ⭐ 7,666
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023) -
explodinggradients/ragas ⭐ 7,490
Supercharge Your LLM Application Evaluations 🚀
🔗 docs.ragas.io -
sweepai/sweep ⭐ 7,470
Sweep: open-source AI-powered Software Developer for small features and bug fixes.
🔗 sweep.dev -
openlm-research/open_llama ⭐ 7,400
OpenLLaMA: An Open Reproduction of LLaMA -
andrewyng/aisuite ⭐ 7,376
Simple, unified interface to multiple Generative AI providers. aisuite makes it easy for developers to use multiple LLMs through a standardized interface. -
bigcode-project/starcoder ⭐ 7,336
Home of StarCoder: fine-tuning & inference! -
chainlit/chainlit ⭐ 7,326
Build Conversational AI in minutes ⚡️
🔗 docs.chainlit.io -
zilliztech/GPTCache ⭐ 7,272
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
🔗 gptcache.readthedocs.io -
eleutherai/lm-evaluation-harness ⭐ 7,151
A framework for few-shot evaluation of language models.
🔗 www.eleuther.ai -
langchain-ai/langgraph ⭐ 7,062
LangGraph is a library for building stateful, multi-actor applications with LLMs, built on top of (and intended to be used with) LangChain.
🔗 langchain-ai.github.io/langgraph -
apple/corenet ⭐ 6,996
CoreNet is a deep neural network toolkit that allows researchers and engineers to train standard and novel small and large-scale models for a variety of tasks, including foundation models (e.g., CLIP and LLM), object classification, object detection, and semantic segmentation. -
bhaskatripathi/pdfGPT ⭐ 6,986
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your PDF files into a chatbot!
🔗 huggingface.co/spaces/bhaskartripathi/pdfgpt_turbo -
eleutherai/gpt-neox ⭐ 6,981
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
🔗 www.eleuther.ai -
skypilot-org/skypilot ⭐ 6,867
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
🔗 skypilot.readthedocs.io -
mit-han-lab/streaming-llm ⭐ 6,708
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
🔗 arxiv.org/abs/2309.17453 -
internlm/InternLM ⭐ 6,550
Official release of InternLM2.5 base and chat models. 1M context support
🔗 internlm.intern-ai.org.cn -
whitead/paper-qa ⭐ 6,533
High accuracy RAG for answering questions from scientific documents with citations -
future-house/paper-qa ⭐ 6,533
High-accuracy retrieval augmented generation (RAG) on PDFs or text files, with a focus on the scientific literature -
langchain-ai/opengpts ⭐ 6,503
An open source effort to create a similar experience to OpenAI's GPTs and Assistants API. -
weaviate/Verba ⭐ 6,432
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate -
sgl-project/sglang ⭐ 6,394
SGLang is a fast serving framework for large language models and vision language models.
🔗 sgl-project.github.io -
run-llama/rags ⭐ 6,331
Build ChatGPT over your data, all with natural language -
nat/openplayground ⭐ 6,287
An LLM playground you can run on your laptop -
lightning-ai/lit-llama ⭐ 6,011
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed. -
vikhyat/moondream ⭐ 5,989
A tiny open-source computer-vision language model designed to run efficiently on edge devices
🔗 moondream.ai -
pathwaycom/llm-app ⭐ 5,758
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳 Docker-friendly. ⚡ Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
🔗 pathway.com/developers/templates -
minedojo/Voyager ⭐ 5,713
An Open-Ended Embodied Agent with Large Language Models
🔗 voyager.minedojo.org -
pytorch-labs/gpt-fast ⭐ 5,694
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. -
langchain-ai/chat-langchain ⭐ 5,512
Locally hosted chatbot specifically focused on question answering over the LangChain documentation
🔗 chat.langchain.com -
microsoft/promptbase ⭐ 5,448
promptbase is an evolving collection of resources, best practices, and example scripts for eliciting the best performance from foundation models. -
lyogavin/airllm ⭐ 5,397
AirLLM optimizes inference memory usage, allowing 70B large language models to run inference on a single 4GB GPU card without quantization, distillation or pruning. It can also run the 405B Llama 3.1 model on 8GB of VRAM. -
dsdanielpark/Bard-API ⭐ 5,317
An unofficial Python package that returns Google Bard responses using a cookie value.
🔗 pypi.org/project/bardapi -
mnotgod96/AppAgent ⭐ 5,215
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
🔗 appagent-official.github.io -
qwenlm/Qwen-VL ⭐ 5,150
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud. -
simonw/llm ⭐ 4,990
Access large language models from the command-line
🔗 llm.datasette.io -
promptfoo/promptfoo ⭐ 4,930
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
🔗 promptfoo.dev -
cg123/mergekit ⭐ 4,910
Tools for merging pretrained large language models. -
arcee-ai/mergekit ⭐ 4,910
Tools for merging pretrained large language models. -
openbmb/ToolBench ⭐ 4,868
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
🔗 openbmb.github.io/toolbench -
allenai/OLMo ⭐ 4,865
OLMo is a repository for training and using AI2's state-of-the-art open language models. It is built by scientists, for scientists.
🔗 allenai.org/olmo -
meta-llama/llama-stack ⭐ 4,786
Llama Stack standardizes the building blocks needed to bring genai applications to market. These blocks cover model training and fine-tuning, evaluation, and running AI agents in production -
microsoft/LLMLingua ⭐ 4,702
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
🔗 llmlingua.com -
togethercomputer/RedPajama-Data ⭐ 4,591
The RedPajama-Data repository contains code for preparing large datasets for training large language models. -
modelscope/ms-swift ⭐ 4,529
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
🔗 swift.readthedocs.io/zh-cn/latest -
1rgs/jsonformer ⭐ 4,483
A Bulletproof Way to Generate Structured JSON from Language Models -
kyegomez/tree-of-thoughts ⭐ 4,386
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
🔗 discord.gg/qutxnk2nmf -
microsoft/BioGPT ⭐ 4,322
Implementation of BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining -
nvidia/NeMo-Guardrails ⭐ 4,241
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. -
open-compass/opencompass ⭐ 4,240
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
🔗 opencompass.org.cn -
instruction-tuning-with-gpt-4/GPT-4-LLM ⭐ 4,230
Instruction Tuning with GPT-4
🔗 instruction-tuning-with-gpt-4.github.io -
guardrails-ai/guardrails ⭐ 4,207
Open-source Python package for specifying structure and type, validating and correcting the outputs of large language models (LLMs)
🔗 www.guardrailsai.com/docs -
yizhongw/self-instruct ⭐ 4,189
Aligning pretrained language models with instruction data generated by themselves. -
h2oai/h2o-llmstudio ⭐ 4,049
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
🔗 h2o.ai -
mshumer/gpt-llm-trainer ⭐ 3,974
Input a description of your task, and the system will generate a dataset, parse it, and fine-tune a LLaMA 2 model for you -
meta-llama/llama-stack-apps ⭐ 3,941
Agentic components of the Llama Stack APIs -
ravenscroftj/turbopilot ⭐ 3,823
Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU -
katanaml/sparrow ⭐ 3,761
Sparrow is a solution for efficient data extraction and processing from various documents and images like invoices and receipts
🔗 katanaml.io -
turboderp/exllamav2 ⭐ 3,743
A fast inference library for running LLMs locally on modern consumer-class GPUs -
microsoft/LMOps ⭐ 3,730
General technology for enabling AI capabilities w/ LLMs and MLLMs
🔗 aka.ms/generalai -
eth-sri/lmql ⭐ 3,723
A language for constraint-guided and efficient LLM programming.
🔗 lmql.ai -
linkedin/Liger-Kernel ⭐ 3,694
Efficient Triton Kernels for LLM Training
🔗 arxiv.org/pdf/2410.10989 -
mmabrouk/llm-workflow-engine ⭐ 3,670
Power CLI and Workflow manager for LLMs (core package) -
luodian/Otter ⭐ 3,566
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
🔗 otter-ntu.github.io -
pipecat-ai/pipecat ⭐ 3,512
Open Source framework for voice and multimodal conversational AI -
minimaxir/simpleaichat ⭐ 3,491
Python package for easily interfacing with chat apps, with robust features and minimal code complexity. -
agiresearch/AIOS ⭐ 3,477
AIOS, a Large Language Model (LLM) Agent operating system, embeds large language model into Operating Systems (OS) as the brain of the OS, enabling an operating system "with soul" -- an important step towards AGI.
🔗 aios.foundation -
defog-ai/sqlcoder ⭐ 3,428
SoTA LLM for converting natural language questions to SQL queries -
minimaxir/gpt-2-simple ⭐ 3,398
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts -
iryna-kondr/scikit-llm ⭐ 3,384
Seamlessly integrate LLMs into scikit-learn.
🔗 beastbyte.ai -
truefoundry/cognita ⭐ 3,362
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
🔗 cognita.truefoundry.com -
next-gpt/NExT-GPT ⭐ 3,330
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
🔗 next-gpt.github.io -
lm-sys/RouteLLM ⭐ 3,316
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality! -
deep-diver/LLM-As-Chatbot ⭐ 3,298
LLM as a Chatbot Service -
jaymody/picoGPT ⭐ 3,266
An unnecessarily tiny implementation of GPT-2 in NumPy. -
bclavie/RAGatouille ⭐ 3,105
Bridging the gap between state-of-the-art research and alchemical RAG pipeline practices. -
crewaiinc/crewAI-examples ⭐ 3,070
A collection of examples that show how to use CrewAI framework to automate workflows. -
microsoft/torchscale ⭐ 3,032
Foundation Architecture for (M)LLMs
🔗 aka.ms/generalai -
baichuan-inc/Baichuan-13B ⭐ 2,977
A 13B large language model developed by Baichuan Intelligent Technology
🔗 huggingface.co/baichuan-inc/baichuan-13b-chat -
li-plus/chatglm.cpp ⭐ 2,952
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V) -
freedomintelligence/LLMZoo ⭐ 2,934
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡ -
huggingface/text-embeddings-inference ⭐ 2,903
A blazing fast inference solution for text embeddings models
🔗 huggingface.co/docs/text-embeddings-inference/quick_tour -
pydantic/pydantic-ai ⭐ 2,874
PydanticAI is a Python Agent Framework designed to make it less painful to build production grade applications with Generative AI.
🔗 ai.pydantic.dev -
cohere-ai/cohere-toolkit ⭐ 2,868
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. -
juncongmoo/pyllama ⭐ 2,808
LLaMA: Open and Efficient Foundation Language Models -
mistralai/mistral-finetune ⭐ 2,779
A light-weight codebase that enables memory-efficient and performant finetuning of Mistral's models. It is based on LoRA. -
verazuo/jailbreak_llms ⭐ 2,766
Official repo for the ACM CCS 2024 paper "Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts
🔗 jailbreak-llms.xinyueshen.me -
meta-llama/PurpleLlama ⭐ 2,762
Set of tools to assess and improve LLM security. -
langroid/langroid ⭐ 2,739
Harness LLMs with Multi-Agent Programming
🔗 langroid.github.io/langroid -
alpha-vllm/LLaMA2-Accessory ⭐ 2,728
An Open-source Toolkit for LLM Development
🔗 llama2-accessory.readthedocs.io -
hegelai/prompttools ⭐ 2,726
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
🔗 prompttools.readthedocs.io -
paperswithcode/galai ⭐ 2,683
Model API for GALACTICA -
mit-han-lab/llm-awq ⭐ 2,571
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration -
lightning-ai/LitServe ⭐ 2,545
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
🔗 lightning.ai/docs/litserve -
databricks/dbrx ⭐ 2,513
Code examples and resources for DBRX, a large language model developed by Databricks
🔗 www.databricks.com -
noahshinn/reflexion ⭐ 2,466
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning -
ofa-sys/OFA ⭐ 2,430
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework -
young-geng/EasyLM ⭐ 2,421
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax. -
cheshire-cat-ai/core ⭐ 2,414
Production ready AI agent framework
🔗 cheshirecat.ai -
civitai/sd_civitai_extension ⭐ 2,335
All of the Civitai models inside Automatic 1111 Stable Diffusion Web UI -
sylphai-inc/AdalFlow ⭐ 2,283
Unified auto-differentiative framework for both zero-shot prompt optimization and few-shot optimization. It advances existing auto-optimization research, including TextGrad and DSPy
🔗 adalflow.sylph.ai -
pytorch/executorch ⭐ 2,254
An end-to-end solution for enabling on-device inference capabilities across mobile and edge devices including wearables, embedded devices and microcontrollers. It is part of the PyTorch Edge ecosystem and enables efficient deployment of PyTorch models to edge devices.
🔗 pytorch.org/executorch -
intel/neural-compressor ⭐ 2,243
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
🔗 intel.github.io/neural-compressor -
predibase/lorax ⭐ 2,236
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
🔗 loraexchange.ai -
truera/trulens ⭐ 2,218
Evaluation and Tracking for LLM Experiments
🔗 www.trulens.org -
spcl/graph-of-thoughts ⭐ 2,196
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
🔗 arxiv.org/pdf/2308.09687.pdf -
janhq/cortex.cpp ⭐ 2,172
Cortex is a Local AI API Platform that is used to run and customize LLMs.
🔗 cortex.so -
openai/finetune-transformer-lm ⭐ 2,163
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
🔗 s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf -
evolvinglmms-lab/lmms-eval ⭐ 2,127
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
🔗 lmms-lab.framer.ai -
tairov/llama2.mojo ⭐ 2,103
Inference Llama 2 in one file of pure 🔥
🔗 www.modular.com/blog/community-spotlight-how-i-built-llama2-by-aydyn-tairov -
openai/image-gpt ⭐ 2,053
Archived. Code and models from the paper "Generative Pretraining from Pixels" -
openai/simple-evals ⭐ 2,015
Lightweight library for evaluating language models -
lucidrains/toolformer-pytorch ⭐ 1,977
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI -
neulab/prompt2model ⭐ 1,972
prompt2model - Generate Deployable Models from Natural Language Instructions -
azure-samples/graphrag-accelerator ⭐ 1,962
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
🔗 github.com/microsoft/graphrag -
ist-daslab/gptq ⭐ 1,958
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
🔗 arxiv.org/abs/2210.17323 -
openai/gpt-2-output-dataset ⭐ 1,946
Dataset of GPT-2 outputs for research in detection, biases, and more -
microsoft/Megatron-DeepSpeed ⭐ 1,917
Ongoing research training transformer language models at scale, including: BERT & GPT-2 -
epfllm/meditron ⭐ 1,908
Meditron is a suite of open-source medical Large Language Models (LLMs).
🔗 huggingface.co/epfl-llm -
akariasai/self-rag ⭐ 1,874
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
🔗 selfrag.github.io -
facebookresearch/chameleon ⭐ 1,861
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
🔗 arxiv.org/abs/2405.09818 -
minimaxir/aitextgen ⭐ 1,843
A robust Python tool for text-based AI training and generation using GPT-2.
🔗 docs.aitextgen.io -
casper-hansen/AutoAWQ ⭐ 1,807
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
🔗 casper-hansen.github.io/autoawq -
openai/gpt-discord-bot ⭐ 1,791
Example Discord bot written in Python that uses the completions API to have conversations with the text-davinci-003 model, and the moderations API to filter the messages. -
ray-project/llm-applications ⭐ 1,727
A comprehensive guide to building RAG-based LLM applications for production. -
argilla-io/distilabel ⭐ 1,713
Distilabel is the framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
🔗 distilabel.argilla.io -
noamgat/lm-format-enforcer ⭐ 1,590
Enforce the output format (JSON Schema, Regex etc) of a language model -
agenta-ai/agenta ⭐ 1,551
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM Observability all in one place.
🔗 www.agenta.ai -
jina-ai/thinkgpt ⭐ 1,551
Agent techniques to augment your LLM and push it beyond its limits -
qwenlm/Qwen-Audio ⭐ 1,505
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud. -
agentops-ai/tokencost ⭐ 1,488
Easy token price estimates for 400+ LLMs. TokenOps.
🔗 agentops.ai -
cstankonrad/long_llama ⭐ 1,452
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method. -
meetkai/functionary ⭐ 1,445
Chat language model that can use tools and interpret the results -
farizrahman4u/loopgpt ⭐ 1,437
Re-implementation of Auto-GPT as a python package, written with modularity and extensibility in mind. -
run-llama/llama-lab ⭐ 1,424
Llama Lab is a repo dedicated to building cutting-edge projects using LlamaIndex -
roboflow/maestro ⭐ 1,409
Streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
🔗 maestro.roboflow.com -
chatarena/chatarena ⭐ 1,373
ChatArena (or Chat Arena) provides Multi-Agent Language Game Environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs. -
huggingface/smollm ⭐ 1,354
Everything about the SmolLM & SmolLM2 family of models
🔗 huggingface.co/huggingfacetb -
explosion/spacy-transformers ⭐ 1,353
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
🔗 spacy.io/usage/embeddings-transformers -
bigscience-workshop/Megatron-DeepSpeed ⭐ 1,341
Ongoing research training transformer language models at scale, including: BERT & GPT-2 -
huggingface/nanotron ⭐ 1,321
Minimalistic large language model 3D-parallelism training -
karpathy/nano-llama31 ⭐ 1,263
This repo is to Llama 3.1 what nanoGPT is to GPT-2. i.e. it is a minimal, dependency-free implementation of the Llama 3.1 architecture -
ray-project/ray-llm ⭐ 1,240
RayLLM - LLMs on Ray
🔗 aviary.anyscale.com -
linksoul-ai/AutoAgents ⭐ 1,219
[IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.
🔗 huggingface.co/spaces/linksoul/autoagents -
srush/MiniChain ⭐ 1,219
A tiny library for coding with large language models.
🔗 srush-minichain.hf.space -
brainblend-ai/atomic-agents ⭐ 1,216
Atomic Agents provides a set of tools and agents that can be combined to create powerful applications. It is built on top of Instructor and leverages the power of Pydantic for data and schema validation and serialization. -
keirp/automatic_prompt_engineer ⭐ 1,176
Large Language Models Are Human-Level Prompt Engineers -
facebookresearch/MobileLLM ⭐ 1,175
Training code of MobileLLM introduced in our work: "MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases" -
mlfoundations/dclm ⭐ 1,168
DataComp for Language Models -
hao-ai-lab/LookaheadDecoding ⭐ 1,159
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
🔗 arxiv.org/abs/2402.02057 -
explosion/spacy-llm ⭐ 1,148
🦙 Integrating LLMs into structured NLP pipelines
🔗 spacy.io/usage/large-language-models -
answerdotai/rerankers ⭐ 1,137
Welcome to rerankers! Our goal is to provide users with a simple API to use any reranking models. -
ibm/Dromedary ⭐ 1,128
Dromedary: towards helpful, ethical and reliable LLMs. -
lupantech/chameleon-llm ⭐ 1,091
Code for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
🔗 chameleon-llm.github.io -
rlancemartin/auto-evaluator ⭐ 1,064
Evaluation tool for LLM QA chains
🔗 autoevaluator.langchain.com -
shengranhu/ADAS ⭐ 1,058
Automated Design of Agentic Systems using Meta Agent Search to show agents can invent novel and powerful agent designs -
microsoft/Llama-2-Onnx ⭐ 1,025
A Microsoft optimized version of the Llama 2 model, available from Meta -
nomic-ai/pygpt4all ⭐ 1,023
Officially supported Python bindings for llama.cpp + gpt4all
🔗 nomic-ai.github.io/pygpt4all -
ctlllll/LLM-ToolMaker ⭐ 1,022
Large Language Models as Tool Makers -
pinecone-io/canopy ⭐ 980
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
🔗 www.pinecone.io -
ajndkr/lanarky ⭐ 978
The web framework for building LLM microservices
🔗 lanarky.ajndkr.com -
likejazz/llama3.np ⭐ 978
llama3.np is a pure NumPy implementation for Llama 3 model. -
cerebras/modelzoo ⭐ 973
Examples of common deep learning models that can be trained on Cerebras hardware -
topoteretes/cognee ⭐ 959
Reliable LLM Memory for AI Applications and AI Agents
🔗 www.cognee.ai -
huggingface/optimum-nvidia ⭐ 895
Optimum-NVIDIA delivers the best inference performance on the NVIDIA platform through Hugging Face. Run LLaMA 2 at 1,200 tokens/second (up to 28x faster than the framework) -
huggingface/evaluation-guidebook ⭐ 880
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval! -
soulter/hugging-chat-api ⭐ 871
HuggingChat Python API🤗 -
huggingface/lighteval ⭐ 864
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron. -
datadreamer-dev/DataDreamer ⭐ 863
DataDreamer is a powerful open-source Python library for prompting, synthetic data generation, and training workflows. It is designed to be simple, extremely efficient, and research-grade.
🔗 datadreamer.dev -
muennighoff/sgpt ⭐ 854
SGPT: GPT Sentence Embeddings for Semantic Search
🔗 arxiv.org/abs/2202.08904 -
prometheus-eval/prometheus-eval ⭐ 812
Evaluate your LLM's response with Prometheus and GPT4 💯 -
langchain-ai/langsmith-cookbook ⭐ 792
LangSmith is a platform for building production-grade LLM applications.
🔗 langsmith-cookbook.vercel.app -
junruxiong/IncarnaMind ⭐ 784
Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMs
🔗 www.incarnamind.com -
oliveirabruno01/babyagi-asi ⭐ 783
BabyAGI: an Autonomous and Self-Improving agent, or BASI -
opengenerativeai/GenossGPT ⭐ 753
One API for all LLMs either Private or Public (Anthropic, Llama V2, GPT 3.5/4, Vertex, GPT4ALL, HuggingFace ...) 🌈🐂 Replace OpenAI GPT with any LLMs in your app with one line.
🔗 genoss.ai -
wandb/weave ⭐ 740
Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.
🔗 wandb.me/weave -
modelcontextprotocol/python-sdk ⭐ 739
The Model Context Protocol allows applications to provide context for LLMs in a standardized way, separating the concerns of providing context from the actual LLM interaction.
🔗 modelcontextprotocol.io -
opengvlab/OmniQuant ⭐ 734
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs. -
nousresearch/Hermes-Function-Calling ⭐ 733
Code for the Hermes Pro Large Language Model to perform function calling based on the provided schema. It allows users to query the model and retrieve information related to stock prices, company fundamentals, financial statements -
salesforce/xgen ⭐ 718
Salesforce open-source LLMs with 8k sequence length. -
deedy/mac_computer_use ⭐ 662
A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.
🔗 x.com/deedydas/status/1849481225041559910 -
squeezeailab/SqueezeLLM ⭐ 655
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
🔗 arxiv.org/abs/2306.07629 -
tag-research/TAG-Bench ⭐ 611
Table-Augmented Generation (TAG) is a unified and general-purpose paradigm for answering natural language questions over databases
🔗 arxiv.org/pdf/2408.14717 -
lupantech/ScienceQA ⭐ 609
Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering". -
joshuac215/agent-service-toolkit ⭐ 597
A full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit.
🔗 agent-service-toolkit.streamlit.app -
tsinghuadatabasegroup/DB-GPT ⭐ 574
LLM As Database Administrator
🔗 dbgpt.dbmind.cn -
zhudotexe/kani ⭐ 559
kani (カニ) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)
🔗 kani.readthedocs.io -
microsoft/VPTQ ⭐ 541
Extreme Low-bit Vector Post-Training Quantization for Large Language Models -
modal-labs/llm-finetuning ⭐ 540
Guide for fine-tuning Llama/Mistral/CodeLlama models and more -
hazyresearch/ama_prompting ⭐ 540
Ask Me Anything language model prompting -
declare-lab/instruct-eval ⭐ 532
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
🔗 declare-lab.github.io/instruct-eval -
vahe1994/SpQR ⭐ 529
Quantization algorithm and the model evaluation code for SpQR method for LLM compression -
magnivorg/prompt-layer-library ⭐ 525
🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.
🔗 www.promptlayer.com -
continuum-llms/chatgpt-memory ⭐ 519
Allows scaling the ChatGPT API to multiple simultaneous sessions with infinite contextual and adaptive memory powered by GPT and a Redis datastore. -
eugeneyan/obsidian-copilot ⭐ 515
🤖 A prototype assistant for writing and thinking
🔗 eugeneyan.com/writing/obsidian-copilot -
hazyresearch/H3 ⭐ 515
Language Modeling with the H3 State Space Model -
minishlab/model2vec ⭐ 506
Model2Vec is a technique to turn any sentence transformer into a really small static model, reducing model size by 15x and making the models up to 500x faster, with a small drop in performance
🔗 minishlab.github.io -
judahpaul16/gpt-home ⭐ 475
ChatGPT at home! Basically a better Google Nest Hub or Amazon Alexa home assistant. Built on the Raspberry Pi using the OpenAI API.
🔗 hub.docker.com/r/judahpaul/gpt-home -
huggingface/text-clustering ⭐ 467
Easily embed, cluster and semantically label text datasets -
mlc-ai/xgrammar ⭐ 442
XGrammar is an open-source library for efficient, flexible, and portable structured generation. It supports general context-free grammar to enable a broad range of structures while bringing careful system optimizations to enable fast executions.
🔗 xgrammar.mlc.ai -
stanford-oval/suql ⭐ 218
SUQL: Conversational Search over Structured and Unstructured Data with LLMs
🔗 arxiv.org/abs/2311.09818 -
dottxt-ai/outlines-core ⭐ 142
Core functionality for structured generation, formerly implemented in Outlines, with a focus on performance and portability.
Math and Science
Mathematical, numerical and scientific libraries.
-
numpy/numpy ⭐ 28,243
The fundamental package for scientific computing with Python.
🔗 numpy.org -
taichi-dev/taichi ⭐ 25,597
Productive, portable, and performant GPU programming in Python: Taichi Lang is an open-source, imperative, parallel programming language for high-performance numerical computation.
🔗 taichi-lang.org -
scipy/scipy ⭐ 13,172
SciPy library main repository
🔗 scipy.org -
sympy/sympy ⭐ 13,072
A computer algebra system written in pure Python
🔗 sympy.org -
google/or-tools ⭐ 11,337
Google Optimization Tools (a.k.a., OR-Tools) is an open-source, fast and portable software suite for solving combinatorial optimization problems.
🔗 developers.google.com/optimization -
z3prover/z3 ⭐ 10,435
Z3 is a theorem prover from Microsoft Research with a Python language binding. -
google-deepmind/alphageometry ⭐ 4,198
Solving Olympiad Geometry without Human Demonstrations -
mikedh/trimesh ⭐ 3,032
Python library for loading and using triangular meshes.
🔗 trimesh.org -
pyro-ppl/numpyro ⭐ 2,332
Probabilistic programming with NumPy powered by JAX for autograd and JIT compilation to GPU/TPU/CPU.
🔗 num.pyro.ai -
mckinsey/causalnex ⭐ 2,254
A Python library that helps data scientists to infer causation rather than observing correlation.
🔗 causalnex.readthedocs.io -
pyomo/pyomo ⭐ 2,058
An object-oriented algebraic modeling language in Python for structured optimization problems.
🔗 www.pyomo.org -
facebookresearch/theseus ⭐ 1,787
A library for differentiable nonlinear optimization -
arviz-devs/arviz ⭐ 1,619
Exploratory analysis of Bayesian models with Python
🔗 python.arviz.org -
google-research/torchsde ⭐ 1,589
Differentiable SDE solvers with GPU support and efficient sensitivity analysis. -
dynamicslab/pysindy ⭐ 1,467
A package for the sparse identification of nonlinear dynamical systems from data
🔗 pysindy.readthedocs.io/en/latest -
geomstats/geomstats ⭐ 1,269
Computations and statistics on manifolds with geometric structures.
🔗 geomstats.ai -
cma-es/pycma ⭐ 1,119
pycma is a Python implementation of CMA-ES and a few related numerical optimization tools. -
pymc-labs/CausalPy ⭐ 922
A Python package for causal inference in quasi-experimental settings
🔗 causalpy.readthedocs.io -
sj001/AI-Feynman ⭐ 646
Implementation of AI Feynman: a Physics-Inspired Method for Symbolic Regression -
willianfuks/tfcausalimpact ⭐ 621
Python Causal Impact Implementation Based on Google's R Package. Built using TensorFlow Probability. -
lean-dojo/LeanDojo ⭐ 583
Tool for data extraction and interacting with Lean programmatically.
🔗 leandojo.org -
brandondube/prysm ⭐ 271
Prysm is an open-source library for physical and first-order modeling of optical systems and analysis of related data: numerical and physical optics, integrated modeling, phase retrieval, segmented systems, polynomials and fitting, sequential raytracing.
🔗 prysm.readthedocs.io/en/stable -
lean-dojo/ReProver ⭐ 233
Retrieval-Augmented Theorem Provers for Lean
🔗 leandojo.org -
albahnsen/pycircular ⭐ 98
pycircular is a Python module for circular data analysis -
gbillotey/Fractalshades ⭐ 28
Arbitrary-precision fractal explorer - Python package
Machine Learning - General
General and classical machine learning libraries. See below for other sections covering specialised ML areas.
-
openai/openai-cookbook ⭐ 60,505
Examples and guides for using the OpenAI API
🔗 cookbook.openai.com -
scikit-learn/scikit-learn ⭐ 60,371
scikit-learn: machine learning in Python
🔗 scikit-learn.org -
suno-ai/bark ⭐ 36,362
🔊 Text-Prompted Generative Audio Model -
tencentarc/GFPGAN ⭐ 35,976
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration. -
google-research/google-research ⭐ 34,443
This repository contains code released by Google Research
🔗 research.google -
facebookresearch/faiss ⭐ 31,817
A library for efficient similarity search and clustering of dense vectors.
🔗 faiss.ai -
google/jax ⭐ 30,691
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
🔗 jax.readthedocs.io -
open-mmlab/mmdetection ⭐ 29,761
OpenMMLab Detection Toolbox and Benchmark
🔗 mmdetection.readthedocs.io -
lutzroeder/netron ⭐ 28,575
Visualizer for neural network, deep learning and machine learning models
🔗 netron.app -
ageron/handson-ml2 ⭐ 28,079
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2. -
google/mediapipe ⭐ 27,856
Cross-platform, customizable ML solutions for live and streaming media.
🔗 ai.google.dev/edge/mediapipe -
dmlc/xgboost ⭐ 26,383
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
🔗 xgboost.readthedocs.io/en/stable -
roboflow/supervision ⭐ 24,413
We write your reusable computer vision tools. 💜
🔗 supervision.roboflow.com -
harisiqbal88/PlotNeuralNet ⭐ 22,307
LaTeX code for drawing neural network diagrams -
jina-ai/serve ⭐ 21,166
☁️ Build multimodal AI applications with cloud-native stack
🔗 jina.ai/serve -
onnx/onnx ⭐ 18,047
Open standard for machine learning interoperability
🔗 onnx.ai -
ml-explore/mlx ⭐ 17,721
MLX is an array framework for machine learning on Apple silicon, brought to you by Apple machine learning research.
🔗 ml-explore.github.io/mlx -
microsoft/LightGBM ⭐ 16,757
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
🔗 lightgbm.readthedocs.io/en/latest -
ddbourgin/numpy-ml ⭐ 15,701
Machine learning, in numpy
🔗 numpy-ml.readthedocs.io -
tensorflow/tensor2tensor ⭐ 15,629
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research. -
microsoft/onnxruntime ⭐ 14,913
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
🔗 onnxruntime.ai -
aleju/imgaug ⭐ 14,448
Image augmentation for machine learning experiments.
🔗 imgaug.readthedocs.io -
microsoft/nni ⭐ 14,071
An open source AutoML toolkit to automate the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
🔗 nni.readthedocs.io -
jindongwang/transferlearning ⭐ 13,521
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, code, datasets, applications, tutorials.
🔗 transferlearning.xyz -
neonbjb/tortoise-tts ⭐ 13,331
A multi-voice TTS system trained with an emphasis on quality -
deepmind/deepmind-research ⭐ 13,313
This repository contains implementations and illustrative code to accompany DeepMind publications -
spotify/annoy ⭐ 13,305
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk -
deepmind/alphafold ⭐ 12,953
Implementation of the inference pipeline of AlphaFold v2 -
ggerganov/ggml ⭐ 11,326
Tensor library for machine learning -
optuna/optuna ⭐ 11,035
A hyperparameter optimization framework
🔗 optuna.org -
facebookresearch/AnimatedDrawings ⭐ 10,658
Code to accompany "A Method for Animating Children's Drawings of the Human Figure" -
statsmodels/statsmodels ⭐ 10,219
Statsmodels: statistical modeling and econometrics in Python
🔗 www.statsmodels.org/devel -
twitter/the-algorithm-ml ⭐ 10,140
Source code for Twitter's Recommendation Algorithm
🔗 blog.twitter.com/engineering/en_us/topics/open-source/2023/twitter-recommendation-algorithm -
cleanlab/cleanlab ⭐ 9,811
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
🔗 cleanlab.ai -
epistasislab/tpot ⭐ 9,762
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
🔗 epistasislab.github.io/tpot -
thudm/CogVideo ⭐ 9,596
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) -
megvii-basedetection/YOLOX ⭐ 9,493
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/ -
wandb/wandb ⭐ 9,232
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
🔗 wandb.ai -
pycaret/pycaret ⭐ 9,002
An open-source, low-code machine learning library in Python
🔗 www.pycaret.org -
pymc-devs/pymc ⭐ 8,755
Bayesian Modeling and Probabilistic Programming in Python
🔗 docs.pymc.io -
facebookresearch/xformers ⭐ 8,737
Hackable and optimized Transformers building blocks, supporting a composable construction.
🔗 facebookresearch.github.io/xformers -
uberi/speech_recognition ⭐ 8,465
Speech recognition module for Python, supporting several engines and APIs, online and offline.
🔗 pypi.python.org/pypi/speechrecognition -
open-mmlab/mmsegmentation ⭐ 8,379
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
🔗 mmsegmentation.readthedocs.io/en/main -
awslabs/autogluon ⭐ 8,137
Fast and Accurate ML in 3 Lines of Code
🔗 auto.gluon.ai -
catboost/catboost ⭐ 8,128
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
🔗 catboost.ai -
huggingface/accelerate ⭐ 8,034
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
🔗 huggingface.co/docs/accelerate -
automl/auto-sklearn ⭐ 7,663
Automated Machine Learning with scikit-learn
🔗 automl.github.io/auto-sklearn -
lmcinnes/umap ⭐ 7,502
Uniform Manifold Approximation and Projection -
featurelabs/featuretools ⭐ 7,291
An open source python library for automated feature engineering
🔗 www.featuretools.com -
hyperopt/hyperopt ⭐ 7,282
Distributed Asynchronous Hyperparameter Optimization in Python
🔗 hyperopt.github.io/hyperopt -
py-why/dowhy ⭐ 7,167
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
🔗 www.pywhy.org/dowhy -
hips/autograd ⭐ 7,043
Efficiently computes derivatives of NumPy code. -
open-mmlab/mmagic ⭐ 6,971
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
🔗 mmagic.readthedocs.io/en/latest -
scikit-learn-contrib/imbalanced-learn ⭐ 6,862
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
🔗 imbalanced-learn.org -
probml/pyprobml ⭐ 6,570
Python code for "Probabilistic Machine learning" book by Kevin Murphy -
nicolashug/Surprise ⭐ 6,427
A Python scikit for building and analyzing recommender systems
🔗 surpriselib.com -
ml-explore/mlx-examples ⭐ 6,321
Examples in the MLX framework -
cleverhans-lab/cleverhans ⭐ 6,216
An adversarial example library for constructing attacks, building defenses, and benchmarking both -
kevinmusgrave/pytorch-metric-learning ⭐ 6,034
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
🔗 kevinmusgrave.github.io/pytorch-metric-learning -
project-monai/MONAI ⭐ 5,940
AI Toolkit for Healthcare Imaging
🔗 monai.io -
open-mmlab/mmcv ⭐ 5,938
OpenMMLab Computer Vision Foundation
🔗 mmcv.readthedocs.io/en/latest -
yangchris11/samurai ⭐ 5,721
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
🔗 yangchris11.github.io/samurai -
google-gemini/cookbook ⭐ 5,519
A collection of guides and examples for the Gemini API, including quickstart tutorials for writing prompts.
🔗 ai.google.dev/gemini-api/docs -
uber/causalml ⭐ 5,123
Uplift modeling and causal inference with machine learning algorithms -
online-ml/river ⭐ 5,113
🌊 Online machine learning in Python
🔗 riverml.xyz -
mdbloice/Augmentor ⭐ 5,081
Image augmentation library in Python for machine learning.
🔗 augmentor.readthedocs.io/en/stable -
google-deepmind/graphcast ⭐ 5,058
GraphCast: Learning skillful medium-range global weather forecasting -
rasbt/mlxtend ⭐ 4,921
A library of extension and helper modules for Python's data analysis and machine learning libraries.
🔗 rasbt.github.io/mlxtend -
marqo-ai/marqo ⭐ 4,671
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
🔗 www.marqo.ai -
skvark/opencv-python ⭐ 4,580
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
🔗 pypi.org/project/opencv-python -
apple/coremltools ⭐ 4,463
Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
🔗 coremltools.readme.io -
sanchit-gandhi/whisper-jax ⭐ 4,458
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. -
nmslib/hnswlib ⭐ 4,422
Header-only C++/python library for fast approximate nearest neighbors
🔗 github.com/nmslib/hnswlib -
lucidrains/deep-daze ⭐ 4,375
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun -
districtdatalabs/yellowbrick ⭐ 4,300
Visual analysis and diagnostic tools to facilitate machine learning model selection.
🔗 www.scikit-yb.org -
nv-tlabs/GET3D ⭐ 4,256
Generative Model of High Quality 3D Textured Shapes Learned from Images -
huggingface/autotrain-advanced ⭐ 4,132
AutoTrain Advanced: faster and easier training and deployments of state-of-the-art machine learning models
🔗 huggingface.co/autotrain -
cmusphinx/pocketsphinx ⭐ 3,970
A small speech recognizer -
microsoft/FLAML ⭐ 3,955
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
🔗 microsoft.github.io/flaml -
ourownstory/neural_prophet ⭐ 3,903
NeuralProphet: A simple forecasting package
🔗 neuralprophet.com -
py-why/EconML ⭐ 3,884
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to brin...
🔗 www.microsoft.com/en-us/research/project/alice -
huggingface/notebooks ⭐ 3,712
Notebooks using the Hugging Face libraries 🤗 -
zjunlp/DeepKE ⭐ 3,610
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
🔗 deepke.zjukg.cn -
huggingface/speech-to-speech ⭐ 3,588
Speech To Speech: an effort for an open-sourced and modular GPT4-o -
rucaibox/RecBole ⭐ 3,481
A unified, comprehensive and efficient recommendation library
🔗 recbole.io -
yoheinakajima/instagraph ⭐ 3,478
Converts text input or URL into knowledge graph and displays -
facebookresearch/vissl ⭐ 3,261
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
🔗 vissl.ai -
pytorch/glow ⭐ 3,243
Compiler for Neural Network hardware accelerators -
lightly-ai/lightly ⭐ 3,197
A python library for self-supervised learning on images.
🔗 docs.lightly.ai/self-supervised-learning -
lucidrains/musiclm-pytorch ⭐ 3,181
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch -
hrnet/HRNet-Semantic-Segmentation ⭐ 3,172
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919 -
mljar/mljar-supervised ⭐ 3,071
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
🔗 mljar.com -
shankarpandala/lazypredict ⭐ 3,037
Lazy Predict helps build a lot of basic models without much code and helps understand which models work better without any parameter tuning -
huggingface/safetensors ⭐ 2,935
Implements a new simple format for storing tensors safely (as opposed to pickle) and that is still fast (zero-copy).
🔗 huggingface.co/docs/safetensors -
scikit-learn-contrib/hdbscan ⭐ 2,820
A high performance implementation of HDBSCAN clustering.
🔗 hdbscan.readthedocs.io/en/latest -
scikit-optimize/scikit-optimize ⭐ 2,743
Sequential model-based optimization with a scipy.optimize interface
🔗 scikit-optimize.github.io -
google-research/t5x ⭐ 2,706
T5X is a modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models (starting with language) at many scales. -
huggingface/optimum ⭐ 2,604
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
🔗 huggingface.co/docs/optimum/main -
apple/ml-ane-transformers ⭐ 2,568
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE) -
freedmand/semantra ⭐ 2,519
Semantra is a multipurpose tool for semantically searching documents. Query by meaning rather than just by matching text. -
rom1504/clip-retrieval ⭐ 2,432
Easily compute clip embeddings and build a clip retrieval system with them
🔗 rom1504.github.io/clip-retrieval -
scikit-learn-contrib/category_encoders ⭐ 2,414
A library of sklearn compatible categorical variable encoders
🔗 contrib.scikit-learn.org/category_encoders -
eric-mitchell/direct-preference-optimization ⭐ 2,225
Reference implementation for DPO (Direct Preference Optimization) -
huggingface/huggingface_hub ⭐ 2,152
The official Python client for the Huggingface Hub.
🔗 huggingface.co/docs/huggingface_hub -
aws/sagemaker-python-sdk ⭐ 2,107
A library for training and deploying machine learning models on Amazon SageMaker
🔗 sagemaker.readthedocs.io -
huggingface/evaluate ⭐ 2,049
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
🔗 huggingface.co/docs/evaluate -
contextlab/hypertools ⭐ 1,829
A Python toolbox for gaining geometric insights into high-dimensional data
🔗 hypertools.readthedocs.io/en/latest -
rentruewang/koila ⭐ 1,823
Prevent PyTorch's "CUDA error: out of memory" in just 1 line of code.
🔗 koila.rentruewang.com -
linkedin/greykite ⭐ 1,815
A flexible, intuitive and fast forecasting library -
bmabey/pyLDAvis ⭐ 1,807
Python library for interactive topic model visualization. Port of the R LDAvis package. -
scikit-learn-contrib/lightning ⭐ 1,729
Large-scale linear classification, regression and ranking in Python
🔗 contrib.scikit-learn.org/lightning -
tensorflow/addons ⭐ 1,694
Useful extra functionality for TensorFlow 2.x maintained by SIG-addons -
castorini/pyserini ⭐ 1,693
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
🔗 pyserini.io -
microsoft/i-Code ⭐ 1,679
The ambition of the i-Code project is to build integrative and composable multimodal AI. The "i" stands for integrative multimodal learning. -
microsoft/Olive ⭐ 1,637
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
🔗 microsoft.github.io/olive -
visual-layer/fastdup ⭐ 1,628
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability. -
laekov/fastmoe ⭐ 1,571
A fast MoE impl for PyTorch
🔗 fastmoe.ai -
qdrant/fastembed ⭐ 1,570
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
🔗 qdrant.github.io/fastembed -
kubeflow/katib ⭐ 1,514
Automated Machine Learning on Kubernetes
🔗 www.kubeflow.org/docs/components/katib -
google/vizier ⭐ 1,485
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
🔗 oss-vizier.readthedocs.io -
jina-ai/finetuner ⭐ 1,481
🎯 Task-oriented embedding tuning for BERT, CLIP, etc.
🔗 finetuner.jina.ai -
csinva/imodels ⭐ 1,405
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
🔗 csinva.io/imodels -
patchy631/machine-learning ⭐ 1,386
Machine Learning Tutorials Repository -
microsoft/Semi-supervised-learning ⭐ 1,375
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
🔗 usb.readthedocs.io -
spotify/voyager ⭐ 1,333
🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
🔗 spotify.github.io/voyager -
borealisai/advertorch ⭐ 1,312
A Toolbox for Adversarial Robustness Research -
koaning/scikit-lego ⭐ 1,287
Extra blocks for scikit-learn pipelines.
🔗 koaning.github.io/scikit-lego -
awslabs/dgl-ke ⭐ 1,277
High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
🔗 dglke.dgl.ai/doc -
automl/TabPFN ⭐ 1,234
Official implementation of the TabPFN paper (https://arxiv.org/abs/2207.01848) and the tabpfn package.
🔗 priorlabs.ai -
lightning-ai/lightning-thunder ⭐ 1,221
Thunder is a source-to-source compiler for PyTorch. It makes PyTorch programs faster by combining and using different hardware executors at once -
pytorch/FBGEMM ⭐ 1,214
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ -
google-research/deeplab2 ⭐ 1,007
DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel labeling tasks. -
davidmrau/mixture-of-experts ⭐ 1,002
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538 -
nvidia/cuda-python ⭐ 992
CUDA Python Low-level Bindings
🔗 nvidia.github.io/cuda-python -
opentensor/bittensor ⭐ 934
Internet-scale Neural Networks
🔗 www.bittensor.com -
lmcinnes/pynndescent ⭐ 903
A Python nearest neighbor descent for approximate nearest neighbors -
oml-team/open-metric-learning ⭐ 897
OML is a PyTorch-based framework to train and validate the models producing high-quality embeddings.
🔗 open-metric-learning.readthedocs.io/en/latest/index.html -
hazyresearch/safari ⭐ 869
Convolutions for Sequence Modeling -
huggingface/optimum-quanto ⭐ 842
A pytorch quantization backend for optimum -
criteo/autofaiss ⭐ 821
Automatically create Faiss knn indices with the most optimal similarity search parameters.
🔗 criteo.github.io/autofaiss -
replicate/replicate-python ⭐ 773
Python client for Replicate
🔗 replicate.com -
awslabs/python-deequ ⭐ 732
Python API for Deequ, a library built on Spark for defining "unit tests for data", which measure data quality in large datasets -
pymc-labs/pymc-marketing ⭐ 719
Bayesian marketing toolbox in PyMC. Media Mix (MMM), customer lifetime value (CLV), buy-till-you-die (BTYD) models and more.
🔗 www.pymc-marketing.io -
facebookresearch/balance ⭐ 688
The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to some target population of interest.
🔗 import-balance.org -
nicolas-hbt/pygraft ⭐ 675
Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
🔗 pygraft.readthedocs.io/en/latest -
googleapis/python-aiplatform ⭐ 656
A Python SDK for Vertex AI, a fully managed, end-to-end platform for data science and machine learning. -
qdrant/quaterion ⭐ 644
Blazing fast framework for fine-tuning similarity learning models
🔗 quaterion.qdrant.tech -
hpcaitech/EnergonAI ⭐ 629
Large-scale model inference. -
huggingface/exporters ⭐ 623
Export Hugging Face models to Core ML and TensorFlow Lite -
intellabs/bayesian-torch ⭐ 547
A library for Bayesian neural network layers and uncertainty estimation in Deep Learning extending the core of PyTorch -
microsoft/Focal-Transformer ⭐ 546
[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers" -
cvxgrp/pymde ⭐ 539
Minimum-distortion embedding with PyTorch
🔗 pymde.org -
intel/intel-npu-acceleration-library ⭐ 531
The Intel NPU Acceleration Library is a Python library designed to boost the efficiency of your applications by leveraging the power of the Intel Neural Processing Unit (NPU) to perform high-speed computations on compatible hardware. -
nevronai/MetisFL ⭐ 529
The first open Federated Learning framework implemented in C++ and Python.
🔗 metisfl.org -
mrdbourke/m1-machine-learning-test ⭐ 529
Code for testing various M1 Chip benchmarks with TensorFlow. -
linkedin/FastTreeSHAP ⭐ 524
Fast SHAP value computation for interpreting tree-based models -
lightning-ai/litdata ⭐ 378
Transform datasets at scale. Optimize datasets for fast AI model training. -
dylanhogg/gptauthor ⭐ 62
GPTAuthor is an AI tool for writing long form, multi-chapter stories given a story prompt.
Machine learning libraries that cross over with deep learning in some way.
-
tensorflow/tensorflow ⭐ 186,728
An Open Source Machine Learning Framework for Everyone
🔗 tensorflow.org -
pytorch/pytorch ⭐ 84,795
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🔗 pytorch.org -
openai/whisper ⭐ 72,381
Robust Speech Recognition via Large-Scale Weak Supervision -
keras-team/keras ⭐ 62,173
Deep Learning for humans
🔗 keras.io -
deepfakes/faceswap ⭐ 52,643
Deepfakes Software For All
🔗 www.faceswap.dev -
facebookresearch/segment-anything ⭐ 48,000
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model. -
microsoft/DeepSpeed ⭐ 35,782
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
🔗 www.deepspeed.ai -
rwightman/pytorch-image-models ⭐ 32,514
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
🔗 huggingface.co/docs/timm -
facebookresearch/detectron2 ⭐ 30,722
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
🔗 detectron2.readthedocs.io/en/latest -
xinntao/Real-ESRGAN ⭐ 28,701
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration. -
lightning-ai/pytorch-lightning ⭐ 28,577
The deep learning framework to pretrain, finetune and deploy AI models. PyTorch Lightning is just organized PyTorch - Lightning disentangles PyTorch code to decouple the science from the engineering.
🔗 lightning.ai -
google-research/tuning_playbook ⭐ 27,482
A playbook for systematically maximizing the performance of deep learning models. -
openai/CLIP ⭐ 26,356
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image -
facebookresearch/Detectron ⭐ 26,288
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet. -
matterport/Mask_RCNN ⭐ 24,756
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow -
paddlepaddle/Paddle ⭐ 22,322
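PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training, and cross-platform deployment, for deep learning and machine learning)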
🔗 www.paddlepaddle.org -
pyg-team/pytorch_geometric ⭐ 21,531
Graph Neural Network Library for PyTorch
🔗 pyg.org -
lucidrains/vit-pytorch ⭐ 20,961
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch -
apache/mxnet ⭐ 20,790
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
🔗 mxnet.apache.org -
sanster/IOPaint ⭐ 19,763
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
🔗 www.iopaint.com -
danielgatis/rembg ⭐ 17,236
Rembg is a tool to remove image backgrounds -
rasbt/deeplearning-models ⭐ 16,780
A collection of various deep learning architectures, models, and tips -
albumentations-team/albumentations ⭐ 14,361
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
🔗 albumentations.ai -
microsoft/Swin-Transformer ⭐ 14,028
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
🔗 arxiv.org/abs/2103.14030 -
facebookresearch/detr ⭐ 13,717
End-to-End Object Detection with Transformers -
nvidia/DeepLearningExamples ⭐ 13,644
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure. -
dmlc/dgl ⭐ 13,580
Python package built to ease deep learning on graphs, on top of existing DL frameworks.
🔗 dgl.ai -
mlfoundations/open_clip ⭐ 10,508
Open source implementation of OpenAI's CLIP (Contrastive Language-Image Pre-training). -
kornia/kornia ⭐ 10,040
Geometric Computer Vision Library for Spatial AI
🔗 kornia.readthedocs.io -
keras-team/autokeras ⭐ 9,167
AutoML library for deep learning
🔗 autokeras.com -
modelscope/facechain ⭐ 9,163
FaceChain is a deep-learning toolchain for generating your Digital-Twin. -
facebookresearch/pytorch3d ⭐ 8,869
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
🔗 pytorch3d.org -
pyro-ppl/pyro ⭐ 8,593
Deep universal probabilistic programming with Python and PyTorch
🔗 pyro.ai -
arogozhnikov/einops ⭐ 8,566
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
🔗 einops.rocks -
nvidia/apex ⭐ 8,443
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch -
facebookresearch/ImageBind ⭐ 8,407
ImageBind: One Embedding Space to Bind Them All -
lucidrains/imagen-pytorch ⭐ 8,115
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch -
google/trax ⭐ 8,113
Trax — Deep Learning with Clear Code and Speed -
xpixelgroup/BasicSR ⭐ 6,933
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also supports StyleGAN2, DFDNet.
🔗 basicsr.readthedocs.io/en/latest -
google/flax ⭐ 6,179
Flax is a neural network library for JAX that is designed for flexibility.
🔗 flax.readthedocs.io -
tencent/HunyuanVideo ⭐ 6,092
HunyuanVideo: A Systematic Framework For Large Video Generation Model
🔗 aivideo.hunyuan.tencent.com -
skorch-dev/skorch ⭐ 5,900
A scikit-learn compatible neural network library that wraps PyTorch -
facebookresearch/mmf ⭐ 5,509
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
🔗 mmf.sh -
mosaicml/composer ⭐ 5,183
Supercharge Your Model Training
🔗 docs.mosaicml.com -
deci-ai/super-gradients ⭐ 4,618
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
🔗 www.supergradients.com -
facebookincubator/AITemplate ⭐ 4,570
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference. -
pytorch/ignite ⭐ 4,550
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
🔗 pytorch-ignite.ai -
nvidiagameworks/kaolin ⭐ 4,541
A PyTorch Library for Accelerating 3D Deep Learning Research -
williamyang1991/VToonify ⭐ 3,551
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer -
cvg/LightGlue ⭐ 3,464
LightGlue: Local Feature Matching at Light Speed (ICCV 2023) -
facebookresearch/PyTorch-BigGraph ⭐ 3,391
Generate embeddings from large-scale graph-structured data.
🔗 torchbiggraph.readthedocs.io -
google-research/scenic ⭐ 3,349
Scenic: A Jax Library for Computer Vision Research and Beyond -
pytorch/botorch ⭐ 3,118
Bayesian optimization in PyTorch
🔗 botorch.org -
alpa-projects/alpa ⭐ 3,083
Training and serving large-scale neural networks with auto parallelization.
🔗 alpa.ai -
deepmind/dm-haiku ⭐ 2,914
JAX-based neural network library
🔗 dm-haiku.readthedocs.io -
explosion/thinc ⭐ 2,824
🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
🔗 thinc.ai -
nerdyrodent/VQGAN-CLIP ⭐ 2,619
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab. -
danielegrattarola/spektral ⭐ 2,372
Graph Neural Networks with Keras and Tensorflow 2.
🔗 graphneural.network -
google-research/electra ⭐ 2,340
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators -
fepegar/torchio ⭐ 2,081
Medical imaging toolkit for deep learning
🔗 torchio.org -
neuralmagic/sparseml ⭐ 2,079
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models -
pytorch/torchrec ⭐ 1,963
Pytorch domain library for recommendation systems
🔗 pytorch.org/torchrec -
modelscope/ClearerVoice-Studio ⭐ 1,799
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc. -
tensorflow/mesh ⭐ 1,596
Mesh TensorFlow: Model Parallelism Made Easier -
tensorly/tensorly ⭐ 1,575
TensorLy: Tensor Learning in Python.
🔗 tensorly.org -
vt-vl-lab/FGVC ⭐ 1,553
[ECCV 2020] Flow-edge Guided Video Completion -
calculatedcontent/WeightWatcher ⭐ 1,481
The WeightWatcher tool for predicting the accuracy of Deep Neural Networks -
jeshraghian/snntorch ⭐ 1,374
Deep and online learning with spiking neural networks in Python
🔗 snntorch.readthedocs.io/en/latest -
hysts/pytorch_image_classification ⭐ 1,369
PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet -
xl0/lovely-tensors ⭐ 1,119
Tensors, for human consumption
🔗 xl0.github.io/lovely-tensors -
deepmind/android_env ⭐ 1,024
RL research on Android devices. -
tensorflow/similarity ⭐ 1,013
TensorFlow Similarity is a python package focused on making similarity learning quick and easy. -
keras-team/keras-cv ⭐ 1,010
Industry-strength Computer Vision workflows with Keras -
kakaobrain/rq-vae-transformer ⭐ 800
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22) -
deepmind/chex ⭐ 795
Chex is a library of utilities for helping to write reliable JAX code
🔗 chex.readthedocs.io -
mlfoundations/datacomp ⭐ 663
DataComp: In search of the next generation of multimodal datasets
🔗 datacomp.ai -
whitead/dmol-book ⭐ 622
Deep learning for molecules and materials book
🔗 dmol.pub -
allenai/reward-bench ⭐ 454
RewardBench is a benchmark designed to evaluate the capabilities and safety of reward models (including those trained with Direct Preference Optimization, DPO)
🔗 huggingface.co/spaces/allenai/reward-bench
Machine learning interpretability libraries. Covers explainability, prediction explanations, dashboards, understanding knowledge development in training.
-
slundberg/shap ⭐ 23,026
A game theoretic approach to explain the output of any machine learning model.
🔗 shap.readthedocs.io -
marcotcr/lime ⭐ 11,650
Lime: Explaining the predictions of any machine learning classifier -
interpretml/interpret ⭐ 6,314
Fit interpretable models. Explain blackbox machine learning.
🔗 interpret.ml/docs -
pytorch/captum ⭐ 4,964
Model interpretability and understanding for PyTorch
🔗 captum.ai -
tensorflow/lucid ⭐ 4,678
A collection of infrastructure and tools for research in neural network interpretability. -
arize-ai/phoenix ⭐ 4,160
AI Observability & Evaluation
🔗 docs.arize.com/phoenix -
pair-code/lit ⭐ 3,499
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
🔗 pair-code.github.io/lit -
teamhg-memex/eli5 ⭐ 2,761
A library for debugging/inspecting machine learning classifiers and explaining their predictions
🔗 eli5.readthedocs.io -
maif/shapash ⭐ 2,747
🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
🔗 maif.github.io/shapash -
seldonio/alibi ⭐ 2,419
Algorithms for explaining machine learning models
🔗 docs.seldon.io/projects/alibi/en/stable -
oegedijk/explainerdashboard ⭐ 2,319
Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
🔗 explainerdashboard.readthedocs.io -
eleutherai/pythia ⭐ 2,304
Interpretability analysis and scaling laws to understand how knowledge develops and evolves during training in autoregressive transformers -
jalammar/ecco ⭐ 1,986
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).
🔗 ecco.readthedocs.io -
google-deepmind/penzai ⭐ 1,691
A JAX library for writing models as legible, functional pytree data structures, along with tools for visualizing, modifying, and analyzing them. Penzai focuses on making it easy to do stuff with models after they have been trained
🔗 penzai.readthedocs.io -
trusted-ai/AIX360 ⭐ 1,639
Interpretability and explainability of data and machine learning models
🔗 aix360.res.ibm.com -
transformerlensorg/TransformerLens ⭐ 1,634
A library for mechanistic interpretability of GPT-style language models
🔗 transformerlensorg.github.io/transformerlens -
cdpierse/transformers-interpret ⭐ 1,301
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code. -
selfexplainml/PiML-Toolbox ⭐ 1,217
PiML (Python Interpretable Machine Learning) toolbox for model development & diagnostics
🔗 selfexplainml.github.io/piml-toolbox -
ethicalml/xai ⭐ 1,131
XAI is a Machine Learning library that is designed with AI explainability at its core. XAI contains various tools that enable analysis and evaluation of data and models
🔗 ethical.institute/principles.html#commitment-3 -
salesforce/OmniXAI ⭐ 882
OmniXAI: A Library for eXplainable AI -
andyzoujm/representation-engineering ⭐ 736
Representation Engineering: A Top-Down Approach to AI Transparency
🔗 www.ai-transparency.org -
jbloomaus/SAELens ⭐ 511
Training Sparse Autoencoders on LLMs. Analyse sparse autoencoders and neural network internals.
🔗 jbloomaus.github.io/saelens
MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models.
-
apache/airflow ⭐ 37,505
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
🔗 airflow.apache.org -
ray-project/ray ⭐ 34,323
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
🔗 ray.io -
mlflow/mlflow ⭐ 18,962
Open source platform for the machine learning lifecycle
🔗 mlflow.org -
spotify/luigi ⭐ 17,935
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in. -
prefecthq/prefect ⭐ 17,704
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
🔗 prefect.io -
kestra-io/kestra ⭐ 14,493
⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
🔗 kestra.io -
horovod/horovod ⭐ 14,287
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
🔗 horovod.ai -
iterative/dvc ⭐ 13,993
🦉 Data Versioning and ML Experiments
🔗 dvc.org -
dagster-io/dagster ⭐ 12,021
An orchestration platform for the development, production, and observation of data assets.
🔗 dagster.io -
ludwig-ai/ludwig ⭐ 11,224
Low-code framework for building custom LLMs, neural networks, and other AI models
🔗 ludwig.ai -
bentoml/OpenLLM ⭐ 10,175
Run any open-source LLMs, such as Llama and Mistral, as OpenAI-compatible API endpoints in the cloud.
🔗 bentoml.com -
great-expectations/great_expectations ⭐ 10,042
Always know what to expect from your data.
🔗 docs.greatexpectations.io -
dbt-labs/dbt-core ⭐ 10,042
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
🔗 getdbt.com -
kedro-org/kedro ⭐ 10,040
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
🔗 kedro.org -
huggingface/text-generation-inference ⭐ 9,198
A Rust, Python and gRPC server for text generation inference. Used in production at HuggingFace to power Hugging Chat, the Inference API and Inference Endpoint.
🔗 hf.co/docs/text-generation-inference -
netflix/metaflow ⭐ 8,321
Open Source Platform for developing, scaling and deploying serious ML, AI, and data science systems
🔗 metaflow.org -
activeloopai/deeplake ⭐ 8,220
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
🔗 activeloop.ai -
mage-ai/mage-ai ⭐ 8,011
🧙 Build, run, and manage data pipelines for integrating and transforming data.
🔗 www.mage.ai -
bentoml/BentoML ⭐ 7,202
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
🔗 bentoml.com -
langfuse/langfuse ⭐ 6,972
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
🔗 langfuse.com/docs -
flyteorg/flyte ⭐ 5,833
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
🔗 flyte.org -
allegroai/clearml ⭐ 5,727
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
🔗 clear.ml/docs -
feast-dev/feast ⭐ 5,638
The Open Source Feature Store for Machine Learning
🔗 feast.dev -
evidentlyai/evidently ⭐ 5,489
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
🔗 discord.gg/xzjkranp8b -
aimhubio/aim ⭐ 5,250
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
🔗 aimstack.io -
adap/flower ⭐ 5,200
Flower: A Friendly Federated AI Framework
🔗 flower.ai -
internlm/lmdeploy ⭐ 4,778
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
🔗 lmdeploy.readthedocs.io/en/latest -
zenml-io/zenml ⭐ 4,198
ZenML 🙏: The bridge between ML and Ops. https://zenml.io.
🔗 zenml.io -
orchest/orchest ⭐ 4,088
Build data pipelines, the easy way 🛠️
🔗 orchest.readthedocs.io/en/stable -
internlm/xtuner ⭐ 4,030
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
🔗 xtuner.readthedocs.io/zh-cn/latest -
kubeflow/pipelines ⭐ 3,634
Machine Learning Pipelines for Kubeflow
🔗 www.kubeflow.org/docs/components/pipelines -
polyaxon/polyaxon ⭐ 3,577
MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
🔗 polyaxon.com -
ploomber/ploomber ⭐ 3,518
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
🔗 docs.ploomber.io -
towhee-io/towhee ⭐ 3,249
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
🔗 towhee.io -
determined-ai/determined ⭐ 3,050
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
🔗 determined.ai -
leptonai/leptonai ⭐ 2,664
A Pythonic framework to simplify AI service building
🔗 lepton.ai -
azure/PyRIT ⭐ 1,960
The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and ML engineers to red team foundation models and their applications.
🔗 azure.github.io/pyrit -
dagworks-inc/hamilton ⭐ 1,894
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
🔗 hamilton.dagworks.io/en/latest -
meltano/meltano ⭐ 1,862
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
🔗 meltano.com -
dstackai/dstack ⭐ 1,623
dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, & TPU.
🔗 dstack.ai/docs -
hi-primus/optimus ⭐ 1,486
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
🔗 hi-optimus.com -
kubeflow/examples ⭐ 1,415
A repository to host extended examples and tutorials -
dagworks-inc/burr ⭐ 1,348
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.
🔗 burr.dagworks.io -
nccr-itmo/FEDOT ⭐ 649
Automated modeling and machine learning framework FEDOT
🔗 fedot.readthedocs.io
Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environments, RLHF
-
openai/gym ⭐ 34,933
A toolkit for developing and comparing reinforcement learning algorithms.
🔗 www.gymlibrary.dev -
unity-technologies/ml-agents ⭐ 17,305
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
🔗 unity.com/products/machine-learning-agents -
openai/baselines ⭐ 15,863
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms -
google/dopamine ⭐ 10,583
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
🔗 github.com/google/dopamine -
thu-ml/tianshou ⭐ 8,039
An elegant PyTorch deep reinforcement learning library.
🔗 tianshou.org -
lucidrains/PaLM-rlhf-pytorch ⭐ 7,722
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM -
farama-foundation/Gymnasium ⭐ 7,543
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
🔗 gymnasium.farama.org -
tensorlayer/TensorLayer ⭐ 7,336
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
🔗 tensorlayerx.com -
keras-rl/keras-rl ⭐ 5,528
Deep Reinforcement Learning for Keras.
🔗 keras-rl.readthedocs.io -
deepmind/dm_control ⭐ 3,841
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo. -
ai4finance-foundation/ElegantRL ⭐ 3,770
Massively Parallel Deep Reinforcement Learning. 🔥
🔗 ai4finance.org -
facebookresearch/ReAgent ⭐ 3,574
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
🔗 reagent.ai -
deepmind/acme ⭐ 3,531
A library of reinforcement learning components and agents -
opendilab/DI-engine ⭐ 3,127
DI-engine is a generalized decision intelligence engine for PyTorch and JAX. It provides python-first and asynchronous-native task and middleware abstractions
🔗 di-engine-docs.readthedocs.io -
eureka-research/Eureka ⭐ 2,848
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
🔗 eureka-research.github.io -
pettingzoo-team/PettingZoo ⭐ 2,669
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
🔗 pettingzoo.farama.org -
kzl/decision-transformer ⭐ 2,419
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling. -
pytorch/rl ⭐ 2,395
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
🔗 pytorch.org/rl -
anthropics/hh-rlhf ⭐ 1,638
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
🔗 arxiv.org/abs/2204.05862 -
arise-initiative/robosuite ⭐ 1,380
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
🔗 robosuite.ai -
humancompatibleai/imitation ⭐ 1,341
Clean PyTorch implementations of imitation and reward learning algorithms
🔗 imitation.readthedocs.io -
denys88/rl_games ⭐ 950
RL Games: High performance RL library -
google-deepmind/meltingpot ⭐ 631
A suite of test scenarios for multi-agent reinforcement learning.
Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover.
-
huggingface/transformers ⭐ 135,973
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🔗 huggingface.co/transformers -
pytorch/fairseq ⭐ 30,643
Facebook AI Research Sequence-to-Sequence Toolkit written in Python. -
explosion/spaCy ⭐ 30,407
💫 Industrial-strength Natural Language Processing (NLP) in Python
🔗 spacy.io -
myshell-ai/OpenVoice ⭐ 29,996
Instant voice cloning by MIT and MyShell.
🔗 research.myshell.ai/open-voice -
microsoft/unilm ⭐ 20,325
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
🔗 aka.ms/generalai -
huggingface/datasets ⭐ 19,334
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
🔗 huggingface.co/docs/datasets -
vikparuchuri/marker ⭐ 18,342
Marker converts PDF, EPUB, and MOBI to markdown. It's 10x faster than nougat, more accurate on most documents, and has low hallucination risk.
🔗 www.datalab.to -
rare-technologies/gensim ⭐ 15,716
Topic Modelling for Humans
🔗 radimrehurek.com/gensim -
ukplab/sentence-transformers ⭐ 15,507
State-of-the-Art Text Embeddings
🔗 www.sbert.net -
vikparuchuri/surya ⭐ 14,475
OCR, layout analysis, reading order, table recognition in 90+ languages
🔗 www.datalab.to -
gunthercox/ChatterBot ⭐ 14,119
ChatterBot is a machine learning, conversational dialog engine for creating chat bots
🔗 chatterbot.readthedocs.io -
flairnlp/flair ⭐ 13,981
A very simple framework for state-of-the-art Natural Language Processing (NLP)
🔗 flairnlp.github.io/flair -
nltk/nltk ⭐ 13,679
NLTK Source
🔗 www.nltk.org -
m-bain/whisperX ⭐ 12,765
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) -
openai/tiktoken ⭐ 12,600
tiktoken is a fast BPE tokeniser for use with OpenAI's models. -
jina-ai/clip-as-service ⭐ 12,485
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
🔗 clip-as-service.jina.ai -
nvidia/NeMo ⭐ 12,358
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
🔗 docs.nvidia.com/nemo-framework/user-guide/latest/overview.html -
allenai/allennlp ⭐ 11,770
An open-source NLP research library, built on PyTorch.
🔗 www.allennlp.org -
facebookresearch/seamless_communication ⭐ 10,996
Foundational Models for State-of-the-Art Speech and Text Translation -
facebookresearch/ParlAI ⭐ 10,497
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
🔗 parl.ai -
google/sentencepiece ⭐ 10,348
Unsupervised text tokenizer for Neural Network-based text generation. -
doccano/doccano ⭐ 9,623
Open source annotation tool for machine learning practitioners. -
neuml/txtai ⭐ 9,599
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
🔗 neuml.github.io/txtai -
sloria/TextBlob ⭐ 9,178
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
🔗 textblob.readthedocs.io -
facebookresearch/nougat ⭐ 9,039
Implementation of Nougat Neural Optical Understanding for Academic Documents
🔗 facebookresearch.github.io/nougat -
speechbrain/speechbrain ⭐ 9,025
A PyTorch-based Speech Toolkit
🔗 speechbrain.github.io -
togethercomputer/OpenChatKit ⭐ 9,009
OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots -
clips/pattern ⭐ 8,754
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
🔗 github.com/clips/pattern/wiki -
espnet/espnet ⭐ 8,570
End-to-End Speech Processing Toolkit
🔗 espnet.github.io/espnet -
deeppavlov/DeepPavlov ⭐ 6,742
An open source library for deep learning end-to-end dialog systems and chatbots.
🔗 deeppavlov.ai -
facebookresearch/metaseq ⭐ 6,518
A codebase for working with Open Pre-trained Transformers, originally forked from fairseq. -
kingoflolz/mesh-transformer-jax ⭐ 6,308
Model parallel transformers in JAX and Haiku -
maartengr/BERTopic ⭐ 6,226
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
🔗 maartengr.github.io/bertopic -
prefecthq/marvin ⭐ 5,348
✨ Build AI interfaces that spark joy
🔗 askmarvin.ai -
aiwaves-cn/agents ⭐ 5,338
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents -
salesforce/CodeGen ⭐ 4,947
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex. -
minimaxir/textgenrnn ⭐ 4,944
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code. -
layout-parser/layout-parser ⭐ 4,941
A Unified Toolkit for Deep Learning Based Document Image Analysis
🔗 layout-parser.github.io -
facebookresearch/DrQA ⭐ 4,481
Reading Wikipedia to Answer Open-Domain Questions -
makcedward/nlpaug ⭐ 4,467
Data augmentation for NLP
🔗 makcedward.github.io -
thilinarajapakse/simpletransformers ⭐ 4,121
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
🔗 simpletransformers.ai -
argilla-io/argilla ⭐ 4,089
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
🔗 docs.argilla.io -
maartengr/KeyBERT ⭐ 3,598
Minimal keyword extraction with BERT
🔗 maartengr.github.io/keybert -
life4/textdistance ⭐ 3,402
📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. -
promptslab/Promptify ⭐ 3,316
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
🔗 discord.gg/m88xfymbk6 -
jsvine/markovify ⭐ 3,308
A simple, extensible Markov chain generator. -
bytedance/lightseq ⭐ 3,218
LightSeq: A High Performance Library for Sequence Processing and Generation -
errbotio/errbot ⭐ 3,136
Errbot is a chatbot, a daemon that connects to your favorite chat service and brings your tools and some fun into the conversation.
🔗 errbot.io -
neuralmagic/deepsparse ⭐ 3,042
Sparsity-aware deep learning inference runtime for CPUs
🔗 neuralmagic.com/deepsparse -
huawei-noah/Pretrained-Language-Model ⭐ 3,034
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab. -
quivrhq/MegaParse ⭐ 2,998
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
🔗 pypi.org/project/megaparse -
ddangelov/Top2Vec ⭐ 2,956
Top2Vec learns jointly embedded topic, document and word vectors. -
jbesomi/texthero ⭐ 2,894
Text preprocessing, representation and visualization from zero to hero.
🔗 texthero.org -
huggingface/neuralcoref ⭐ 2,860
✨Fast Coreference Resolution in spaCy with Neural Networks
🔗 huggingface.co/coref -
salesforce/CodeT5 ⭐ 2,822
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
🔗 arxiv.org/abs/2305.07922 -
bigscience-workshop/promptsource ⭐ 2,714
Toolkit for creating, sharing and using natural language prompts. -
huggingface/setfit ⭐ 2,255
SetFit is an efficient and prompt-free framework for few-shot fine-tuning of Sentence Transformers.
🔗 hf.co/docs/setfit -
jamesturk/jellyfish ⭐ 2,073
🪼 a python library for doing approximate and phonetic matching of strings.
🔗 jamesturk.github.io/jellyfish -
alibaba/EasyNLP ⭐ 2,071
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit -
thudm/P-tuning-v2 ⭐ 1,989
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks -
featureform/featureform ⭐ 1,822
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
🔗 www.featureform.com -
marella/ctransformers ⭐ 1,822
Python bindings for the Transformer models implemented in C/C++ using GGML library. -
bhavnicksm/chonkie ⭐ 1,806
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
🔗 docs.chonkie.ai -
deepset-ai/FARM ⭐ 1,742
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
🔗 farm.deepset.ai -
franck-dernoncourt/NeuroNER ⭐ 1,700
Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.
🔗 neuroner.com -
explosion/spacy-models ⭐ 1,653
💫 Models for the spaCy Natural Language Processing (NLP) library
🔗 spacy.io -
google-research/language ⭐ 1,631
Shared repository for open-sourced projects from the Google AI Language team.
🔗 ai.google/research/teams/language -
plasticityai/magnitude ⭐ 1,629
A fast, efficient universal vector embedding utility package. -
arxiv-vanity/arxiv-vanity ⭐ 1,612
Renders papers from arXiv as responsive web pages so you don't have to squint at a PDF.
🔗 www.arxiv-vanity.com -
chrismattmann/tika-python ⭐ 1,516
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community. -
urchade/GLiNER ⭐ 1,503
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
🔗 arxiv.org/abs/2311.08526 -
dmmiller612/bert-extractive-summarizer ⭐ 1,406
Easy to use extractive text summarization with BERT -
intellabs/fastRAG ⭐ 1,380
Efficient Retrieval Augmentation and Generation Framework -
gunthercox/chatterbot-corpus ⭐ 1,375
A multilingual dialog corpus
🔗 chatterbot-corpus.readthedocs.io -
nomic-ai/nomic ⭐ 1,369
Interact, analyze and structure massive text, image, embedding, audio and video datasets
🔗 atlas.nomic.ai -
jonasgeiping/cramming ⭐ 1,299
Cramming the training of a (BERT-type) language model into limited compute. -
pemistahl/lingua-py ⭐ 1,180
The most accurate natural language detection library for Python, suitable for short text and mixed-language text -
openai/grade-school-math ⭐ 1,105
GSM8K, a dataset of 8.5K high quality linguistically diverse grade school math word problems -
abertsch72/unlimiformer ⭐ 1,057
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input" -
unitaryai/detoxify ⭐ 971
Toxic Comment Classification with PyTorch Lightning and Transformers
🔗 www.unitary.ai -
norskregnesentral/skweak ⭐ 921
skweak: A software toolkit for weak supervision applied to NLP tasks -
explosion/spacy-streamlit ⭐ 811
👑 spaCy building blocks and visualizers for Streamlit apps
🔗 share.streamlit.io/ines/spacy-streamlit-demo/master/app.py -
keras-team/keras-hub ⭐ 810
Pretrained model hub for Keras 3 -
paddlepaddle/RocketQA ⭐ 769
🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models. -
webis-de/small-text ⭐ 598
Small-Text provides state-of-the-art Active Learning for Text Classification. Several pre-implemented Query Strategies, Initialization Strategies, and Stopping Criteria are provided, which can be easily mixed and matched to build active learning experiments or applications.
🔗 small-text.readthedocs.io -
babelscape/rebel ⭐ 505
REBEL is a seq2seq model that simplifies Relation Extraction (EMNLP 2021).
Python packaging, dependency management and bundling.
-
pyenv/pyenv ⭐ 39,741
pyenv lets you easily switch between multiple versions of Python. -
python-poetry/poetry ⭐ 31,916
Python packaging and dependency management made easy
🔗 python-poetry.org -
astral-sh/uv ⭐ 28,727
An extremely fast Python package installer and resolver, written in Rust. Designed as a drop-in replacement for pip and pip-compile.
🔗 docs.astral.sh/uv -
pypa/pipenv ⭐ 24,921
A virtualenv management tool that supports a multitude of systems and nicely bridges the gaps between pip, python and virtualenv.
🔗 pipenv.pypa.io -
mitsuhiko/rye ⭐ 13,883
A Hassle-Free Python Experience
🔗 rye.astral.sh -
pyinstaller/pyinstaller ⭐ 11,970
Freeze (package) Python programs into stand-alone executables
🔗 www.pyinstaller.org -
pypa/pipx ⭐ 10,714
Install and Run Python Applications in Isolated Environments
🔗 pipx.pypa.io -
pdm-project/pdm ⭐ 7,994
A modern Python package and dependency manager supporting the latest PEP standards
🔗 pdm-project.org -
jazzband/pip-tools ⭐ 7,779
A set of tools to keep your pinned Python dependencies fresh (pip-compile + pip-sync)
🔗 pip-tools.rtfd.io -
mamba-org/mamba ⭐ 6,994
The Fast Cross-Platform Package Manager: mamba is a reimplementation of the conda package manager in C++
🔗 mamba.readthedocs.io -
conda-forge/miniforge ⭐ 6,652
A conda-forge distribution.
🔗 conda-forge.org/download -
conda/conda ⭐ 6,495
A system-level, binary package and environment manager running on all major operating systems and platforms.
🔗 docs.conda.io/projects/conda -
pypa/hatch ⭐ 6,129
Modern, extensible Python project management
🔗 hatch.pypa.io/latest -
indygreg/PyOxidizer ⭐ 5,537
A modern Python application packaging and distribution tool -
pypa/virtualenv ⭐ 4,839
A tool to create isolated Python environments. Since Python 3.3, a subset of it has been integrated into the standard lib venv module.
🔗 virtualenv.pypa.io -
spack/spack ⭐ 4,458
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
🔗 spack.io -
prefix-dev/pixi ⭐ 3,470
pixi is a cross-platform, multi-language package manager and workflow tool built on the foundation of the conda ecosystem.
🔗 pixi.sh -
pantsbuild/pex ⭐ 2,881
A tool for generating .pex (Python EXecutable) files, lock files and venvs.
🔗 docs.pex-tool.org -
beeware/briefcase ⭐ 2,691
Tools to support converting a Python project into a standalone native application.
🔗 briefcase.readthedocs.io -
pypa/flit ⭐ 2,173
Simplified packaging of Python modules
🔗 flit.pypa.io -
linkedin/shiv ⭐ 1,773
shiv is a command line utility for building fully self contained Python zipapps as outlined in PEP 441, but with all their dependencies included. -
marcelotduarte/cx_Freeze ⭐ 1,372
cx_Freeze creates standalone executables from Python scripts with the same performance. It is cross-platform and should work on any platform that Python itself works on.
🔗 marcelotduarte.github.io/cx_freeze -
ofek/pyapp ⭐ 1,259
Runtime installer for Python applications
🔗 ofek.dev/pyapp -
pypa/gh-action-pypi-publish ⭐ 957
The blessed GitHub Action, for publishing your 📦 distribution files to PyPI, the tokenless way: https://github.com/marketplace/actions/pypi-publish
🔗 packaging.python.org/guides/publishing-package-distribution-releases-using-github-actions-ci-cd-workflows -
py2exe/py2exe ⭐ 881
Create standalone Windows programs from Python code
🔗 www.py2exe.org -
prefix-dev/rip ⭐ 655
RIP is a library that allows the resolving and installing of Python PyPI packages from Rust into a virtual environment. It's based on our experience with building Rattler and aims to provide the same experience but for PyPI instead of Conda.
🔗 prefix.dev -
snok/install-poetry ⭐ 593
GitHub Action for installing and configuring Poetry -
python-poetry/install.python-poetry.org ⭐ 214
The official Poetry installation script
🔗 install.python-poetry.org
Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations.
-
pandas-dev/pandas ⭐ 43,976
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
🔗 pandas.pydata.org -
pola-rs/polars ⭐ 30,772
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
🔗 docs.pola.rs -
duckdb/duckdb ⭐ 24,824
DuckDB is an analytical in-process SQL database management system
🔗 www.duckdb.org -
gventuri/pandas-ai ⭐ 13,642
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
🔗 pandas-ai.com -
kanaries/pygwalker ⭐ 13,476
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
🔗 kanaries.net/pygwalker -
ydataai/ydata-profiling ⭐ 12,592
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
🔗 docs.profiling.ydata.ai -
rapidsai/cudf ⭐ 8,507
cuDF is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data
🔗 docs.rapids.ai/api/cudf/stable -
aws/aws-sdk-pandas ⭐ 3,942
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
🔗 aws-sdk-pandas.readthedocs.io -
nalepae/pandarallel ⭐ 3,695
A simple and efficient tool to parallelize Pandas operations on all available CPUs
🔗 nalepae.github.io/pandarallel -
unionai-oss/pandera ⭐ 3,432
A light-weight, flexible, and expressive statistical data testing library
🔗 www.union.ai/pandera -
adamerose/PandasGUI ⭐ 3,200
A GUI for Pandas DataFrames -
blaze/blaze ⭐ 3,184
NumPy and Pandas interface to Big Data
🔗 blaze.pydata.org -
pydata/pandas-datareader ⭐ 2,963
Extract data from a wide range of Internet sources into a pandas DataFrame.
🔗 pydata.github.io/pandas-datareader/stable/index.html -
scikit-learn-contrib/sklearn-pandas ⭐ 2,816
Pandas integration with sklearn -
jmcarpenter2/swifter ⭐ 2,549
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner -
eventual-inc/Daft ⭐ 2,411
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
🔗 getdaft.io -
delta-io/delta-rs ⭐ 2,377
A native Rust library for Delta Lake, with bindings into Python
🔗 delta-io.github.io/delta-rs -
fugue-project/fugue ⭐ 2,016
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
🔗 fugue-tutorials.readthedocs.io -
pyjanitor-devs/pyjanitor ⭐ 1,369
Clean APIs for data cleaning. Python implementation of R package Janitor
🔗 pyjanitor-devs.github.io/pyjanitor -
machow/siuba ⭐ 1,161
Python library for using dplyr like syntax with pandas and SQL
🔗 siuba.org -
holoviz/hvplot ⭐ 1,147
A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews
🔗 hvplot.holoviz.org -
renumics/spotlight ⭐ 1,133
Interactively explore unstructured datasets from your dataframe.
🔗 renumics.com -
tkrabel/bamboolib ⭐ 939
bamboolib - a GUI for pandas DataFrames
🔗 bamboolib.com -
mwouts/itables ⭐ 807
This package changes how Pandas and Polars DataFrames are rendered in Jupyter Notebooks. With itables you can display your tables as interactive DataTables that you can sort, paginate, scroll or filter.
🔗 mwouts.github.io/itables
Performance, parallelisation and low level libraries.
-
celery/celery ⭐ 25,007
Distributed Task Queue (development branch)
🔗 docs.celeryq.dev -
google/flatbuffers ⭐ 23,443
FlatBuffers: Memory Efficient Serialization Library
🔗 flatbuffers.dev -
pybind/pybind11 ⭐ 15,889
Seamless operability between C++11 and Python
🔗 pybind11.readthedocs.io -
exaloop/codon ⭐ 15,191
A high-performance, zero-overhead, extensible Python compiler using LLVM
🔗 docs.exaloop.io/codon -
dask/dask ⭐ 12,661
Parallel computing with task scheduling
🔗 dask.org -
numba/numba ⭐ 10,029
NumPy aware dynamic Python compiler using LLVM
🔗 numba.pydata.org -
modin-project/modin ⭐ 9,920
Modin: Scale your Pandas workflows by changing a single line of code
🔗 modin.readthedocs.io -
nebuly-ai/optimate ⭐ 8,377
A collection of libraries to optimise AI model performances
🔗 www.nebuly.com -
vaexio/vaex ⭐ 8,309
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
🔗 vaex.io -
mher/flower ⭐ 6,509
Real-time monitor and web admin for Celery distributed task queue
🔗 flower.readthedocs.io -
python-trio/trio ⭐ 6,250
Trio – a friendly Python library for async concurrency and I/O
🔗 trio.readthedocs.io -
ultrajson/ultrajson ⭐ 4,343
Ultra fast JSON decoder and encoder written in C with Python bindings
🔗 pypi.org/project/ujson -
tlkh/asitop ⭐ 3,579
Perf monitoring CLI tool for Apple Silicon
🔗 tlkh.github.io/asitop -
facebookincubator/cinder ⭐ 3,541
Cinder is Meta's internal performance-oriented production version of CPython.
🔗 trycinder.com -
airtai/faststream ⭐ 3,214
FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
🔗 faststream.airt.ai/latest -
ipython/ipyparallel ⭐ 2,594
IPython Parallel: Interactive Parallel Computing in Python
🔗 ipyparallel.readthedocs.io -
intel/intel-extension-for-transformers ⭐ 2,144
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡ -
h5py/h5py ⭐ 2,092
HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.
🔗 www.h5py.org -
agronholm/anyio ⭐ 1,840
High level asynchronous concurrency and networking framework that works on top of either trio or asyncio -
tiangolo/asyncer ⭐ 1,718
Asyncer, async and await, focused on developer experience.
🔗 asyncer.tiangolo.com -
faster-cpython/ideas ⭐ 1,698
Discussion and work tracker for Faster CPython project. -
intel/intel-extension-for-pytorch ⭐ 1,639
A Python package that extends the official PyTorch to easily obtain performance gains on Intel platforms -
dask/distributed ⭐ 1,581
A distributed task scheduler for Dask
🔗 distributed.dask.org -
nschloe/perfplot ⭐ 1,347
📈 Performance analysis for Python snippets -
intel/scikit-learn-intelex ⭐ 1,231
Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
🔗 intel.github.io/scikit-learn-intelex -
markshannon/faster-cpython ⭐ 942
How to make CPython faster. -
zerointensity/pointers.py ⭐ 917
Bringing the hell of pointers to Python.
🔗 pointers.zintensity.dev -
brandtbucher/specialist ⭐ 641
Visualize CPython's specializing, adaptive interpreter. 🔥
Memory and CPU/GPU profiling tools and libraries.
-
bloomberg/memray ⭐ 13,410
Memray is a memory profiler for Python
🔗 bloomberg.github.io/memray -
benfred/py-spy ⭐ 12,941
Sampling profiler for Python programs -
plasma-umass/scalene ⭐ 12,251
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals -
joerick/pyinstrument ⭐ 6,700
🚴 Call stack profiler for Python. Shows you why your code is slow!
🔗 pyinstrument.readthedocs.io -
gaogaotiantian/viztracer ⭐ 5,359
A debugging and profiling tool that can trace and visualize python code execution
🔗 viztracer.readthedocs.io -
pythonprofilers/memory_profiler ⭐ 4,396
Monitor Memory usage of Python code
🔗 pypi.python.org/pypi/memory_profiler -
reloadware/reloadium ⭐ 2,826
Hot Reloading and Profiling for Python -
pyutils/line_profiler ⭐ 2,777
Line-by-line profiling for Python -
jiffyclub/snakeviz ⭐ 2,363
An in-browser Python profile viewer
🔗 jiffyclub.github.io/snakeviz -
p403n1x87/austin ⭐ 1,936
Python frame stack sampler for CPython
🔗 pypi.org/project/austin-dist -
pythonspeed/filprofiler ⭐ 844
A Python memory profiler for data processing and scientific computing applications
🔗 pythonspeed.com/products/filmemoryprofiler
Security related libraries: vulnerability discovery, SQL injection, environment auditing.
-
swisskyrepo/PayloadsAllTheThings ⭐ 61,786
A list of useful payloads and bypasses for Web Application Security and Pentest/CTF
🔗 swisskyrepo.github.io/payloadsallthethings -
sqlmapproject/sqlmap ⭐ 32,780
Automatic SQL injection and database takeover tool
🔗 sqlmap.org -
certbot/certbot ⭐ 31,685
Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol. -
aquasecurity/trivy ⭐ 23,937
Find vulnerabilities, misconfigurations, secrets, SBOM in containers, Kubernetes, code repositories, clouds and more
🔗 trivy.dev -
bridgecrewio/checkov ⭐ 7,193
Checkov is a static code analysis tool for infrastructure as code (IaC) and also a software composition analysis (SCA) tool for images and open source packages.
🔗 www.checkov.io -
nccgroup/ScoutSuite ⭐ 6,781
Multi-Cloud Security Auditing Tool -
stamparm/maltrail ⭐ 6,624
Malicious traffic detection system -
pycqa/bandit ⭐ 6,530
Bandit is a tool designed to find common security issues in Python code.
🔗 bandit.readthedocs.io -
rhinosecuritylabs/pacu ⭐ 4,414
The AWS exploitation framework, designed for testing the security of Amazon Web Services environments.
🔗 rhinosecuritylabs.com/aws/pacu-open-source-aws-exploitation-framework -
dashingsoft/pyarmor ⭐ 3,663
A tool used to obfuscate Python scripts, bind obfuscated scripts to a fixed machine, or set obfuscated scripts to expire.
🔗 pyarmor.dashingsoft.com -
luijait/DarkGPT ⭐ 2,143
DarkGPT is an OSINT assistant based on GPT-4-200K (recommended use) designed to perform queries on leaked databases, thus providing an artificial intelligence assistant that can be useful in your traditional OSINT processes. -
pyupio/safety ⭐ 1,752
Safety checks Python dependencies for known security vulnerabilities and suggests the proper remediations for vulnerabilities detected.
🔗 safetycli.com/product/safety-cli -
trailofbits/pip-audit ⭐ 986
Audits Python environments, requirements files and dependency trees for known security vulnerabilities, and can automatically fix them
🔗 pypi.org/project/pip-audit -
fadi002/de4py ⭐ 865
toolkit for python reverse engineering
🔗 de4py.000.pe
Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Math and Science category for crossover.
-
atsushisakai/PythonRobotics ⭐ 23,574
Python sample codes for robotics algorithms.
🔗 atsushisakai.github.io/pythonrobotics -
bulletphysics/bullet3 ⭐ 12,749
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
🔗 bulletphysics.org -
isl-org/Open3D ⭐ 11,576
Open3D: A Modern Library for 3D Data Processing
🔗 www.open3d.org -
qiskit/qiskit ⭐ 5,330
Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
🔗 www.ibm.com/quantum/qiskit -
astropy/astropy ⭐ 4,477
Astronomy and astrophysics core library
🔗 www.astropy.org -
nvidia/warp ⭐ 4,320
A Python framework for high performance GPU simulation and graphics
🔗 nvidia.github.io/warp -
quantumlib/Cirq ⭐ 4,298
A Python framework for creating, editing, and invoking Noisy Intermediate Scale Quantum (NISQ) circuits. -
chakazul/Lenia ⭐ 3,556
Lenia is a 2D cellular automaton with continuous space, time and states. It produces a huge variety of interesting mathematical life forms
🔗 chakazul.github.io/lenia/javascript/lenia.html -
openai/mujoco-py ⭐ 2,891
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3. -
genesis-embodied-ai/Genesis ⭐ 2,773
Genesis is a physics platform, and generative data engine, designed for general purpose Robotics/Embodied AI/Physical AI applications -
rdkit/rdkit ⭐ 2,699
The official sources for the RDKit library -
projectmesa/mesa ⭐ 2,590
Mesa is an open-source Python library for agent-based modeling, ideal for simulating complex systems and exploring emergent behaviors.
🔗 mesa.readthedocs.io -
taichi-dev/difftaichi ⭐ 2,498
10 differentiable physical simulators built with Taichi differentiable programming (DiffTaichi, ICLR 2020) -
google/brax ⭐ 2,388
Massively parallel rigidbody physics simulation on accelerator hardware. -
nvidia-omniverse/IsaacLab ⭐ 2,362
Unified framework for robot learning built on NVIDIA Isaac Sim
🔗 isaac-sim.github.io/isaaclab -
nvidia-omniverse/IsaacGymEnvs ⭐ 2,083
Example RL environments for the NVIDIA Isaac Gym high performance environments -
facebookresearch/habitat-lab ⭐ 2,025
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
🔗 aihabitat.org -
quantecon/QuantEcon.py ⭐ 2,001
A community based Python library for quantitative economics
🔗 quantecon.org/quantecon-py -
microsoft/PromptCraft-Robotics ⭐ 1,910
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
🔗 aka.ms/chatgpt-robotics -
eloialonso/diamond ⭐ 1,607
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model
🔗 diamond-wm.github.io -
deepmodeling/deepmd-kit ⭐ 1,524
A deep learning package for many-body potential energy representation and molecular dynamics
🔗 docs.deepmodeling.com/projects/deepmd -
sail-sg/envpool ⭐ 1,104
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
🔗 envpool.readthedocs.io -
bowang-lab/scGPT ⭐ 1,057
scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI
🔗 scgpt.readthedocs.io/en/latest -
a-r-j/graphein ⭐ 1,048
Protein Graph Library
🔗 graphein.ai -
viblo/pymunk ⭐ 935
Pymunk is an easy-to-use pythonic 2D physics library that can be used whenever you need 2D rigid body physics from Python
🔗 www.pymunk.org -
google-deepmind/materials_discovery ⭐ 905
Graph Networks for Materials Science (GNoME) is a project centered around scaling machine learning methods to tackle materials science. -
nvidia-omniverse/OmniIsaacGymEnvs ⭐ 874
Reinforcement Learning Environments for Omniverse Isaac Gym -
google/evojax ⭐ 854
EvoJAX is a scalable, general purpose, hardware-accelerated neuroevolution toolkit built on the JAX library -
facebookresearch/fairo ⭐ 854
A modular embodied agent architecture and platform for building embodied agents -
eureka-research/DrEureka ⭐ 822
Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)
🔗 eureka-research.github.io/dr-eureka -
google-deepmind/concordia ⭐ 715
Concordia is a library to facilitate construction and use of generative agent-based models to simulate interactions of agents in grounded physical, social, or digital space. -
ur-whitelab/chemcrow-public ⭐ 643
Chemcrow -
ur-whitelab/chemcrow-runs ⭐ 72
ur-whitelab/chemcrow-runs
Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials.
-
thealgorithms/Python ⭐ 195,147
All Algorithms implemented in Python
🔗 thealgorithms.github.io/python -
microsoft/generative-ai-for-beginners ⭐ 65,567
21 Lessons, Get Started Building with Generative AI
🔗 microsoft.github.io/generative-ai-for-beginners -
jakevdp/PythonDataScienceHandbook ⭐ 43,382
Python Data Science Handbook: full text in Jupyter Notebooks
🔗 jakevdp.github.io/pythondatasciencehandbook -
mlabonne/llm-course ⭐ 39,801
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
🔗 mlabonne.github.io/blog -
rasbt/LLMs-from-scratch ⭐ 34,673
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
🔗 www.amazon.com/build-large-language-model-scratch/dp/1633437167 -
realpython/python-guide ⭐ 28,447
Python best practices guidebook, written for humans.
🔗 docs.python-guide.org -
christoschristofidis/awesome-deep-learning ⭐ 24,379
A curated list of awesome Deep Learning tutorials, projects and communities. -
d2l-ai/d2l-en ⭐ 24,146
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
🔗 d2l.ai -
wesm/pydata-book ⭐ 22,337
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media -
microsoft/recommenders ⭐ 19,382
Best Practices on Recommendation Systems
🔗 recommenders-team.github.io/recommenders/intro.html -
hannibal046/Awesome-LLM ⭐ 19,217
Awesome-LLM: a curated list of Large Language Models -
fchollet/deep-learning-with-python-notebooks ⭐ 18,802
Jupyter notebooks for the code samples of the book "Deep Learning with Python" -
graykode/nlp-tutorial ⭐ 14,330
Natural Language Processing Tutorial for Deep Learning Researchers
🔗 www.reddit.com/r/machinelearning/comments/amfinl/project_nlptutoral_repository_who_is_studying -
naklecha/llama3-from-scratch ⭐ 13,842
llama3 implementation one matrix multiplication at a time -
shangtongzhang/reinforcement-learning-an-introduction ⭐ 13,673
Python Implementation of Reinforcement Learning: An Introduction -
karpathy/nn-zero-to-hero ⭐ 12,008
Neural Networks: Zero to Hero -
eugeneyan/open-llms ⭐ 11,281
📋 A list of open LLMs available for commercial use. -
mrdbourke/pytorch-deep-learning ⭐ 11,251
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
🔗 learnpytorch.io -
karpathy/micrograd ⭐ 10,639
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API -
rucaibox/LLMSurvey ⭐ 10,567
The official GitHub page for the survey paper "A Survey of Large Language Models".
🔗 arxiv.org/abs/2303.18223 -
openai/spinningup ⭐ 10,228
An educational resource to help anyone learn deep reinforcement learning.
🔗 spinningup.openai.com -
srush/GPU-Puzzles ⭐ 10,023
Teaching beginner GPU programming in a completely interactive fashion -
nielsrogge/Transformers-Tutorials ⭐ 9,595
This repository contains demos I made with the Transformers library by HuggingFace. -
zhanymkanov/fastapi-best-practices ⭐ 9,557
FastAPI Best Practices and Conventions we used at our startup -
mooler0410/LLMsPracticalGuide ⭐ 9,533
A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)
🔗 arxiv.org/abs/2304.13712v2 -
firmai/industry-machine-learning ⭐ 7,273
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
🔗 www.sov.ai -
gkamradt/langchain-tutorials ⭐ 6,782
Overview and tutorial of the LangChain Library -
udlbook/udlbook ⭐ 6,661
Understanding Deep Learning - Simon J.D. Prince -
neetcode-gh/leetcode ⭐ 5,684
Leetcode solutions for NeetCode.io -
roboflow/notebooks ⭐ 5,648
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
🔗 roboflow.com/models -
udacity/deep-learning-v2-pytorch ⭐ 5,323
Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101 -
mrdbourke/tensorflow-deep-learning ⭐ 5,303
All course materials for the Zero to Mastery Deep Learning with TensorFlow course.
🔗 dbourke.link/ztmtfcourse -
alirezadir/Machine-Learning-Interviews ⭐ 4,977
This repo is meant to serve as a guide for Machine Learning/AI technical interviews. -
timofurrer/awesome-asyncio ⭐ 4,648
A curated list of awesome Python asyncio frameworks, libraries, software and resources -
zotroneneis/machine_learning_basics ⭐ 4,320
Plain python implementations of basic machine learning algorithms -
promptslab/Awesome-Prompt-Engineering ⭐ 3,964
This repository contains hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc.
🔗 discord.gg/m88xfymbk6 -
huggingface/deep-rl-class ⭐ 3,923
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course. -
huggingface/diffusion-models-class ⭐ 3,696
Materials for the Hugging Face Diffusion Models Course -
rasbt/machine-learning-book ⭐ 3,671
Code Repository for Machine Learning with PyTorch and Scikit-Learn
🔗 sebastianraschka.com/books/#machine-learning-with-pytorch-and-scikit-learn -
cosmicpython/book ⭐ 3,406
A Book about Pythonic Application Architecture Patterns for Managing Complexity. Cosmos is the Opposite of Chaos you see. O'R. wouldn't actually let us call it "Cosmic Python" tho.
🔗 www.cosmicpython.com -
fluentpython/example-code-2e ⭐ 3,319
Example code for Fluent Python, 2nd edition (O'Reilly 2022)
🔗 amzn.to/3j48u2j -
amanchadha/coursera-deep-learning-specialization ⭐ 3,205
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv... -
handsonllm/Hands-On-Large-Language-Models ⭐ 3,012
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
🔗 www.llm-book.com -
mrdbourke/zero-to-mastery-ml ⭐ 2,995
All course materials for the Zero to Mastery Machine Learning and Data Science course.
🔗 dbourke.link/ztmmlcourse -
krzjoa/awesome-python-data-science ⭐ 2,619
Probably the best curated list of data science software in Python.
🔗 krzjoa.github.io/awesome-python-data-science -
gerdm/prml ⭐ 2,161
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop -
cgpotts/cs224u ⭐ 2,119
Code for CS224u: Natural Language Understanding -
cerlymarco/MEDIUM_NoteBook ⭐ 2,084
Repository containing notebooks of my posts on Medium -
trananhkma/fucking-awesome-python ⭐ 1,987
awesome-python with ⭐ and 🍴 -
huggingface/cookbook ⭐ 1,705
Community-driven practical examples of building AI applications and solving various tasks with AI using open-source tools and models.
🔗 huggingface.co/learn/cookbook -
chandlerbang/awesome-self-supervised-gnn ⭐ 1,606
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN). -
atcold/NYU-DLSP21 ⭐ 1,582
NYU Deep Learning Spring 2021
🔗 atcold.github.io/nyu-dlsp21 -
patrickloeber/MLfromscratch ⭐ 1,335
Machine Learning algorithm implementations from scratch. -
davidadsp/Generative_Deep_Learning_2nd_Edition ⭐ 1,139
The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
🔗 www.oreilly.com/library/view/generative-deep-learning/9781098134174 -
jackhidary/quantumcomputingbook ⭐ 807
Companion site for the textbook Quantum Computing: An Applied Approach -
rasbt/LLM-workshop-2024 ⭐ 776
A 4-hour coding workshop to understand how LLMs are implemented and used -
bayesianmodelingandcomputationinpython/BookCode_Edition1 ⭐ 506
Bayesian Modeling and Computation in Python: open-access version of the text and the code examples in the book
🔗 www.bayesiancomputationbook.com -
dylanhogg/awesome-python ⭐ 325
🐍 Hand-picked awesome Python libraries and frameworks, organised by category
🔗 www.awesomepython.org
Template tools and libraries: cookiecutter repos, generators, quick-starts.
-
tiangolo/full-stack-fastapi-template ⭐ 28,190
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more. -
cookiecutter/cookiecutter ⭐ 22,734
A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
🔗 pypi.org/project/cookiecutter -
drivendata/cookiecutter-data-science ⭐ 8,396
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
🔗 cookiecutter-data-science.drivendata.org -
buuntu/fastapi-react ⭐ 2,250
🚀 Cookiecutter Template for FastAPI + React Projects. Using PostgreSQL, SQLAlchemy, and Docker -
pyscaffold/pyscaffold ⭐ 2,127
🛠 Python project template generator with batteries included
🔗 pyscaffold.org -
cjolowicz/cookiecutter-hypermodern-python ⭐ 1,824
Cookiecutter template for a Python package based on the Hypermodern Python article series.
🔗 cookiecutter-hypermodern-python.readthedocs.io -
tezromach/python-package-template ⭐ 1,091
🚀 Your next Python package needs a bleeding-edge project structure. -
martinheinz/python-project-blueprint ⭐ 960
Blueprint/Boilerplate For Python Projects -
fmind/mlops-python-package ⭐ 818
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
🔗 fmind.github.io/mlops-python-package
Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars.
-
willmcgugan/rich ⭐ 49,819
Rich is a Python library for rich text and beautiful formatting in the terminal.
🔗 rich.readthedocs.io/en/latest -
tqdm/tqdm ⭐ 28,854
⚡ A Fast, Extensible Progress Bar for Python and CLI
🔗 tqdm.github.io -
google/python-fire ⭐ 27,213
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object. -
willmcgugan/textual ⭐ 25,884
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
🔗 textual.textualize.io -
tiangolo/typer ⭐ 15,973
Typer, build great CLIs. Easy to code. Based on Python type hints.
🔗 typer.tiangolo.com -
pallets/click ⭐ 15,842
Python composable command line interface toolkit
🔗 click.palletsprojects.com -
saulpw/visidata ⭐ 7,953
A terminal spreadsheet multitool for discovering and arranging data
🔗 visidata.org -
tconbeer/harlequin ⭐ 3,863
The SQL IDE for Your Terminal.
🔗 harlequin.sh -
manrajgrover/halo ⭐ 2,901
💫 Beautiful spinners for terminal, IPython and Jupyter -
urwid/urwid ⭐ 2,829
Console user interface library for Python (official repo)
🔗 urwid.org -
textualize/trogon ⭐ 2,503
Easily turn your Click CLI into a powerful terminal application -
darrenburns/elia ⭐ 1,885
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more. -
tmbo/questionary ⭐ 1,590
Python library to build pretty command line user prompts ✨Easy to use multi-select lists, confirmations, free text prompts ... -
jazzband/prettytable ⭐ 1,392
Display tabular data in a visually appealing ASCII table format
🔗 pypi.org/project/prettytable -
1j01/textual-paint ⭐ 954
🎨 MS Paint in your terminal.
🔗 pypi.org/project/textual-paint
Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins.
-
mitmproxy/mitmproxy ⭐ 37,073
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
🔗 mitmproxy.org -
locustio/locust ⭐ 25,189
Write scalable load tests in plain Python 🚗💨
🔗 locust.cloud -
pytest-dev/pytest ⭐ 12,202
The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
🔗 pytest.org -
microsoft/playwright-python ⭐ 12,000
Python version of the Playwright testing and automation library.
🔗 playwright.dev/python -
robotframework/robotframework ⭐ 9,986
Generic automation framework for acceptance testing and RPA
🔗 robotframework.org -
getmoto/moto ⭐ 7,689
A library that allows you to easily mock out tests based on AWS infrastructure.
🔗 docs.getmoto.org/en/latest -
hypothesisworks/hypothesis ⭐ 7,610
Hypothesis is a powerful, flexible, and easy to use library for property-based testing.
🔗 hypothesis.works -
newsapps/beeswithmachineguns ⭐ 6,449
A utility for arming (creating) many bees (micro EC2 instances) to attack (load test) targets (web applications).
🔗 apps.chicagotribune.com -
seleniumbase/SeleniumBase ⭐ 5,515
📊 Blazing fast Python framework for web crawling, scraping, testing, and reporting. Supports pytest. Stealth abilities: UC Mode and CDP Mode.
🔗 seleniumbase.io -
codium-ai/qodo-cover ⭐ 4,564
Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞
🔗 qodo.ai -
spulec/freezegun ⭐ 4,209
Let your Python tests travel through time -
getsentry/responses ⭐ 4,173
A utility for mocking out the Python Requests library. -
confident-ai/deepeval ⭐ 3,905
The LLM Evaluation Framework
🔗 docs.confident-ai.com -
tox-dev/tox ⭐ 3,707
Command line driven CI frontend and development task automation tool.
🔗 tox.wiki -
behave/behave ⭐ 3,203
BDD, Python style.
🔗 behave.readthedocs.io/en/latest -
nedbat/coveragepy ⭐ 3,031
The code coverage tool for Python
🔗 coverage.readthedocs.io -
cobrateam/splinter ⭐ 2,723
splinter - python test framework for web applications
🔗 splinter.readthedocs.org/en/stable/index.html -
kevin1024/vcrpy ⭐ 2,722
Automatically mock your HTTP interactions to simplify and speed up testing -
pytest-dev/pytest-testinfra ⭐ 2,385
With Testinfra you can write unit tests in Python to test the actual state of your servers configured by management tools like Salt, Ansible, Puppet, Chef and so on.
🔗 testinfra.readthedocs.io -
pytest-dev/pytest-mock ⭐ 1,864
Thin wrapper around the mock package for easier use with pytest
🔗 pytest-mock.readthedocs.io/en/latest -
pytest-dev/pytest-cov ⭐ 1,776
Coverage plugin for pytest. -
pytest-dev/pytest-xdist ⭐ 1,497
pytest plugin for distributed testing and loop-on-failures testing modes.
🔗 pytest-xdist.readthedocs.io -
pytest-dev/pytest-asyncio ⭐ 1,439
Asyncio support for pytest
🔗 pytest-asyncio.readthedocs.io -
taverntesting/tavern ⭐ 1,033
A command-line tool and Python library and Pytest plugin for automated testing of RESTful APIs, with a simple, concise and flexible YAML-based syntax
🔗 taverntesting.github.io
Machine learning and classical timeseries libraries: forecasting, seasonality, anomaly detection, econometrics.
-
facebook/prophet ⭐ 18,590
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
🔗 facebook.github.io/prophet -
blue-yonder/tsfresh ⭐ 8,477
Automatic extraction of relevant features from time series
🔗 tsfresh.readthedocs.io -
unit8co/darts ⭐ 8,145
A Python library for user-friendly forecasting and anomaly detection on time series.
🔗 unit8co.github.io/darts -
sktime/sktime ⭐ 8,007
A unified framework for machine learning with time series
🔗 www.sktime.net -
facebookresearch/Kats ⭐ 5,167
Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends. -
awslabs/gluonts ⭐ 4,662
Probabilistic time series modeling in Python
🔗 ts.gluon.ai -
nixtla/statsforecast ⭐ 4,028
Lightning ⚡️ fast forecasting with statistical and econometric models.
🔗 nixtlaverse.nixtla.io/statsforecast -
google-research/timesfm ⭐ 3,852
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
🔗 research.google/blog/a-decoder-only-foundation-model-for-time-series-forecasting -
tdameritrade/stumpy ⭐ 3,679
STUMPY is a powerful and scalable Python library for modern time series analysis
🔗 stumpy.readthedocs.io/en/latest -
salesforce/Merlion ⭐ 3,434
Merlion: A Machine Learning Framework for Time Series Intelligence -
amazon-science/chronos-forecasting ⭐ 2,663
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
🔗 arxiv.org/abs/2403.07815 -
rjt1990/pyflux ⭐ 2,113
Open source time series library for Python -
aistream-peelout/flow-forecast ⭐ 2,072
Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).
🔗 flow-forecast.atlassian.net/wiki/spaces/ff/overview -
uber/orbit ⭐ 1,895
A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.
🔗 orbit-ml.readthedocs.io/en/stable -
alkaline-ml/pmdarima ⭐ 1,602
A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
🔗 www.alkaline-ml.com/pmdarima -
time-series-foundation-models/lag-llama ⭐ 1,278
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting -
winedarksea/AutoTS ⭐ 1,134
Automated Time Series Forecasting -
autoviml/Auto_TS ⭐ 737
Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Created by Ram Seshadri. Collaborators welcome. -
google/temporian ⭐ 680
Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applications 🤖
🔗 temporian.readthedocs.io
Typing libraries: static and run-time type checking, annotations.
-
python/mypy ⭐ 18,622
Optional static typing for Python
🔗 www.mypy-lang.org -
microsoft/pyright ⭐ 13,532
Static Type Checker for Python -
facebook/pyre-check ⭐ 6,878
Performant type-checking for python.
🔗 pyre-check.org -
python-attrs/attrs ⭐ 5,319
Python Classes Without Boilerplate
🔗 www.attrs.org -
google/pytype ⭐ 4,785
A static type analyzer for Python code
🔗 google.github.io/pytype -
instagram/MonkeyType ⭐ 4,785
A Python library that generates static type annotations by collecting runtime types -
python/typeshed ⭐ 4,407
Collection of library stubs for Python, with static types -
mtshiba/pylyzer ⭐ 2,529
A fast, feature-rich static code analyzer & language server for Python
🔗 mtshiba.github.io/pylyzer -
microsoft/pylance-release ⭐ 1,724
Fast, feature-rich language support for Python. Documentation and issues for Pylance. -
agronholm/typeguard ⭐ 1,565
Run-time type checker for Python -
patrick-kidger/torchtyping ⭐ 1,406
Type annotations and dynamic checking for a tensor's shape, dtype, names, etc. -
robertcraigie/pyright-python ⭐ 185
Python command line wrapper for pyright, a static type checker
🔗 pypi.org/project/pyright
General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools.
-
yt-dlp/yt-dlp ⭐ 92,077
A feature-rich command-line audio/video downloader
🔗 discord.gg/h5mncfw63r -
home-assistant/core ⭐ 74,262
🏡 Open source home automation that puts local control and privacy first.
🔗 www.home-assistant.io -
abi/screenshot-to-code ⭐ 64,962
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
🔗 screenshottocode.com -
python/cpython ⭐ 63,945
The Python programming language
🔗 www.python.org -
localstack/localstack ⭐ 56,687
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
🔗 localstack.cloud -
faif/python-patterns ⭐ 40,600
A collection of design patterns/idioms in Python -
mingrammer/diagrams ⭐ 39,901
🎨 Diagram as Code for prototyping cloud system architectures
🔗 diagrams.mingrammer.com -
ggerganov/whisper.cpp ⭐ 36,072
Port of OpenAI's Whisper model in C/C++ -
keon/algorithms ⭐ 24,133
Minimal examples of data structures and algorithms in Python -
modularml/mojo ⭐ 23,396
The Mojo Programming Language
🔗 docs.modular.com/mojo/manual -
openai/openai-python ⭐ 23,333
The official Python library for the OpenAI API
🔗 pypi.org/project/openai -
norvig/pytudes ⭐ 23,194
Python programs, usually short, of considerable difficulty, to perfect particular skills. -
paul-gauthier/aider ⭐ 22,871
Aider is a command line tool that lets you pair program with LLMs, to edit code stored in your local git repository
🔗 aider.chat -
pydantic/pydantic ⭐ 21,532
Data validation using Python type hints
🔗 docs.pydantic.dev -
squidfunk/mkdocs-material ⭐ 21,298
Documentation that simply works
🔗 squidfunk.github.io/mkdocs-material -
facebookresearch/audiocraft ⭐ 21,106
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning. -
chriskiehl/Gooey ⭐ 20,695
Turn (almost) any Python command line program into a full GUI application with one line -
delgan/loguru ⭐ 20,160
Python logging made (stupidly) simple -
blakeblackshear/frigate ⭐ 19,670
NVR with realtime local object detection for IP cameras
🔗 frigate.video -
micropython/micropython ⭐ 19,527
MicroPython - a lean and efficient Python implementation for microcontrollers and constrained systems
🔗 micropython.org -
mkdocs/mkdocs ⭐ 19,516
Project documentation with Markdown.
🔗 www.mkdocs.org -
rustpython/RustPython ⭐ 19,276
A Python Interpreter written in Rust
🔗 rustpython.github.io -
kivy/kivy ⭐ 17,803
Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
🔗 kivy.org -
higherorderco/Bend ⭐ 17,742
A massively parallel, high-level programming language
🔗 higherorderco.com -
ipython/ipython ⭐ 16,321
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
🔗 ipython.readthedocs.org -
alievk/avatarify-python ⭐ 16,305
Avatars for Zoom, Skype and other video-conferencing apps. -
openai/triton ⭐ 13,614
Development repository for the Triton language and compiler
🔗 triton-lang.org -
zulko/moviepy ⭐ 12,685
Video editing with Python
🔗 zulko.github.io/moviepy -
pyo3/pyo3 ⭐ 12,526
Rust bindings for the Python interpreter
🔗 pyo3.rs -
python-pillow/Pillow ⭐ 12,370
The Python Imaging Library adds image processing capabilities to Python (Pillow is the friendly PIL fork)
🔗 python-pillow.org -
pyodide/pyodide ⭐ 12,365
Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
🔗 pyodide.org/en/stable -
pytube/pytube ⭐ 12,363
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
🔗 pytube.io -
nuitka/Nuitka ⭐ 12,164
Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4-3.13. You feed it your Python app, it does a lot of clever things, and spits out an executable or extension module.
🔗 nuitka.net -
caronc/apprise ⭐ 12,134
Apprise - Push Notifications that work with just about every platform!
🔗 hub.docker.com/r/caronc/apprise -
dbader/schedule ⭐ 11,868
Python job scheduling for humans.
🔗 schedule.readthedocs.io -
ninja-build/ninja ⭐ 11,341
Ninja is a small build system with a focus on speed.
🔗 ninja-build.org -
secdev/scapy ⭐ 10,836
Scapy: the Python-based interactive packet manipulation program & library.
🔗 scapy.net -
asweigart/pyautogui ⭐ 10,536
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard. -
magicstack/uvloop ⭐ 10,473
Ultra fast asyncio event loop. -
pallets/jinja ⭐ 10,420
A very fast and expressive template engine.
🔗 jinja.palletsprojects.com -
aristocratos/bpytop ⭐ 10,212
Linux/OSX/FreeBSD resource monitor -
cython/cython ⭐ 9,584
The most widely used Python to C compiler
🔗 cython.org -
aws/serverless-application-model ⭐ 9,367
The AWS Serverless Application Model (AWS SAM) transform is an AWS CloudFormation macro that transforms SAM templates into CloudFormation templates.
🔗 aws.amazon.com/serverless/sam -
paramiko/paramiko ⭐ 9,176
The leading native Python SSHv2 protocol library.
🔗 paramiko.org -
boto/boto3 ⭐ 9,093
AWS SDK for Python
🔗 aws.amazon.com/sdk-for-python -
facebookresearch/hydra ⭐ 8,861
Hydra is a framework for elegantly configuring complex applications
🔗 hydra.cc -
arrow-py/arrow ⭐ 8,742
🏹 Better dates & times for Python
🔗 arrow.readthedocs.io -
py-pdf/pypdf ⭐ 8,482
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
🔗 pypdf.readthedocs.io/en/latest -
xonsh/xonsh ⭐ 8,467
🐚 Python-powered shell. Full-featured and cross-platform.
🔗 xon.sh -
eternnoir/pyTelegramBotAPI ⭐ 8,159
Python Telegram bot api. -
kellyjonbrazil/jc ⭐ 7,948
CLI tool and python library that converts the output of popular command-line tools, file-types, and common strings to JSON, YAML, or Dictionaries. This allows piping of output to tools like jq and simplifying automation scripts. -
googleapis/google-api-python-client ⭐ 7,836
🐍 The official Python client library for Google's discovery based APIs.
🔗 googleapis.github.io/google-api-python-client/docs -
theskumar/python-dotenv ⭐ 7,710
Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.
🔗 saurabh-kumar.com/python-dotenv -
jasonppy/VoiceCraft ⭐ 7,685
Zero-Shot Speech Editing and Text-to-Speech in the Wild -
googlecloudplatform/python-docs-samples ⭐ 7,485
Code samples used on cloud.google.com -
google/latexify_py ⭐ 7,282
A library to generate LaTeX expression from Python code. -
icloud-photos-downloader/icloud_photos_downloader ⭐ 7,160
A command-line tool to download photos from iCloud -
marshmallow-code/marshmallow ⭐ 7,068
A lightweight library for converting complex objects to and from simple Python datatypes.
🔗 marshmallow.readthedocs.io -
pygithub/PyGithub ⭐ 7,064
Typed interactions with the GitHub API v3
🔗 pygithub.readthedocs.io -
bndr/pipreqs ⭐ 6,867
pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward. -
hugapi/hug ⭐ 6,863
Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler. -
jd/tenacity ⭐ 6,832
Retrying library for Python
🔗 tenacity.readthedocs.io -
pyca/cryptography ⭐ 6,712
cryptography is a package designed to expose cryptographic primitives and recipes to Python developers.
🔗 cryptography.io -
gorakhargosh/watchdog ⭐ 6,657
Python library and shell utilities to monitor filesystem events.
🔗 packages.python.org/watchdog -
sphinx-doc/sphinx ⭐ 6,646
The Sphinx documentation generator
🔗 www.sphinx-doc.org -
openai/point-e ⭐ 6,563
Point cloud diffusion for 3D model synthesis -
timdettmers/bitsandbytes ⭐ 6,385
Accessible large language models via k-bit quantization for PyTorch.
🔗 huggingface.co/docs/bitsandbytes/main/en/index -
ijl/orjson ⭐ 6,336
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy -
agronholm/apscheduler ⭐ 6,335
Task scheduling library for Python -
sdispater/pendulum ⭐ 6,286
Python datetimes made easy
🔗 pendulum.eustace.io -
scikit-image/scikit-image ⭐ 6,112
Image processing in Python
🔗 scikit-image.org -
wireservice/csvkit ⭐ 6,031
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
🔗 csvkit.readthedocs.io -
pdfminer/pdfminer.six ⭐ 5,993
Community maintained fork of pdfminer - we fathom PDF
🔗 pdfminersix.readthedocs.io -
pytransitions/transitions ⭐ 5,810
A lightweight, object-oriented finite state machine implementation in Python with many extensions -
rsalmei/alive-progress ⭐ 5,556
A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations! -
buildbot/buildbot ⭐ 5,282
Python-based continuous integration testing framework; your pull requests are more than welcome!
🔗 www.buildbot.net -
spotify/pedalboard ⭐ 5,269
🎛 🔊 A Python library for audio.
🔗 spotify.github.io/pedalboard -
prompt-toolkit/ptpython ⭐ 5,238
A better Python REPL -
traceloop/openllmetry ⭐ 5,137
Open-source observability for your LLM application, based on OpenTelemetry
🔗 www.traceloop.com/openllmetry -
pycqa/pycodestyle ⭐ 5,047
Simple Python style checker in one Python file
🔗 pycodestyle.pycqa.org -
pywinauto/pywinauto ⭐ 5,031
Windows GUI Automation with Python (based on text properties)
🔗 pywinauto.github.io -
tebelorg/RPA-Python ⭐ 4,980
Python package for doing RPA -
jorgebastida/awslogs ⭐ 4,862
AWS CloudWatch logs for Humans™ -
pythonnet/pythonnet ⭐ 4,811
Python for .NET is a package that gives Python programmers nearly seamless integration with the .NET Common Language Runtime (CLR) and provides a powerful application scripting tool for .NET developers.
🔗 pythonnet.github.io -
pytoolz/toolz ⭐ 4,719
A functional standard library for Python.
🔗 toolz.readthedocs.org -
hhatto/autopep8 ⭐ 4,583
A tool that automatically formats Python code to conform to the PEP 8 style guide.
🔗 pypi.org/project/autopep8 -
pyinvoke/invoke ⭐ 4,422
Pythonic task management & command execution.
🔗 pyinvoke.org -
bogdanp/dramatiq ⭐ 4,378
A fast and reliable background task processing library for Python 3.
🔗 dramatiq.io -
ashleve/lightning-hydra-template ⭐ 4,323
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡ -
adafruit/circuitpython ⭐ 4,140
CircuitPython - a Python implementation for teaching coding with microcontrollers
🔗 circuitpython.org -
blealtan/efficient-kan ⭐ 4,123
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN). -
evhub/coconut ⭐ 4,106
Coconut (coconut-lang.org) is a variant of Python that adds on top of Python syntax new features for simple, elegant, Pythonic functional programming.
🔗 coconut-lang.org -
miguelgrinberg/python-socketio ⭐ 4,027
Python Socket.IO server and client -
ets-labs/python-dependency-injector ⭐ 4,022
Dependency injection framework for Python
🔗 python-dependency-injector.ets-labs.org -
pyo3/maturin ⭐ 4,013
Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages
🔗 maturin.rs -
pyinfra-dev/pyinfra ⭐ 3,950
pyinfra turns Python code into shell commands and runs them on your servers. Execute ad-hoc commands and write declarative operations. Target SSH servers, local machine and Docker containers. Fast and scales from one server to thousands.
🔗 pyinfra.com -
joblib/joblib ⭐ 3,895
Computing with Python functions.
🔗 joblib.readthedocs.org -
rspeer/python-ftfy ⭐ 3,826
Fixes mojibake and other glitches in Unicode text, after the fact.
🔗 ftfy.readthedocs.org -
python-markdown/markdown ⭐ 3,814
A Python implementation of John Gruber’s Markdown with Extension support.
🔗 python-markdown.github.io -
more-itertools/more-itertools ⭐ 3,748
More routines for operating on iterables, beyond itertools
🔗 more-itertools.rtfd.io -
zeromq/pyzmq ⭐ 3,739
PyZMQ: Python bindings for zeromq
🔗 zguide.zeromq.org/py:all -
pydata/xarray ⭐ 3,655
N-D labeled arrays and datasets in Python
🔗 xarray.dev -
hynek/structlog ⭐ 3,604
Simple, powerful, and fast logging for Python.
🔗 www.structlog.org -
tartley/colorama ⭐ 3,576
Simple cross-platform colored terminal text in Python -
spotify/basic-pitch ⭐ 3,520
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
🔗 basicpitch.io -
jorisschellekens/borb ⭐ 3,408
borb is a library for reading, creating and manipulating PDF files in python.
🔗 borbpdf.com -
camelot-dev/camelot ⭐ 3,049
A Python library to extract tabular data from PDFs
🔗 camelot-py.readthedocs.io -
libaudioflux/audioFlux ⭐ 2,929
A library for audio and music analysis, feature extraction.
🔗 audioflux.top -
legrandin/pycryptodome ⭐ 2,865
A self-contained cryptographic library for Python
🔗 www.pycryptodome.org -
tox-dev/pipdeptree ⭐ 2,830
A command line utility to display dependency tree of the installed Python packages
🔗 pypi.python.org/pypi/pipdeptree -
liiight/notifiers ⭐ 2,685
The easy way to send notifications
🔗 notifiers.readthedocs.io -
whylabs/whylogs ⭐ 2,658
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
🔗 whylogs.readthedocs.io -
cdgriffith/Box ⭐ 2,643
Python dictionaries with advanced dot notation access
🔗 github.com/cdgriffith/box/wiki -
pexpect/pexpect ⭐ 2,629
A Python module for controlling interactive programs in a pseudo-terminal
🔗 pexpect.readthedocs.io -
litl/backoff ⭐ 2,614
Python library providing function decorators for configurable backoff and retry -
scrapinghub/dateparser ⭐ 2,572
Python parser for human-readable dates -
pypa/setuptools ⭐ 2,547
Official project repository for the Setuptools build system
🔗 pypi.org/project/setuptools -
comet-ml/opik ⭐ 2,502
Opik is an open-source platform for evaluating, testing and monitoring LLM applications. -
pyston/pyston ⭐ 2,501
(No longer maintained) A faster and highly-compatible implementation of the Python programming language.
🔗 www.pyston.org -
dosisod/refurb ⭐ 2,489
A tool for refurbishing and modernizing Python codebases -
jcrist/msgspec ⭐ 2,475
A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
🔗 jcristharif.com/msgspec -
nschloe/tikzplotlib ⭐ 2,430
📊 Save matplotlib figures as TikZ/PGFplots for smooth integration into LaTeX. -
hgrecco/pint ⭐ 2,418
Operate and manipulate physical quantities in Python
🔗 pint.readthedocs.org -
grantjenks/python-diskcache ⭐ 2,404
Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.
🔗 www.grantjenks.com/docs/diskcache -
dateutil/dateutil ⭐ 2,379
Useful extensions to the standard Python datetime features -
tkem/cachetools ⭐ 2,362
Various memoizing collections and decorators, including variants of the Python Standard Library's @lru_cache function decorator -
pndurette/gTTS ⭐ 2,328
Python library and CLI tool to interface with Google Translate's text-to-speech API
🔗 gtts.readthedocs.org -
pydantic/logfire ⭐ 2,311
Uncomplicated Observability for Python and beyond! 🪵🔥
🔗 logfire.pydantic.dev/docs -
kiminewt/pyshark ⭐ 2,265
Python wrapper for tshark, allowing python packet parsing using wireshark dissectors -
pyparsing/pyparsing ⭐ 2,245
Python library for creating PEG parsers -
astanin/python-tabulate ⭐ 2,195
Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.
🔗 pypi.org/project/tabulate -
rhettbull/osxphotos ⭐ 2,193
Python app to work with pictures and associated metadata from Apple Photos on macOS. Also includes a package to provide programmatic access to the Photos library, pictures, and metadata. -
nateshmbhat/pyttsx3 ⭐ 2,154
Offline Text To Speech synthesis for python -
ianmiell/shutit ⭐ 2,152
Automation framework for programmers
🔗 ianmiell.github.io/shutit -
grahamdumpleton/wrapt ⭐ 2,071
A Python module for decorators, wrappers and monkey patching. -
google/gin-config ⭐ 2,069
Gin provides a lightweight configuration framework for Python -
seperman/deepdiff ⭐ 2,044
DeepDiff: Deep Difference and search of any Python object/data. DeepHash: Hash of any object based on its contents. Delta: Use deltas to reconstruct objects by adding deltas together.
🔗 zepworks.com -
pyfilesystem/pyfilesystem2 ⭐ 1,997
Python's Filesystem abstraction layer
🔗 www.pyfilesystem.org -
omry/omegaconf ⭐ 1,995
Flexible Python configuration system. The last one you will ever need. -
mitmproxy/pdoc ⭐ 1,976
API Documentation for Python Projects
🔗 pdoc.dev -
python-rope/rope ⭐ 1,975
a python refactoring library -
julienpalard/Pipe ⭐ 1,964
A Python library to use infix notation in Python -
landscapeio/prospector ⭐ 1,960
Inspects Python source files and provides information about type and location of classes, methods etc -
numba/llvmlite ⭐ 1,943
A lightweight LLVM python binding for writing JIT compilers
🔗 llvmlite.pydata.org -
carpedm20/emoji ⭐ 1,910
emoji terminal output for Python -
chaostoolkit/chaostoolkit ⭐ 1,895
Chaos Engineering Toolkit & Orchestration for Developers
🔗 chaostoolkit.org -
pydoit/doit ⭐ 1,876
CLI task management & automation tool
🔗 pydoit.org -
hbldh/bleak ⭐ 1,849
A cross platform Bluetooth Low Energy Client for Python using asyncio -
pygments/pygments ⭐ 1,843
Pygments is a generic syntax highlighter written in Python
🔗 pygments.org -
open-telemetry/opentelemetry-python ⭐ 1,823
OpenTelemetry Python API and SDK
🔗 opentelemetry.io -
p0dalirius/Coercer ⭐ 1,815
A python script to automatically coerce a Windows server to authenticate on an arbitrary machine through 12 methods.
🔗 podalirius.net -
joowani/binarytree ⭐ 1,808
Python Library for Studying Binary Trees
🔗 binarytree.readthedocs.io -
samuelcolvin/watchfiles ⭐ 1,801
Simple, modern and fast file watching and code reload in Python.
🔗 watchfiles.helpmanual.io -
home-assistant/supervisor ⭐ 1,800
🏡 Home Assistant Supervisor
🔗 home-assistant.io/hassio -
konradhalas/dacite ⭐ 1,778
Simple creation of data classes from dictionaries. -
rubik/radon ⭐ 1,750
Various code metrics for Python code
🔗 radon.readthedocs.org -
mkdocstrings/mkdocstrings ⭐ 1,747
📘 Automatic documentation from sources, for MkDocs.
🔗 mkdocstrings.github.io -
kalliope-project/kalliope ⭐ 1,720
Kalliope is a framework that will help you to create your own personal assistant.
🔗 kalliope-project.github.io -
quodlibet/mutagen ⭐ 1,582
Python module for handling audio metadata
🔗 mutagen.readthedocs.io -
facebookincubator/Bowler ⭐ 1,572
Safe code refactoring for modern Python.
🔗 pybowler.io -
instagram/LibCST ⭐ 1,570
A concrete syntax tree parser and serializer library for Python that preserves many aspects of Python's abstract syntax tree
🔗 libcst.readthedocs.io -
anthropics/anthropic-sdk-python ⭐ 1,554
SDK providing access to Anthropic's safety-first language model APIs -
imageio/imageio ⭐ 1,518
Python library for reading and writing image data
🔗 imageio.readthedocs.io -
fabiocaccamo/python-benedict ⭐ 1,515
📘 dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xls, xml, yaml), s3 support and many utilities. -
lcompilers/lpython ⭐ 1,506
Python compiler
🔗 lpython.org -
nficano/python-lambda ⭐ 1,501
A toolkit for developing and deploying serverless Python code in AWS Lambda. -
aws-samples/aws-glue-samples ⭐ 1,445
AWS Glue code samples -
lidatong/dataclasses-json ⭐ 1,389
Easily serialize Data Classes to and from JSON -
brandon-rhodes/python-patterns ⭐ 1,341
Source code behind the python-patterns.guide site by Brandon Rhodes -
aio-libs/yarl ⭐ 1,339
Yet another URL library
🔗 yarl.aio-libs.org -
ossf/criticality_score ⭐ 1,338
Gives criticality score for an open source project -
oracle/graalpython ⭐ 1,263
GraalPy – A high-performance embeddable Python 3 runtime for Java
🔗 www.graalvm.org/python -
pypy/pypy ⭐ 1,156
PyPy is a very fast and compliant implementation of the Python language.
🔗 pypy.org -
pyo3/rust-numpy ⭐ 1,146
PyO3-based Rust bindings of the NumPy C-API -
pdoc3/pdoc ⭐ 1,139
🐍 ➡️ 📜 Auto-generate API documentation for Python projects
🔗 pdoc3.github.io/pdoc -
pyfpdf/fpdf2 ⭐ 1,138
Simple PDF generation for Python
🔗 py-pdf.github.io/fpdf2 -
c4urself/bump2version ⭐ 1,064
Version-bump your software with a single command
🔗 pypi.python.org/pypi/bump2version -
fsspec/filesystem_spec ⭐ 1,048
A specification that python filesystems should adhere to. -
milvus-io/pymilvus ⭐ 1,041
Python SDK for Milvus. -
metachris/logzero ⭐ 1,027
Robust and effective logging for Python 2 and 3.
🔗 logzero.readthedocs.io -
extensityai/symbolicai ⭐ 989
Compositional Differentiable Programming Library - divide-and-conquer approach to break down a complex problem into smaller, more manageable problems. -
fastai/fastcore ⭐ 980
Python supercharged for the fastai library
🔗 fastcore.fast.ai -
lastmile-ai/aiconfig ⭐ 965
AIConfig saves prompts, models and model parameters as source control friendly configs. This allows you to iterate on prompts and model parameters separately from your application code.
🔗 aiconfig.lastmileai.dev -
ariebovenberg/whenever ⭐ 906
⏰ Modern datetime library for Python
🔗 whenever.rtfd.io -
barracuda-fsh/pyobd ⭐ 878
An OBD-II compliant car diagnostic tool -
samuelcolvin/dirty-equals ⭐ 805
Doing dirty (but extremely useful) things with equals.
🔗 dirty-equals.helpmanual.io -
qdrant/qdrant-client ⭐ 803
Python client for Qdrant vector search engine
🔗 qdrant.tech -
juanbindez/pytubefix ⭐ 756
Python3 library for downloading YouTube Videos.
🔗 pytubefix.readthedocs.io -
pypa/build ⭐ 748
A simple, correct Python build frontend
🔗 build.pypa.io -
open-telemetry/opentelemetry-python-contrib ⭐ 743
OpenTelemetry instrumentation for Python modules
🔗 opentelemetry.io -
modal-labs/modal-examples ⭐ 741
Examples of programs built using Modal
🔗 modal.com/docs -
gefyrahq/gefyra ⭐ 699
Blazingly-fast 🚀, rock-solid, local application development ➡️ with Kubernetes.
🔗 gefyra.dev -
instagram/Fixit ⭐ 673
Advanced Python linting framework with auto-fixes and hierarchical configuration that makes it easy to write custom in-repo lint rules.
🔗 fixit.rtfd.io/en/latest -
fastai/ghapi ⭐ 627
A delightful and complete interface to GitHub's amazing API
🔗 ghapi.fast.ai -
argoproj-labs/hera ⭐ 620
Hera makes Python code easy to orchestrate on Argo Workflows through native Python integrations. It lets you construct and submit your Workflows entirely in Python. ⭐️ Remember to star!
🔗 hera.rtfd.io -
platformdirs/platformdirs ⭐ 604
A small Python module for determining appropriate platform-specific dirs, e.g. a "user data dir".
🔗 platformdirs.readthedocs.io -
methexis-inc/terminal-copilot ⭐ 573
A smart terminal assistant that helps you find the right command. -
steamship-core/steamship-langchain ⭐ 511
steamship-langchain -
cfahlgren1/observers ⭐ 215
A Lightweight Library for AI Observability -
neuml/annotateai ⭐ 167
Automatically annotates papers using Large Language Models (LLMs)
Visualisation tools and libraries: application frameworks, 2D/3D plotting, dashboards, WebGL.
-
apache/superset ⭐ 63,205
Apache Superset is a Data Visualization and Data Exploration Platform
🔗 superset.apache.org -
streamlit/streamlit ⭐ 36,055
Streamlit — A faster way to build and share data apps.
🔗 streamlit.io -
gradio-app/gradio ⭐ 34,428
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
🔗 www.gradio.app -
plotly/dash ⭐ 21,618
Data Apps & Dashboards for Python. No JavaScript Required.
🔗 plotly.com/dash -
matplotlib/matplotlib ⭐ 20,417
matplotlib: plotting with Python
🔗 matplotlib.org/stable -
danny-avila/LibreChat ⭐ 19,577
LibreChat is a free, open source AI chat platform. This Web UI offers vast customization, supporting numerous AI providers, services, and integrations.
🔗 librechat.ai -
bokeh/bokeh ⭐ 19,426
Interactive Data Visualization in the browser, from Python
🔗 bokeh.org -
plotly/plotly.py ⭐ 16,411
The interactive graphing library for Python ✨ This project now includes Plotly Express!
🔗 plotly.com/python -
mwaskom/seaborn ⭐ 12,647
Statistical data visualization in Python
🔗 seaborn.pydata.org -
visgl/deck.gl ⭐ 12,311
WebGL2 powered visualization framework
🔗 deck.gl -
marceloprates/prettymaps ⭐ 11,187
A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries. -
altair-viz/altair ⭐ 9,422
Declarative visualization library for Python
🔗 altair-viz.github.io -
nvidia/TensorRT-LLM ⭐ 8,841
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT...
🔗 nvidia.github.io/tensorrt-llm -
lux-org/lux ⭐ 5,221
Automatically visualize your pandas dataframe via a single print! 📊 💡 -
renpy/renpy ⭐ 5,100
The Ren'Py Visual Novel Engine
🔗 www.renpy.org -
holoviz/panel ⭐ 4,839
Panel: The powerful data exploration & web app framework for Python
🔗 panel.holoviz.org -
man-group/dtale ⭐ 4,788
Visualizer for pandas data structures
🔗 alphatechadmin.pythonanywhere.com -
has2k1/plotnine ⭐ 4,072
A Grammar of Graphics for Python
🔗 plotnine.org -
residentmario/missingno ⭐ 3,975
missingno provides a small toolset of flexible and easy-to-use missing data visualizations and utilities that allows you to get a quick visual summary of the completeness (or lack thereof) of your dataset. -
pyqtgraph/pyqtgraph ⭐ 3,915
Fast data visualization and GUI tools for scientific / engineering applications
🔗 www.pyqtgraph.org -
ml-tooling/opyrator ⭐ 3,109
🪄 Turns your machine learning code into microservices with web API, interactive GUI, and more.
🔗 opyrator-playground.mltooling.org -
netflix/flamescope ⭐ 3,030
FlameScope is a visualization tool for exploring different time ranges as Flame Graphs. -
pyvista/pyvista ⭐ 2,771
3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
🔗 docs.pyvista.org -
facebookresearch/hiplot ⭐ 2,768
HiPlot makes understanding high dimensional data easy
🔗 facebookresearch.github.io/hiplot -
mckinsey/vizro ⭐ 2,730
Vizro is a toolkit for creating modular data visualization applications.
🔗 vizro.readthedocs.io/en/stable -
holoviz/holoviews ⭐ 2,711
With Holoviews, your data visualizes itself.
🔗 holoviews.org -
kozea/pygal ⭐ 2,672
pygal is a dynamic SVG charting library written in python.
🔗 www.pygal.org -
napari/napari ⭐ 2,229
A fast, interactive, multi-dimensional image viewer for Python. It's designed for browsing, annotating, and analyzing large multi-dimensional images.
🔗 napari.org -
marcomusy/vedo ⭐ 2,061
A python module for scientific analysis of 3D data based on VTK and Numpy
🔗 vedo.embl.es -
datapane/datapane ⭐ 1,384
Build and share data reports in 100% Python
🔗 datapane.com -
facultyai/dash-bootstrap-components ⭐ 1,123
Bootstrap components for Plotly Dash
🔗 dash-bootstrap-components.opensource.faculty.ai -
nomic-ai/deepscatter ⭐ 1,062
Zoomable, animated scatterplots in the browser that scale to over a billion points -
holoviz/holoviz ⭐ 849
High-level tools to simplify visualization in Python.
🔗 holoviz.org -
hazyresearch/meerkat ⭐ 830
Creative interactive views of any dataset.
Web related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management.
-
django/django ⭐ 81,349
The Web framework for perfectionists with deadlines.
🔗 www.djangoproject.com -
tiangolo/fastapi ⭐ 78,385
FastAPI framework, high performance, easy to learn, fast to code, ready for production
🔗 fastapi.tiangolo.com -
pallets/flask ⭐ 68,258
The Python micro framework for building web applications.
🔗 flask.palletsprojects.com -
sherlock-project/sherlock ⭐ 60,852
Hunt down social media accounts by username across social networks
🔗 sherlockproject.xyz -
psf/requests ⭐ 52,262
A simple, yet elegant, HTTP library.
🔗 requests.readthedocs.io/en/latest -
tornadoweb/tornado ⭐ 21,789
Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed.
🔗 www.tornadoweb.org -
reflex-dev/reflex ⭐ 20,656
🕸️ Web apps in pure Python 🐍
🔗 reflex.dev -
wagtail/wagtail ⭐ 18,482
A Django content management system focused on flexibility and user experience
🔗 wagtail.org -
huge-success/sanic ⭐ 18,129
Accelerate your web app development | Build fast. Run fast.
🔗 sanic.dev -
pyscript/pyscript ⭐ 18,053
A framework that allows users to create rich Python applications in the browser using HTML's interface and the power of Pyodide, WASM, and modern web technologies.
🔗 pyscript.net -
vincigit00/Scrapegraph-ai ⭐ 16,098
ScrapeGraphAI is a web scraping python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents
🔗 scrapegraphai.com -
aio-libs/aiohttp ⭐ 15,196
Asynchronous HTTP client/server framework for asyncio and Python
🔗 docs.aiohttp.org -
encode/httpx ⭐ 13,389
A next generation HTTP client for Python. 🦋
🔗 www.python-httpx.org -
getpelican/pelican ⭐ 12,625
Static site generator that supports Markdown and reST syntax. Powered by Python.
🔗 getpelican.com -
flet-dev/flet ⭐ 11,650
Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.
🔗 flet.dev -
aws/chalice ⭐ 10,681
Python Serverless Microframework for AWS -
encode/starlette ⭐ 10,337
The little ASGI framework that shines. 🌟
🔗 www.starlette.io -
zauberzeug/nicegui ⭐ 10,281
Create web-based user interfaces with Python. The nice way.
🔗 nicegui.io -
benoitc/gunicorn ⭐ 9,892
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
🔗 www.gunicorn.org -
falconry/falcon ⭐ 9,542
The no-magic web API and microservices framework for Python developers, with a focus on reliability, correctness, and performance at scale.
🔗 falcon.readthedocs.io -
encode/uvicorn ⭐ 8,666
An ASGI web server, for Python. 🦄
🔗 www.uvicorn.org -
bottlepy/bottle ⭐ 8,468
bottle.py is a fast and simple micro-framework for python web-applications.
🔗 bottlepy.org -
graphql-python/graphene ⭐ 8,109
GraphQL framework for Python
🔗 graphene-python.org -
reactive-python/reactpy ⭐ 7,899
ReactPy is a library for building user interfaces in Python without Javascript
🔗 reactpy.dev -
vitalik/django-ninja ⭐ 7,391
💨 Fast, Async-ready, Openapi, type hints based framework for building APIs
🔗 django-ninja.dev -
pyeve/eve ⭐ 6,715
REST API framework designed for human beings
🔗 python-eve.org -
pallets/werkzeug ⭐ 6,668
The comprehensive WSGI web application library.
🔗 werkzeug.palletsprojects.com -
webpy/webpy ⭐ 5,889
web.py is a web framework for python that is as simple as it is powerful.
🔗 webpy.org -
starlite-api/litestar ⭐ 5,673
Production-ready, Light, Flexible and Extensible ASGI API framework | Effortlessly Build Performant APIs
🔗 litestar.dev -
stephenmcd/mezzanine ⭐ 4,765
CMS framework for Django
🔗 mezzanine.jupo.org -
fastapi-users/fastapi-users ⭐ 4,724
Ready-to-use and customizable users management for FastAPI
🔗 fastapi-users.github.io/fastapi-users -
nameko/nameko ⭐ 4,716
A microservices framework for Python that lets service developers concentrate on application logic and encourages testability.
🔗 www.nameko.io -
pywebio/PyWebIO ⭐ 4,568
Write interactive web apps in a script-like way.
🔗 pywebio.readthedocs.io -
strawberry-graphql/strawberry ⭐ 4,045
A GraphQL library for Python that leverages type annotations 🍓
🔗 strawberry.rocks -
h2oai/wave ⭐ 4,019
H2O Wave is a software stack for building beautiful, low-latency, realtime, browser-based applications and dashboards entirely in Python/R without using HTML, Javascript, or CSS.
🔗 wave.h2o.ai -
pylons/pyramid ⭐ 3,978
Pyramid - A Python web framework
🔗 trypyramid.com -
websocket-client/websocket-client ⭐ 3,591
WebSocket client for Python
🔗 github.com/websocket-client/websocket-client -
unbit/uwsgi ⭐ 3,466
uWSGI application server container
🔗 projects.unbit.it/uwsgi -
pallets/quart ⭐ 3,059
An async Python micro framework for building web applications.
🔗 quart.palletsprojects.com -
fastapi-admin/fastapi-admin ⭐ 2,947
A fast admin dashboard based on FastAPI and TortoiseORM with tabler ui, inspired by Django admin
🔗 fastapi-admin-docs.long2ice.io -
flipkart-incubator/Astra ⭐ 2,521
Automated Security Testing For REST APIs -
dot-agent/nextpy ⭐ 2,232
🤖Self-Modifying Framework from the Future 🔮 World's First AMS
🔗 dotagent.ai -
masoniteframework/masonite ⭐ 2,218
The Modern And Developer Centric Python Web Framework. Be sure to read the documentation and join the Discord channel for questions: https://discord.gg/TwKeFahmPZ
🔗 docs.masoniteproject.com -
python-restx/flask-restx ⭐ 2,168
Fork of Flask-RESTPlus: Fully featured framework for fast, easy and documented API development with Flask
🔗 flask-restx.readthedocs.io/en/latest -
s3rius/FastAPI-template ⭐ 2,033
Feature rich robust FastAPI template. -
dmontagu/fastapi-utils ⭐ 1,960
Reusable utilities for FastAPI: a number of utilities to help reduce boilerplate and reuse common functionality across projects
🔗 fastapiutils.github.io/fastapi-utils -
neoteroi/BlackSheep ⭐ 1,897
Fast ASGI web framework for Python
🔗 www.neoteroi.dev/blacksheep -
cherrypy/cherrypy ⭐ 1,859
CherryPy is a pythonic, object-oriented HTTP framework. https://cherrypy.dev
🔗 docs.cherrypy.dev -
indico/indico ⭐ 1,794
Indico - A feature-rich event management system, made @ CERN, the place where the Web was born.
🔗 getindico.io -
jordaneremieff/mangum ⭐ 1,736
AWS Lambda support for ASGI applications
🔗 mangum.fastapiexpert.com -
wtforms/wtforms ⭐ 1,515
A flexible forms validation and rendering library for Python.
🔗 wtforms.readthedocs.io -
awtkns/fastapi-crudrouter ⭐ 1,422
A dynamic FastAPI router that automatically creates CRUD routes for your models
🔗 fastapi-crudrouter.awtkns.com -
long2ice/fastapi-cache ⭐ 1,386
fastapi-cache is a tool to cache FastAPI responses and function results, with backend support for Redis and Memcached.
🔗 github.com/long2ice/fastapi-cache -
rstudio/py-shiny ⭐ 1,327
Shiny for Python
🔗 shiny.posit.co/py -
whitphx/stlite ⭐ 1,255
A port of Streamlit to WebAssembly, powered by Pyodide.
🔗 edit.share.stlite.net -
magicstack/httptools ⭐ 1,208
Fast HTTP parser -
koxudaxi/fastapi-code-generator ⭐ 1,080
This code generator creates a FastAPI app from an OpenAPI file. -
aeternalis-ingenium/FastAPI-Backend-Template ⭐ 656
A backend project template with FastAPI, PostgreSQL with asynchronous SQLAlchemy 2.0, Alembic for asynchronous database migration, and Docker.
Interactive version: www.awesomepython.org, Hugging Face Dataset: awesome-python
Please raise a new issue to suggest a Python repo that you would like to see added.
1,632 hand-picked awesome Python libraries and frameworks, updated 19 Dec 2024