Overview Repositories 71 Projects 0 Packages 0 Stars 178

Jonathan Tow jon-tow

71 followers · 4 following

https://jon-tow.github.io


codeaplaca-personified Public

Python Updated Nov 25, 2024
modded-nanogpt Public
Forked from KellerJordan/modded-nanogpt

NanoGPT (124M) in 5 minutes

Python 1 MIT License Updated Nov 22, 2024
ml-cross-entropy Public
Forked from apple/ml-cross-entropy

Python Other Updated Nov 19, 2024
jon-tow.github.io Public

My personal website

HTML Updated Oct 14, 2024
ProX Public
Forked from GAIR-NLP/ProX

Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"

Python Apache License 2.0 Updated Oct 11, 2024
torchtitan Public
Forked from pytorch/torchtitan

A native PyTorch Library for large model training

Python BSD 3-Clause "New" or "Revised" License Updated Sep 13, 2024
mini-omni Public
Forked from gpt-omni/mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python MIT License Updated Sep 4, 2024
grouped_gemm Public
Forked from tgale96/grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM.

Cuda Apache License 2.0 Updated Aug 26, 2024
WaveCoder Public
Forked from microsoft/WaveCoder

Advancing LLM with Diverse Coding Capabilities

Python MIT License Updated Aug 2, 2024
bigcode-evaluation-harness Public
Forked from bigcode-project/bigcode-evaluation-harness

A framework for the evaluation of autoregressive code generation language models.

Python Apache License 2.0 Updated Jul 15, 2024
efficient_cross_entropy Public
Forked from mgmalek/efficient_cross_entropy

Python MIT License Updated May 28, 2024
triton Public
Forked from triton-lang/triton

Development repository for the Triton language and compiler

C++ MIT License Updated May 24, 2024
cs224n Public

Solutions to CS224n: Natural Language Processing with Deep Learning assignments.

JavaScript 71 34 Updated May 3, 2024
zero-bubble-pipeline-parallelism Public
Forked from sail-sg/zero-bubble-pipeline-parallelism

Zero Bubble Pipeline Parallelism

Python Other Updated Apr 29, 2024
transformers Public
Forked from huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python Apache License 2.0 Updated Apr 11, 2024
scattermoe Public
Forked from shawntan/scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Python Apache License 2.0 Updated Mar 14, 2024
ring-flash-attention Public
Forked from zhuzilin/ring-flash-attention

Ring attention implementation with flash attention

Python Updated Feb 27, 2024
megablocks Public
Forked from databricks/megablocks

Python Apache License 2.0 Updated Feb 18, 2024
english-wordnet Public
Forked from globalwordnet/english-wordnet

The Open English WordNet

Python Other Updated Dec 5, 2023
ml-engineering Public
Forked from stas00/ml-engineering

Machine Learning Engineering Guides and Tools

Python Creative Commons Attribution Share Alike 4.0 International Updated Nov 8, 2023
text-dedup Public
Forked from ChenghaoMou/text-dedup

All-in-one text de-duplication

Jupyter Notebook Apache License 2.0 Updated Nov 4, 2023
gpt-neox Public
Forked from EleutherAI/gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Python Apache License 2.0 Updated Sep 5, 2023
Megatron-LLM Public
Forked from epfLLM/Megatron-LLM

distributed trainer for LLMs

Python Other Updated Sep 1, 2023
rerope Public
Forked from bojone/rerope

Rectified Rotary Position Embeddings

Python Updated Aug 7, 2023
flash-attention Public
Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python BSD 3-Clause "New" or "Revised" License Updated Jul 29, 2023
scaled-rope Public
Forked from jquesnelle/yarn

Python MIT License Updated Jul 26, 2023
contriever Public
Forked from CarperAI/contriever

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python Other Updated Jul 3, 2023
dynamic-sparse-flash-attention Public
Forked from epfml/dynamic-sparse-flash-attention

Jupyter Notebook Other Updated Jun 2, 2023
goodreads Public
Forked from MengtingWan/goodreads

code samples for the goodreads datasets

Jupyter Notebook Apache License 2.0 Updated May 29, 2023
DeepSpeed Public
Forked from deepspeedai/DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python Apache License 2.0 Updated May 27, 2023
