Update public README with project info and include new docs pointers.

PiperOrigin-RevId: 705944447
google · Dec 13, 2024 · eff605a · eff605a
1 parent 9f6b6a5
commit eff605a
Show file tree

Hide file tree

Showing 5 changed files with 183 additions and 135 deletions.
diff --git a/README.md b/README.md
@@ -1,11 +1,56 @@
 # Grain - Feeding JAX Models
 
-Grain is a library for reading data for training and evaluating JAX models. It's
-open source, fast and deterministic.
+[![Continuous integration](https://github.com/google/grain/actions/workflows/tests.yaml/badge.svg)](https://github.com/google/grain/actions/workflows/tests.yaml)
+[![PyPI version](https://img.shields.io/pypi/v/grain)](https://pypi.org/project/grain/)
 
-* Installation: `pip install grain`
-* [Docs](https://github.com/google/grain/tree/main/docs)
-* Grain is used by [MaxText](https://github.com/google/maxtext/tree/main), a simple, performant and scalable JAX codebase for LLM.
 
-Check out [`tutorials/`](./tutorials) for more information on how to use Grain!
+[**Installation**](#installation)
+| [**Quickstart**](#quickstart)
+| [**Reference docs**](https://google-grain.readthedocs.io/en/latest/)
 
+Grain is a Python library for reading data for training and evaluating JAX
+models. It is flexible, fast and deterministic.
+
+Grain allows to define data processing steps in a simple declarative way:
+
+```python
+import grain.python as grain
+
+dataset = (
+    grain.MapDataset.source([0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10])
+    .shuffle(seed=10)  # Shuffles elements globally.
+    .map(lambda x: x+1)  # Maps each element.
+    .batch(batch_size=2)  # Batches consecutive elements.
+)
+
+for batch in dataset:
+  # Training step.
+```
+
+Grain is designed to work with JAX models but it does not require JAX to run
+and can be used with other frameworks as well.
+
+## Installation
+
+Grain is available on [PyPI](https://pypi.org/project/grain/) and can be
+installed with `pip install grain`.
+
+### Supported platforms
+
+Grain does not directly use GPU or TPU in its transformations, the processing
+within Grain will be done on the CPU by default.
+
+|         |  Linux  |   Mac   | Windows |
+|---------|---------|---------|---------|
+| x86_64  | yes     | WIP     | no      |
+| aarch64 | yes     | WIP     | n/a     |
+
+## Quickstart
+
+- [Basic `Dataset` tutorial](https://google-grain.readthedocs.io/en/latest/tutorials/dataset_basic_tutorial.html)
+
+## Existing users
+
+Grain is used by [MaxText](https://github.com/google/maxtext/tree/main),
+[kauldron](https://github.com/google-research/kauldron) and multiple internal
+Google projects.
diff --git a/docs/conf.py b/docs/conf.py
@@ -49,7 +49,7 @@
 # https://www.sphinx-doc.org/en/master/usage/configuration.html#options-for-html-output
 
 html_theme = 'sphinx_book_theme'
-html_title = 'PyGrain'
+html_title = 'Grain'
 html_static_path = ['_static']
 
 # TODO: Add logo and favicon