Skip to content
@AlignmentResearch

FAR.AI

Frontier alignment research to ensure the safe development and deployment of advanced AI systems.

Popular repositories Loading

  1. tuned-lens tuned-lens Public

    Tools for understanding how transformer predictions are built layer-by-layer

    Python 485 53

  2. go_attack go_attack Public

    Python 85 7

  3. vlmrm vlmrm Public

    Python 52 15

  4. gpt-4-novel-apis-attacks gpt-4-novel-apis-attacks Public

    20 1

  5. learned-planner learned-planner Public

    Interpretability tools for recurrent convolutional networks (DRC) that play Sokoban

    Python 12 3

  6. scaling-poisoning scaling-poisoning Public

    Python 8 2

Repositories

Showing 10 of 41 repositories
  • alignment-workshop-website Public

    Stub website for redirects to new FAR.AI website

    AlignmentResearch/alignment-workshop-website’s past year of commit activity
    0 0 0 0 Updated Apr 14, 2025
  • train-learned-planner Public

    Experimenting with CleanRL for learned-planners

    AlignmentResearch/train-learned-planner’s past year of commit activity
    Python 5 1 1 2 Updated Apr 9, 2025
  • envpool Public Forked from sail-sg/envpool

    C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

    AlignmentResearch/envpool’s past year of commit activity
    C++ 1 Apache-2.0 115 0 1 Updated Apr 8, 2025
  • learned-planners-stable-baselines3 Public Forked from AlignmentResearch/stable-baselines3

    PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

    AlignmentResearch/learned-planners-stable-baselines3’s past year of commit activity
    Python 2 MIT 1,886 0 0 Updated Apr 2, 2025
  • gym-sokoban Public

    Sokoban environment for Gym

    AlignmentResearch/gym-sokoban’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Apr 2, 2025
  • kubespray Public Forked from kubernetes-sigs/kubespray

    Deploy a Production Ready Kubernetes Cluster

    AlignmentResearch/kubespray’s past year of commit activity
    Jinja 0 Apache-2.0 6,733 0 0 Updated Mar 31, 2025
  • DeepGEMM Public Forked from deepseek-ai/DeepGEMM

    DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

    AlignmentResearch/DeepGEMM’s past year of commit activity
    Cuda 0 MIT 561 0 0 Updated Mar 30, 2025
  • harmtune Public
    AlignmentResearch/harmtune’s past year of commit activity
    Python 3 0 2 0 Updated Mar 26, 2025
  • refusal_direction Public Forked from andyrdt/refusal_direction

    Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".

    AlignmentResearch/refusal_direction’s past year of commit activity
    Python 1 Apache-2.0 46 0 0 Updated Mar 19, 2025
  • kueue Public Forked from kubernetes-sigs/kueue

    Kubernetes-native Job Queueing

    AlignmentResearch/kueue’s past year of commit activity
    Go 0 Apache-2.0 324 0 0 Updated Mar 18, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…