Palisade Research
We build concrete demonstrations of dangerous capabilities to advise policy makers and the public on AI risks.
- 22 followers
- United States of America
- https://palisaderesearch.org
- @palisadeai
Popular repositories Loading
-
-
-
pass-interview
pass-interview PublicForked from openai/openai-realtime-console
Voice interview agent passes a technical screen
-
Repositories
Showing 10 of 16 repositories
- mle-bench Public Forked from openai/mle-bench
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
PalisadeResearch/mle-bench’s past year of commit activity - lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
PalisadeResearch/lm-evaluation-harness’s past year of commit activity - llm-honeypot Public
PalisadeResearch/llm-honeypot’s past year of commit activity - inspect_ai Public Forked from UKGovernmentBEIS/inspect_ai
Inspect: A framework for large language model evaluations
PalisadeResearch/inspect_ai’s past year of commit activity - pass-interview Public Forked from openai/openai-realtime-console
Voice interview agent passes a technical screen
PalisadeResearch/pass-interview’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…