OpenAI API client library for Rust (unofficial)
Updated Mar 14, 2025 - Rust
This benchmark tests how well LLMs incorporate a set of 10 mandatory story elements (characters, objects, core concepts, attributes, motivations, etc.) in a short creative story.
A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private conversations, form alliances, and vote to eliminate each other.
A benchmark that evaluates LLMs using 601 NYT Connections puzzles extended with extra trick words.
Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLMs to engage in public conversation before secretly picking a move (1, 3, or 5 steps). Whenever two or more players choose the same number, all colliding players fail to advance.
Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a small set of examples and anti-examples, then detect which item truly fits that theme among a collection of misleading candidates.
Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies among Large Language Models (LLMs) in a resource-sharing economic scenario. Our experiment extends the classic PGG with a punishment phase, allowing players to penalize free-riders or retaliate against others.
The GenAI API wrapper for Delphi is designed to integrate OpenAI’s latest models (GPT-4o, o1, o3, and GPT-4.5) seamlessly, offering robust features for chat interactions, text generation, vision processing, audio analysis, JSON configuration, web search, and asynchronous operations, with efficient error handling and testing support.
LLM-driven software development helper.
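The collision rule in the step-race description above (each player secretly picks 1, 3, or 5 steps, and any players who pick the same number do not advance) can be illustrated with a minimal sketch. This is not the benchmark's actual code: the function name `resolve_round` and the dictionary-based representation are hypothetical, and the conversation phase and winning condition are not modeled.

```python
from collections import Counter

# Moves named in the description; everything else here is an illustrative assumption.
ALLOWED_MOVES = (1, 3, 5)


def resolve_round(positions, picks):
    """Apply one round of secret picks and return the updated positions.

    positions: dict mapping player name -> current step count
    picks:     dict mapping player name -> chosen move (1, 3, or 5)
    """
    for player, move in picks.items():
        if move not in ALLOWED_MOVES:
            raise ValueError(f"{player} chose an invalid move: {move}")

    # Count how many players chose each move.
    counts = Counter(picks.values())

    updated = dict(positions)
    for player, move in picks.items():
        # A player advances only if nobody else chose the same number.
        if counts[move] == 1:
            updated[player] = updated.get(player, 0) + move
    return updated


example = resolve_round({"A": 0, "B": 0, "C": 0}, {"A": 5, "B": 5, "C": 3})
```

In this example, players A and B collide on 5 and stay in place, while C alone picked 3 and advances by three steps.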