Examples
HyperFlow comes with multiple real-world examples showing standard configurations, local improvement loops, or evolutionary setups.
You can run single evaluations to test if an agent behaves well out-of-the-box, or you can invoke evolutionary loops so the agent writes patches to its own logic!
Example Domains
- Bash: Generates terminal commands.
- Scoring: Grade student math answers (accept/reject).
- Calculator: Solve math problems using a tool.
- Fact-check: Classify statements as true/false.
- Paper Review: Predict accept/reject for research paper abstracts.
- Git Evolution: A comprehensive standard flow utilizing patches across an isolated git structure branch.
Running the Examples
Evaluate Single Run
Execute run.py to evaluate the current solver logic strictly once.
cd examples/bash && python run.py
cd examples/factcheck && python run.py
cd examples/paper_review && python run.py
Run Evolutionary Self-Improvement
Execute run.py evolve (or via specific script parameters depending on the example) to trigger the evolutionary loop.
cd examples/bash && python run.py evolve
cd examples/factcheck && python run.py evolve
cd examples/scoring && python run.py
cd examples/calculator && python run.py
cd examples/git_evolution && python run.py
For the git-based evolution:
cd examples/git_evolution && python run.py # 2 generations
cd examples/git_evolution && python run.py 5 # 5 generations
cd examples/git_evolution && python run.py --reset # Start over
