OpenEnv FastAPI Multi-Agent v2.0.0
An OpenEnv-compliant pen-testing environment that teaches Llama 3.2 3B to do real security audit reasoning, not pattern matching. Built for the Meta PyTorch OpenEnv Hackathon × SST Bangalore 2026.
6.4× avg score lift
Llama 3.2 3B post-GRPO: 0.075 → 0.482
3 hackathon themes
World Modeling · Multi-Agent · Long-Horizon
11-component grader
Reward-hacking-resistant by design
POST /reset + POST /step: OpenEnv standard episode protocol
This Space speaks the OpenEnv protocol. From any environment with openenv-core installed:
from security_audit_env import SecurityAuditEnv, SecurityAuditAction

# Open a synchronous client session against the hosted Space.
with SecurityAuditEnv(base_url="https://Sayuj63-Vapt-env.hf.space").sync() as env:
    # Start a new episode on the "easy" scenario.
    r = env.reset(scenario_id="easy")
    # Take one step: run the network_scan tool against a subnet.
    rs = env.step(SecurityAuditAction(
        action_type="use_tool",
        tool_name="network_scan",
        arguments={"target": "10.0.1.0/24"},
    ))
    print(rs.observation.discovered_hosts)
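For clients that cannot install openenv-core, the same two calls can in principle be made over plain HTTP. The following is a minimal sketch, not the Space's documented wire format: the helper names (`build_request`, `reset_request`, `step_request`, `send`) are hypothetical, and the JSON body shapes (a flat object for /reset, action fields nested under an "action" key for /step) are assumptions that should be checked against the server before use.

```python
import json
import urllib.request

# Space URL taken from the client example above.
BASE_URL = "https://Sayuj63-Vapt-env.hf.space"


def build_request(path, payload, base_url=BASE_URL):
    """Build (but do not send) a JSON POST request for one endpoint."""
    return urllib.request.Request(
        url=base_url.rstrip("/") + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def reset_request(scenario_id):
    # ASSUMPTION: /reset accepts its keyword arguments as a flat JSON object.
    return build_request("/reset", {"scenario_id": scenario_id})


def step_request(tool_name, arguments):
    # ASSUMPTION: /step expects the action fields nested under an "action" key.
    action = {
        "action_type": "use_tool",
        "tool_name": tool_name,
        "arguments": arguments,
    }
    return build_request("/step", {"action": action})


def send(request, timeout=30):
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `send(reset_request("easy"))` followed by `send(step_request("network_scan", {"target": "10.0.1.0/24"}))` would mirror the client example. Building requests separately from sending them keeps the payloads easy to inspect; whether the server keeps episode state across independent HTTP calls is also an assumption here, so the openenv-core client remains the safer path.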
Full README on the GitHub repo: scroll for the full architecture, the multi-dimensional grader, and the GRPO post-training results.
Built for Meta PyTorch OpenEnv Hackathon × SST Bangalore 2026