Ongoing research into AI agent framework security identified an exploit chain in AutoGen Studio (AutoGen’s open-source prototyping user interface) that allows untrusted web content rendered by a ...
Ready to make a real impact in South Africa’s financial landscape? Our client, a leading and forward-thinking player in the Financial Services sector, is searching for a skilled and passionate SQL ...
Skill Eval Harness is a Python CLI for testing whether an Agent Skill changes observable output. It reads evals/shared-benchmark.json, emits answer-key-safe task rows, grades files under eval-runs/, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results