- Add the Tool-Integrated Reasoning (TIR) agent for Qwen2.5-Math.
- Add support for caching the LLM's responses. LLM caching can be enabled by providing
cache_dir
in the LLM's config, i.e.,llm = get_chat_model(cfg={..., "cache_dir": "/tmp/directory_to_save_llm_cache"})
. - Add self-play agents to simulate a dialogue between a user and an AI agent, mainly for data synthesis and testing purposes.