Skip to content

Questions about Verl training process #16

@Skyorca

Description

@Skyorca

Hello, may I ask which version of Verl was used for training? I noticed that the inference process uses a continuation-based approach, as shown in the /src/run_deep_agent.py. However, the Verl 0.7 training framework, specifically the agent_loop mode, only supports multi-turn dialogue format for inference.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions