Question about loading LLaMA-2 7B on the LLM context extension stage #71

@ImKeTT

Great work! I noticed that you initialized the LWM model from the LLaMA-2 7B model, but I couldn't find where it is loaded (certainly not in scripts/run_train_text.sh). Could you tell me which model weights you loaded and how to load them for the Stage I LLM tuning? Thanks!
