Skip to content

Questions about the hardware specs for fiddler #7

@fangyu29

Description

@fangyu29

Copying 300MB weights parameters (one expert of mixtral-8x7b) from cpu to gpu requiring 50ms indicates that the PCIe bandwidth is only 0.3GB/50ms = 6GB/s, which is much slower than the reported L4 gpu's PCIe bandwidth (PCIe Gen4 x16 64GB/s) in https://www.nvidia.com/en-us/data-center/l4/ , is there any explanation about it? Thanks.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions