Don't aggregate kernel records into hipLaunch parent #182

Open
michaelmckinsey1 wants to merge 1 commit into llnl:develop from michaelmckinsey1:rocm-activity-profile

@michaelmckinsey1 michaelmckinsey1 commented Jun 10, 2026

Collaborator

Currently, when records are profiled from different threads (for example, rocm gputime together with rocprofiler counters), the counters are aggregated into the hipLaunchKernel region.

image

(The red shows where this PR is moving the data)

This is problematic when multiple kernels are launched within a single CPU region, because their counters can no longer be distinguished once they are aggregated.
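To illustrate the loss of information, here is a toy sketch in Python. It is not the code from this PR and does not use any real profiler API: the record layout, the kernel names, the counter values, and both helper functions are hypothetical and exist only to show the difference between the two aggregation keys.

```python
from collections import defaultdict

# Hypothetical activity records: (enclosing CPU region, kernel name, counter value).
# The kernel names and values are made up for illustration.
records = [
    ("hipLaunchKernel", "kernel_a", 100),
    ("hipLaunchKernel", "kernel_b", 250),
]


def aggregate_into_parent(records):
    """Sum counters under the enclosing CPU region only (per-kernel detail is lost)."""
    totals = defaultdict(int)
    for region, _kernel, value in records:
        totals[region] += value
    return dict(totals)


def keep_per_kernel(records):
    """Sum counters per (region, kernel) pair, so each kernel stays distinguishable."""
    totals = defaultdict(int)
    for region, kernel, value in records:
        totals[(region, kernel)] += value
    return dict(totals)


by_parent = aggregate_into_parent(records)
by_kernel = keep_per_kernel(records)
```

With the parent-only key, both kernels collapse into a single total for the region. With the kernel included in the key, each kernel keeps its own counter value.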

@michaelmckinsey1 michaelmckinsey1 self-assigned this Jun 10, 2026