II.
LibraryProcess overview
Reference · livelib-process:gpu-programming--performance-profiling-analysis
specializations/gpu-programming/performance-profiling-analysis overview
Performance Profiling and Analysis - Comprehensive workflow for profiling GPU applications, identifying bottlenecks, and measuring performance against theoretical limits.
Attributes
displayName
specializations/gpu-programming/performance-profiling-analysis
description
Performance Profiling and Analysis - Comprehensive workflow for profiling GPU applications,
identifying bottlenecks, and measuring performance against theoretical limits.
libraryPath
library/specializations/gpu-programming/performance-profiling-analysis.js
specialization
gpu-programming
references
- Nsight Systems Documentation: https://docs.nvidia.com/nsight-systems/
- Nsight Compute Documentation: https://docs.nvidia.com/nsight-compute/
- Roofline Model: https://crd.lbl.gov/divisions/amcr/computer-science-amcr/par/research/roofline/
example
const result = await orchestrate('specializations/gpu-programming/performance-profiling-analysis', {
  projectName: 'neural_network_ops',
  targetKernels: ['conv2d', 'gemm', 'relu'],
  profilingTool: 'nsight-compute',
  createRoofline: true
});
usesAgents
- gpu-performance-engineer
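The description mentions measuring performance against theoretical limits, and the example sets createRoofline. As a rough illustration of what a roofline comparison involves, the sketch below applies the standard roofline bound: attainable throughput is the smaller of peak compute and peak bandwidth times arithmetic intensity. The function names (rooflineGflops, classifyKernel) and the shape of the kernel object are hypothetical and are not part of this library process. Any hardware figures passed in are the caller's assumptions.

```javascript
// Minimal roofline sketch. Names and object shapes are illustrative
// assumptions, not the API of the library process described above.

/**
 * Attainable throughput under the roofline model.
 * @param {number} peakGflops          peak compute throughput (GFLOP/s)
 * @param {number} peakBandwidthGBs    peak memory bandwidth (GB/s)
 * @param {number} arithmeticIntensity FLOPs per byte of memory traffic
 * @returns {number} attainable GFLOP/s
 */
function rooflineGflops(peakGflops, peakBandwidthGBs, arithmeticIntensity) {
  return Math.min(peakGflops, peakBandwidthGBs * arithmeticIntensity);
}

/**
 * Classify one measured kernel against the roofline.
 * kernel = { name, flops, bytes, seconds } (hypothetical shape).
 */
function classifyKernel(peakGflops, peakBandwidthGBs, kernel) {
  const intensity = kernel.flops / kernel.bytes;           // FLOP/byte
  const ridgePoint = peakGflops / peakBandwidthGBs;        // FLOP/byte at the roof's knee
  const attainableGflops = rooflineGflops(peakGflops, peakBandwidthGBs, intensity);
  const achievedGflops = kernel.flops / kernel.seconds / 1e9;
  return {
    name: kernel.name,
    intensity,
    bound: intensity < ridgePoint ? 'memory' : 'compute',
    attainableGflops,
    achievedGflops,
    fractionOfRoof: achievedGflops / attainableGflops
  };
}
```

A kernel whose intensity falls left of the ridge point is limited by memory bandwidth, so reducing memory traffic is usually the first thing to try. To the right of it, compute throughput is the limit.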
Outgoing edges
lib_applies_to_domain (1)
- domain:scientific-computing · Domain · Scientific Computing
lib_belongs_to_specialization (1)
- specialization:gpu-programming · Specialization
lib_implements_workflow (1)
- workflow:performance-profiling-cycle · Workflow · Performance Profiling Cycle
lib_involves_role (2)
- role:computational-scientist · Role · Computational Scientist
- role:ml-engineer · Role · Machine Learning Engineer
lib_requires_skill_area (2)
- skill-area:cuda-kernels · SkillArea · CUDA Kernel Programming
- skill-area:compute-shaders · SkillArea · Compute Shaders
uses_agent (1)
- lib-agent:gpu-programming--gpu-performance-engineer · LibraryAgent · gpu-performance-engineer
Incoming edges
None.