Hardware Performance Counters and Code Performance improvements: Success Stories

Here we collect Success Stories of bottlenecks identified using Performance Hardware counters and solved properly improving the code

to plot the location of the latency critical event I run

 
~/pmu-tools/ocperf.py record -e resource_stalls.any -e rs_events.empty_cycles -e uops_executed.stall_cycles -e branch-misses -e offcore_requests_outstanding.demand_data_rd_ge_6 cmsRun doTkReco.py >
 

-- VincenzoInnocente - 2016-02-02

  • Top functions where occur:
    Screen_Shot_2016-02-02_at_1.24.57_PM.png

* code with the highest number of:
TreeOriginal.png

Edit | Attach | Watch | Print version | History: r2 < r1 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r2 - 2016-02-02 - VincenzoInnocente
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    Main All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright &© 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback