We need elaborate profiling for ABACUS version 2.2.0, please collect as many case as you can! #716
Replies: 9 comments
-
units: second <style> </style>
|
Beta Was this translation helpful? Give feedback.
-
To optimize bottleneck, I suggest we don't need so much samples. The bottlenecks should be consistent across samples, while the algorithm is static. A few profiling cases are sufficient to guide our optimize direction. |
Beta Was this translation helpful? Give feedback.
-
The format of time in running_scf.log is not scientific and only 2 significant digits are reserved, it is better to make it same with that in OUTPUT. |
Beta Was this translation helpful? Give feedback.
-
Some timer of function has error, we should keep repairing them. |
Beta Was this translation helpful? Give feedback.
-
The new target is to elaborate profiling for several cases. |
Beta Was this translation helpful? Give feedback.
-
That's good. Will you provide such cases? @dyzheng |
Beta Was this translation helpful? Give feedback.
-
Of course not, please learn some DFT knowledge when you have time. |
Beta Was this translation helpful? Give feedback.
-
This should be another issue. |
Beta Was this translation helpful? Give feedback.
-
This is another issue. |
Beta Was this translation helpful? Give feedback.
-
Describe Current Status and Possible Solution
We can seperate the time consumed by the program into several parts collected in the end of screem or OUT.suffix/running_scf.txt outputs.
For example:
|CLASS_NAME---------|NAME---------------|TIME(Sec)-----|CALLS----|AVG------|PER%-------
Gint_k vlocal 46.585 10 4.7 33 %
We can focus on PER% > 1% and mark NAME, CALLS, AVG;
another significant feature is cores in this calculation.
Better to note what type of machine and note which compiler.
Beta Was this translation helpful? Give feedback.
All reactions