Where does the code use "immediate eviction" and "fetched from L2 cache"? #8

ziyuhuang123 · 2024-01-24T15:05:14Z

Hi! I find your repo very interesting and gave it a star without hesitation! I have also been learning about the L2 cache recently, so I wonder where the code uses "immediate eviction" and "fetched from L2 cache"? I guess it is related to discard_memory or the L2 persistence API?

Thank you!!

By the way, you mentioned that you use ncu for the analysis; I am also interested in how that is done. Maybe you could publish a top conference paper!

efrantar · 2024-01-25T20:30:20Z

Hi, the L2 cache is used implicitly whenever global memory is fetched; the immediate eviction cache policy for weight loads is defined here. The key is that we want to reuse activations (which we need to load many times) in the L2 cache, but don't care about keeping weights there, as each weight is accessed exactly once.
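For readers following along, here is a minimal sketch of how an evict-first L2 policy can be attached to a global-to-shared load in CUDA via inline PTX. It is an illustration under assumptions, not necessarily the code the reply links to: the helper name `cp_async16_evict_first`, the kernel `stream_weights`, and the sizes are hypothetical, and the `cp.async` and `createpolicy` instructions used here require an Ampere-class GPU (compile with `-arch=sm_80` or newer).

```cuda
#include <cstdint>
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical helper: asynchronously copy 16 bytes from global to shared
// memory, hinting that the L2 line should be evicted first (data used once).
__device__ inline void cp_async16_evict_first(void* smem_ptr, const void* glob_ptr) {
  uint32_t smem = static_cast<uint32_t>(__cvta_generic_to_shared(smem_ptr));
  size_t glob = __cvta_generic_to_global(glob_ptr);
  asm volatile(
      "{\n"
      "  .reg .b64 p;\n"
      "  createpolicy.fractional.L2::evict_first.b64 p, 1.0;\n"
      "  cp.async.cg.shared.global.L2::cache_hint [%0], [%1], 16, p;\n"
      "}\n" ::"r"(smem), "l"(glob) : "memory");
}

// Each thread streams one 16-byte "weight" element through shared memory.
// Launch with at most 128 threads per block (size of the shared tile).
__global__ void stream_weights(const int4* weights, int4* out, int n) {
  __shared__ int4 tile[128];
  int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n) cp_async16_evict_first(&tile[threadIdx.x], &weights[i]);
  asm volatile("cp.async.commit_group;\n" ::: "memory");
  asm volatile("cp.async.wait_group 0;\n" ::: "memory");  // wait for own copies
  __syncthreads();
  if (i < n) out[i] = tile[threadIdx.x];
}

int main() {
  const int n = 1024;  // illustrative size
  std::vector<int4> host(n);
  for (int i = 0; i < n; ++i) host[i] = make_int4(i, i + 1, i + 2, i + 3);

  int4 *weights = nullptr, *out = nullptr;
  cudaMalloc(&weights, n * sizeof(int4));
  cudaMalloc(&out, n * sizeof(int4));
  cudaMemcpy(weights, host.data(), n * sizeof(int4), cudaMemcpyHostToDevice);

  stream_weights<<<(n + 127) / 128, 128>>>(weights, out, n);

  std::vector<int4> result(n);
  cudaMemcpy(result.data(), out, n * sizeof(int4), cudaMemcpyDeviceToHost);

  // Verify that the data round-tripped unchanged.
  bool ok = (cudaGetLastError() == cudaSuccess);
  for (int i = 0; i < n && ok; ++i)
    ok = result[i].x == host[i].x && result[i].y == host[i].y &&
         result[i].z == host[i].z && result[i].w == host[i].w;
  printf("%s\n", ok ? "copy verified" : "mismatch or CUDA error");

  cudaFree(weights);
  cudaFree(out);
  return ok ? 0 : 1;
}
```

The policy is only a hint to the hardware: it does not change results, it only biases which L2 lines are replaced first. Activation loads would use an ordinary load (or ordinary `cp.async`) so that they stay eligible for reuse in L2.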

We are considering a write-up of this work; however, I am currently very busy, so this may take quite a while.

ziyuhuang123 · 2024-01-26T08:02:03Z

I see! Is it possible to make better use of the L2 cache? I know there is an API mentioned here, but I cannot find a way to use it well. I mean, maybe some random accesses will push the useful data out of L2? What do you think? Thanks!
