AI training time is at a point in an exponential where more throughput isn't going to advance functionality much at all. The underlying problem, problem solving by training, is computationally ...
Computer scientists are a demanding bunch. For them, it’s not enough to get the right answer to a problem — the goal, almost always, is to get the answer as efficiently as possible. Take the act of ...
If you start naively without any library that avoids the problem then memory access is the problem. Have a look at how much effort is needed to avoid the problem, for example with blocking algorithms.