Large contexts are very important but they are cheap compared in terms of RAM compared to the costs of increasing parameter count.
Large contexts are very important but they are cheap compared in terms of RAM compared to the costs of increasing parameter count.