1. If you have the full optimized graph in WM, it means all the nodes are already in WM. We have an option called "Mem/InitWMWithAllNodes" to load all nodes in WM independently if memory management is enabled or not. In general, processing time is significantly lower on localization mode, so depending of the map, all nodes can then be used in localization mode while they could not in mapping mode.
2. the error could be larger than 10 meters relatively fast (see some examples in this video
https://www.youtube.com/watch?v=XrnyhaxPCro). We could however estimate a radius based on the accumulated odometry covariance (e.g, similar to EKF SLAM odometry error propagation). One problem is that with memory management, we couldn't correct the accumulated error on loop closures if we don't have all nodes from the beginning of that session (so that resulting pose-graph optimization residual makes sense). The error would then grow out of bound, eventually including the whole map. This would also not work for multi-session mapping, as the original odometry frame will be different between the sessions.