If you are navigating in map frame, the robot can jump. See definition of map frame here
REP-105. What you want to do is to control the robot in odom frame instead, then adjust the goal in odom frame if there is a re-localization.
Visual odometry doesn't do any localization, rtabmap does. You can feed wheel odometry/IMU fusion directly to rtabmap node instead of using visual odometry.
cheers,
Mathieu