You can tune the Kp/NndrRatio parameter (default 0.8) to adjust how many new words are created for each image. Increasing it results in a smaller dictionary, though loop closure detection recall may decrease. See also Figure 11 of that paper.
Decreasing the number of features used for the dictionary (Kp/MaxFeatures, default 500) is another option.
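
To see why both parameters affect dictionary size, here is a minimal Python sketch of a nearest-neighbor distance ratio (NNDR) test. It is an illustration under simplifying assumptions, not RTAB-Map's actual implementation: the function name `count_new_words`, the use of Euclidean distance on small tuples, and the choice to compare only against the existing dictionary are all hypothetical.

```python
import math

def count_new_words(descriptors, dictionary, nndr_ratio=0.8, max_features=500):
    """Count descriptors that would become new words (illustrative sketch only).

    Assumption: a descriptor is matched to an existing word when its nearest
    word is closer than nndr_ratio times the distance to the second nearest.
    Otherwise it is counted as a new word.
    """
    new_words = 0
    # Cap the number of features considered per image (stand-in for Kp/MaxFeatures).
    for desc in descriptors[:max_features]:
        # Distances to the existing words only; new words are not added
        # to the dictionary inside this loop, to keep the sketch simple.
        dists = sorted(math.dist(desc, word) for word in dictionary)
        matched = len(dists) >= 2 and dists[0] <= nndr_ratio * dists[1]
        if not matched:
            new_words += 1
    return new_words
```

With a toy dictionary `[(0.0, 0.0), (10.0, 0.0)]` and descriptors `[(4.0, 0.0), (1.0, 0.0), (4.5, 0.0)]`, a ratio of 0.6 gives 2 new words, 0.8 gives 1, and 0.9 gives 0: a higher ratio accepts more ambiguous matches, so fewer words are added. Lowering `max_features` reduces the count simply because fewer descriptors are examined.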