JWG (4) [Avatar] Offline
#1
The "Note" on p 167 says "The term 'bag' here refers to the fact that we are dealing with a set of tokens rather than
a list or sequence: the tokens have no specific order."

This is inaccurate; a bag is an unordered collection, but it allows duplicates, which sets do not. One might try:

The term 'bag' means a collection that is unordered where duplicates are significant. For example {"the", "cat", "on", "the", "mat"} is the same bag of words as {"cat", "mat", "on", "the", "the"}, but not the same as {"cat", "mat", "on", "the"}