The tutorial needs to update the actual reason about shuffling before sharding. It's not accurate. Shuffling before sharding is required to achieve global shuffling rather than only shuffling inside ...