Simply collecting data is not enough. You can fill spreadsheets with data, but it's useless if you can't act on it. Regression is one of the most powerful statistical tools for finding relationships ...
Distributed statistical learning problems arise commonly when dealing with large datasets. In this setup, datasets are partitioned over machines, which compute locally, and communicate short messages.