- [x] Add license and boilerplate to everything. - [x] Separate the data generation from the training pipeline