Generate multitask tabular data with controllable task correlations, polynomial complexity, and noise.
The generator creates a multitask regression dataset where task weight vectors share a controlled pairwise cosine similarity ρ.
The output CSV contains columns x_0, x_1, …, x_{D-1}, task_0, task_1, …, task_{T-1}, split where split is train or test.
Need more than 200K samples? Use the Google Colab notebook (coming soon).