tirank.Dataloader.generate_val
- tirank.Dataloader.generate_val(savePath, validation_proportion=0.15, mode=None)
Splits the bulk expression and clinical data into training and validation sets.
Loads the full bulk expression and clinical data, combines them, performs a random split, and saves the training and validation sets back to disk in the ‘2_preprocessing/split_data’ directory.
- Parameters:
savePath (str) – The main project directory path.
validation_proportion (float, optional) – The fraction of data to use for the validation set. Defaults to 0.15.
mode (str, optional) – The analysis mode: ‘Cox’, ‘Classification’, or ‘Regression’. This determines how many columns of the clinical data are used. Defaults to None.
- Returns:
None
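To illustrate what a random split with `validation_proportion=0.15` means in practice, here is a minimal standard-library sketch. It is not TiRank's implementation: the helper `split_samples`, its `seed` argument, and the `patient_*` sample IDs are hypothetical names introduced only for this example, and the way TiRank rounds the validation-set size is an assumption.

```python
import random


def split_samples(sample_ids, validation_proportion=0.15, seed=None):
    """Randomly partition sample IDs into (train, validation) lists.

    Hypothetical helper for illustration; not part of TiRank.
    """
    if not 0.0 < validation_proportion < 1.0:
        raise ValueError("validation_proportion must be between 0 and 1")
    ids = list(sample_ids)
    rng = random.Random(seed)  # local RNG so the global random state is untouched
    rng.shuffle(ids)
    # Assumption: the validation size is rounded to the nearest whole sample.
    n_val = round(len(ids) * validation_proportion)
    return ids[n_val:], ids[:n_val]


# Made-up sample IDs standing in for the rows of the combined bulk data.
samples = [f"patient_{i}" for i in range(100)]
train_ids, val_ids = split_samples(samples, validation_proportion=0.15, seed=42)
print(len(train_ids), len(val_ids))
```

With 100 samples and the default proportion, 15 samples go to the validation set and the remaining 85 to the training set; every sample lands in exactly one of the two.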