tirank.Dataloader.generate_val

tirank.Dataloader.generate_val(savePath, validation_proportion=0.15, mode=None)

Splits the bulk expression and clinical data into training and validation sets.

Loads the full bulk expression and clinical data, combines them, splits the samples at random, and saves the resulting training and validation sets to the ‘2_preprocessing/split_data’ directory.

Parameters:
  • savePath (str) – The main project directory path.

  • validation_proportion (float, optional) – The fraction of data to use for the validation set. Defaults to 0.15.

  • mode (str, optional) – The analysis mode: one of ‘Cox’, ‘Classification’, or ‘Regression’. Determines how many columns of the clinical data are used.

Returns:

None
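
Example:

The following is a minimal sketch of the kind of random split described above, not the function's actual implementation. The helper name split_train_val, the seed argument, the inner join on a shared sample index, and the toy column names are all assumptions made for illustration; the real function also reads its inputs from and writes its outputs to disk, which the sketch leaves out.

```python
import numpy as np
import pandas as pd


def split_train_val(expression, clinical, validation_proportion=0.15, seed=0):
    """Hypothetical helper: combine two sample-indexed tables and split rows at random."""
    # Combine expression and clinical data on their shared sample index (assumed layout).
    combined = expression.join(clinical, how="inner")
    # Shuffle the row positions reproducibly.
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(combined))
    # The first n_val shuffled rows form the validation set, the rest the training set.
    n_val = int(round(len(combined) * validation_proportion))
    val = combined.iloc[order[:n_val]]
    train = combined.iloc[order[n_val:]]
    return train, val


# Toy data: 20 samples, 3 genes, and two Cox-style clinical columns (illustrative names).
samples = [f"s{i}" for i in range(20)]
expression = pd.DataFrame(
    np.arange(60, dtype=float).reshape(20, 3),
    index=samples,
    columns=["g1", "g2", "g3"],
)
clinical = pd.DataFrame(
    {"time": np.arange(20, dtype=float), "event": [i % 2 for i in range(20)]},
    index=samples,
)

train, val = split_train_val(expression, clinical, validation_proportion=0.15)
print(train.shape, val.shape)
```

With the documented signature, a call to the function itself would look like tirank.Dataloader.generate_val("path/to/project", validation_proportion=0.2, mode="Cox"), where the project path is a placeholder.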