sakura.utils.data_splitter.DataSplitter
- class sakura.utils.data_splitter.DataSplitter
Bases:
objectClass for creating dataset splits using label-based grouping
Methods
Obtain a label vector containing 1~k for included points, 0 for not included points.
Obtain 2*k split codes based on random K-Fold split, where train or test are labelled as 1 (corresp.).
(to be implemented)
Obtain a split code from label vector, points labelled from 1~k are considered as selected (1), otherwise not selected (0).
Obtain 2 split codes from label vector, points labelled from 1~k are considered as train (1 in first vector), rest of selected (non-zero) cells are test(1 in second vector), unselected points remain unchanged (0 in all vectors).
Obtain cross validation foldings directly from 1~k labels, 0 considered to be not selected