Package weka.filters.supervised.instance
Class StratifiedRemoveFolds
- java.lang.Object
  - weka.filters.Filter
    - weka.filters.supervised.instance.StratifiedRemoveFolds
- All Implemented Interfaces:
java.io.Serializable, CapabilitiesHandler, OptionHandler, RevisionHandler, SupervisedFilter
public class StratifiedRemoveFolds extends Filter implements SupervisedFilter, OptionHandler
This filter takes a dataset and outputs a specified fold for cross-validation. If you do not want the folds to be stratified, use the unsupervised version. Valid options are:
-V Specifies if inverse of selection is to be output.
-N <number of folds> Specifies number of folds dataset is split into. (default 10)
-F <fold> Specifies which fold is selected. (default 1)
-S <seed> Specifies random number seed. (default 0, no randomizing)
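To make the roles of -N, -F and -V concrete, here is a minimal plain-Java sketch of stratified fold selection. It is an illustration under stated assumptions, not Weka's implementation: the class `StratifiedFoldSketch` and its method `selectFold` are hypothetical names, rows are represented only by their class labels, and rows are dealt round-robin per class so that each fold receives a similar class mix. The sketch does not model shuffling or the zero-folds case.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Illustrative only: a simplified stratified fold selector, not Weka's code. */
class StratifiedFoldSketch {

    /**
     * Returns the indices of the rows in the selected fold (1-based, like -F),
     * or the indices of all other rows when invert is true (like -V).
     */
    static List<Integer> selectFold(String[] labels, int numFolds, int fold, boolean invert) {
        if (numFolds < 1 || fold < 1 || fold > numFolds) {
            throw new IllegalArgumentException("need 1 <= fold <= numFolds");
        }
        // Group row indices by class label so each class is dealt out separately.
        Map<String, List<Integer>> byClass = new TreeMap<>();
        for (int i = 0; i < labels.length; i++) {
            byClass.computeIfAbsent(labels[i], k -> new ArrayList<>()).add(i);
        }
        // Deal rows round-robin across folds, continuing the rotation between classes.
        int[] foldOf = new int[labels.length];
        int next = 0;
        for (List<Integer> rows : byClass.values()) {
            for (int row : rows) {
                foldOf[row] = next % numFolds + 1;
                next++;
            }
        }
        // Keep the selected fold, or everything except it when inverted.
        List<Integer> selected = new ArrayList<>();
        for (int i = 0; i < labels.length; i++) {
            boolean inFold = foldOf[i] == fold;
            if (inFold != invert) {
                selected.add(i);
            }
        }
        return selected;
    }

    public static void main(String[] args) {
        String[] labels = {"yes", "no", "yes", "no", "yes", "no", "yes", "no"};
        System.out.println(selectFold(labels, 2, 1, false)); // rows in fold 1
        System.out.println(selectFold(labels, 2, 1, true));  // all rows except fold 1
    }
}
```

In cross-validation terms, the non-inverted selection would typically serve as the test split for a given fold and the inverted selection as the matching training split.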
- Version:
- $Revision: 5542 $
- Author:
- Eibe Frank (eibe@cs.waikato.ac.nz)
- See Also:
- Serialized Form
Constructor Summary
StratifiedRemoveFolds()
-
Method Summary
Modifier and Type, Method - Description
boolean batchFinished() - Signify that this batch of input to the filter is finished.
java.lang.String foldTipText() - Returns the tip text for this property.
Capabilities getCapabilities() - Returns the Capabilities of this filter.
int getFold() - Gets the fold which is selected.
boolean getInvertSelection() - Gets if selection is to be inverted.
int getNumFolds() - Gets the number of folds into which the dataset is split.
java.lang.String[] getOptions() - Gets the current settings of the filter.
java.lang.String getRevision() - Returns the revision string.
long getSeed() - Gets the random number seed used for shuffling the dataset.
java.lang.String globalInfo() - Returns a string describing this filter.
boolean input(Instance instance) - Input an instance for filtering.
java.lang.String invertSelectionTipText() - Returns the tip text for this property.
java.util.Enumeration listOptions() - Gets an enumeration describing the available options.
static void main(java.lang.String[] argv) - Main method for testing this class.
java.lang.String numFoldsTipText() - Returns the tip text for this property.
java.lang.String seedTipText() - Returns the tip text for this property.
void setFold(int fold) - Selects a fold.
boolean setInputFormat(Instances instanceInfo) - Sets the format of the input instances.
void setInvertSelection(boolean inverse) - Sets if selection is to be inverted.
void setNumFolds(int numFolds) - Sets the number of folds the dataset is split into.
void setOptions(java.lang.String[] options) - Parses a given list of options.
void setSeed(long seed) - Sets the random number seed for shuffling the dataset.
Methods inherited from class weka.filters.Filter
batchFilterFile, filterFile, getCapabilities, getOutputFormat, isFirstBatchDone, isNewBatch, isOutputFormatDefined, makeCopies, makeCopy, numPendingOutput, output, outputPeek, toString, useFilter, wekaStaticWrapper
Method Detail
-
listOptions
public java.util.Enumeration listOptions()
Gets an enumeration describing the available options.
- Specified by:
listOptions in interface OptionHandler
- Returns:
- an enumeration of all the available options.
-
setOptions
public void setOptions(java.lang.String[] options) throws java.lang.Exception
Parses a given list of options. Valid options are:
-V Specifies if inverse of selection is to be output.
-N <number of folds> Specifies number of folds dataset is split into. (default 10)
-F <fold> Specifies which fold is selected. (default 1)
-S <seed> Specifies random number seed. (default 0, no randomizing)
- Specified by:
setOptions in interface OptionHandler
- Parameters:
options - the list of options as an array of strings
- Throws:
java.lang.Exception - if an option is not supported
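The option list above maps onto a flat string array such as {"-N", "5", "-F", "2", "-V"}. The following sketch shows one way such an array could be interpreted, using the documented defaults. It is a hand-rolled illustration, not Weka's parser: the class `FoldOptionsSketch`, its fields, and the helper `value` are hypothetical, and rejecting unknown flags with `IllegalArgumentException` is an assumption of this sketch.

```java
/** Illustrative only: a hand-rolled parser for the four documented flags. */
class FoldOptionsSketch {
    boolean invert = false; // -V
    int numFolds = 10;      // -N (documented default 10)
    int fold = 1;           // -F (documented default 1)
    long seed = 0;          // -S (documented default 0)

    /** Parses an option array, falling back to the defaults for absent flags. */
    static FoldOptionsSketch parse(String[] options) {
        FoldOptionsSketch parsed = new FoldOptionsSketch();
        for (int i = 0; i < options.length; i++) {
            switch (options[i]) {
                case "-V":
                    parsed.invert = true;
                    break;
                case "-N":
                    parsed.numFolds = Integer.parseInt(value(options, ++i));
                    break;
                case "-F":
                    parsed.fold = Integer.parseInt(value(options, ++i));
                    break;
                case "-S":
                    parsed.seed = Long.parseLong(value(options, ++i));
                    break;
                default:
                    // Assumption of this sketch: unknown flags are rejected.
                    throw new IllegalArgumentException("Unsupported option: " + options[i]);
            }
        }
        return parsed;
    }

    /** Returns the value following a flag, or fails if the array ends early. */
    private static String value(String[] options, int index) {
        if (index >= options.length) {
            throw new IllegalArgumentException("Missing value after " + options[index - 1]);
        }
        return options[index];
    }

    public static void main(String[] args) {
        FoldOptionsSketch o = parse(new String[] {"-N", "5", "-F", "2", "-S", "42", "-V"});
        System.out.println(o.numFolds + " " + o.fold + " " + o.seed + " " + o.invert);
    }
}
```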
-
getOptions
public java.lang.String[] getOptions()
Gets the current settings of the filter.
- Specified by:
getOptions in interface OptionHandler
- Returns:
- an array of strings suitable for passing to setOptions
-
globalInfo
public java.lang.String globalInfo()
Returns a string describing this filter.
- Returns:
- a description of the filter suitable for displaying in the Explorer/Experimenter GUI
-
invertSelectionTipText
public java.lang.String invertSelectionTipText()
Returns the tip text for this property.
- Returns:
- tip text for this property suitable for displaying in the Explorer/Experimenter GUI
-
getInvertSelection
public boolean getInvertSelection()
Gets whether the selection is to be inverted.
- Returns:
- true if the selection is to be inverted
-
setInvertSelection
public void setInvertSelection(boolean inverse)
Sets whether the selection is to be inverted.
- Parameters:
inverse - true if inversion is to be performed
-
numFoldsTipText
public java.lang.String numFoldsTipText()
Returns the tip text for this property.
- Returns:
- tip text for this property suitable for displaying in the Explorer/Experimenter GUI
-
getNumFolds
public int getNumFolds()
Gets the number of folds into which the dataset is split.
- Returns:
- the number of folds the dataset is to be split into.
-
setNumFolds
public void setNumFolds(int numFolds)
Sets the number of folds the dataset is split into. If the number of folds is zero, the dataset is not split into folds.
- Parameters:
numFolds - number of folds the dataset is to be split into
- Throws:
java.lang.IllegalArgumentException - if the number of folds is negative
-
foldTipText
public java.lang.String foldTipText()
Returns the tip text for this property.
- Returns:
- tip text for this property suitable for displaying in the Explorer/Experimenter GUI
-
getFold
public int getFold()
Gets the fold which is selected.
- Returns:
- the fold which is selected
-
setFold
public void setFold(int fold)
Selects a fold.
- Parameters:
fold - the fold to be selected.
- Throws:
java.lang.IllegalArgumentException - if the fold's index is smaller than 1
-
seedTipText
public java.lang.String seedTipText()
Returns the tip text for this property.
- Returns:
- tip text for this property suitable for displaying in the Explorer/Experimenter GUI
-
getSeed
public long getSeed()
Gets the random number seed used for shuffling the dataset.
- Returns:
- the random number seed
-
setSeed
public void setSeed(long seed)
Sets the random number seed for shuffling the dataset. If seed is negative, shuffling won't be performed.
- Parameters:
seed - the random number seed
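The practical point of a seed is reproducibility: the same seed yields the same shuffle, so the same fold contents can be regenerated later. The sketch below demonstrates that property with `java.util.Random` and `Collections.shuffle`. It is a generic illustration, assuming nothing about how Weka randomizes instances internally; `SeededShuffleSketch` and `shuffled` are hypothetical names.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Random;

/** Illustrative only: shows that a fixed seed makes a shuffle repeatable. */
class SeededShuffleSketch {

    /** Returns a shuffled copy of rows; the input list is left untouched. */
    static List<Integer> shuffled(List<Integer> rows, long seed) {
        List<Integer> copy = new ArrayList<>(rows);
        // A Random built from the same seed produces the same permutation.
        Collections.shuffle(copy, new Random(seed));
        return copy;
    }

    public static void main(String[] args) {
        List<Integer> rows = Arrays.asList(0, 1, 2, 3, 4, 5, 6, 7, 8, 9);
        List<Integer> first = shuffled(rows, 42L);
        List<Integer> second = shuffled(rows, 42L);
        System.out.println("same order for same seed: " + first.equals(second));
    }
}
```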
-
getCapabilities
public Capabilities getCapabilities()
Returns the Capabilities of this filter.
- Specified by:
getCapabilities in interface CapabilitiesHandler
- Overrides:
getCapabilities in class Filter
- Returns:
- the capabilities of this object
- See Also:
Capabilities
-
setInputFormat
public boolean setInputFormat(Instances instanceInfo) throws java.lang.Exception
Sets the format of the input instances.
- Overrides:
setInputFormat in class Filter
- Parameters:
instanceInfo - an Instances object containing the input instance structure (any instances contained in the object are ignored - only the structure is required).
- Returns:
- true because outputFormat can be collected immediately
- Throws:
java.lang.Exception - if the input format can't be set successfully
-
input
public boolean input(Instance instance)
Input an instance for filtering. The filter requires all training instances to be read before producing output.
-
batchFinished
public boolean batchFinished()
Signify that this batch of input to the filter is finished. The output() method may now be called to retrieve the filtered instances.
- Overrides:
batchFinished in class Filter
- Returns:
- true if there are instances pending output
- Throws:
java.lang.IllegalStateException - if no input structure has been defined
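The calling sequence described for input and batchFinished (define the input format, feed every instance, signal the end of the batch, then drain the output) can be sketched with a small stand-in class. This is a plain-Java mock of the protocol only, not the Weka Filter class: `BatchFilterSketch` is a hypothetical name, rows are plain strings, and the "keep every second row" rule is a placeholder for the real fold selection.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;
import java.util.Queue;

/** Illustrative only: mimics a filter that buffers a whole batch before output. */
class BatchFilterSketch {
    private boolean formatDefined = false;
    private final List<String> buffered = new ArrayList<>();
    private final Queue<String> pending = new ArrayDeque<>();

    /** Stands in for defining the input structure; must be called first. */
    boolean setInputFormat() {
        formatDefined = true;
        buffered.clear();
        pending.clear();
        return true;
    }

    /** Buffers a row; returns false because nothing is available for output yet. */
    boolean input(String row) {
        requireFormat();
        buffered.add(row);
        return false;
    }

    /** Ends the batch and reports whether any rows are pending output. */
    boolean batchFinished() {
        requireFormat();
        // Placeholder selection rule: keep every second buffered row.
        for (int i = 0; i < buffered.size(); i += 2) {
            pending.add(buffered.get(i));
        }
        buffered.clear();
        return !pending.isEmpty();
    }

    /** Returns the next pending row, or null when the queue is empty. */
    String output() {
        return pending.poll();
    }

    private void requireFormat() {
        if (!formatDefined) {
            throw new IllegalStateException("No input structure has been defined");
        }
    }

    public static void main(String[] args) {
        BatchFilterSketch filter = new BatchFilterSketch();
        filter.setInputFormat();
        for (String row : new String[] {"a", "b", "c", "d"}) {
            filter.input(row);
        }
        if (filter.batchFinished()) {
            for (String row = filter.output(); row != null; row = filter.output()) {
                System.out.println(row);
            }
        }
    }
}
```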
-
getRevision
public java.lang.String getRevision()
Returns the revision string.
- Specified by:
getRevision in interface RevisionHandler
- Overrides:
getRevision in class Filter
- Returns:
- the revision
-
main
public static void main(java.lang.String[] argv)
Main method for testing this class.
- Parameters:
argv - should contain arguments to the filter: use -h for help
-
-