Package weka.core
Class SparseInstance
java.lang.Object
weka.core.Instance
weka.core.SparseInstance
- All Implemented Interfaces:
Serializable,Copyable,RevisionHandler
- Direct Known Subclasses:
BinarySparseInstance
Class for storing an instance as a sparse vector. A sparse instance
only requires storage for those attribute values that are non-zero.
Since the objective is to reduce storage requirements for datasets
with large numbers of default values, this also includes nominal
attributes -- the first nominal value (i.e. that which has index 0)
will not require explicit storage, so rearrange your nominal attribute
value orderings if necessary. Missing values will be stored
explicitly.
- Version:
- $Revision: 5970 $
- Author:
- Eibe Frank
- See Also:
-
Constructor Summary
ConstructorsConstructorDescriptionSparseInstance(double weight, double[] attValues) Constructor that generates a sparse instance from the given parameters.SparseInstance(double weight, double[] attValues, int[] indices, int maxNumValues) Constructor that inititalizes instance variable with given values.SparseInstance(int numAttributes) Constructor of an instance that sets weight to one, all values to be missing, and the reference to the dataset to null.SparseInstance(Instance instance) Constructor that generates a sparse instance from the given instance.SparseInstance(SparseInstance instance) Constructor that copies the info from the given instance. -
Method Summary
Modifier and TypeMethodDescriptionattributeSparse(int indexOfIndex) Returns the attribute associated with the internal index.copy()Produces a shallow copy of this instance.Returns the revision string.intindex(int position) Returns the index of the attribute stored at the given position.booleanisMissing(int attIndex) Tests if a specific value is "missing".intlocateIndex(int index) Locates the greatest index that is not greater than the given index.static voidMain method for testing this class.mergeInstance(Instance inst) Merges this instance with the given instance and returns the result.intReturns the number of attributes.intReturns the number of values in the sparse vector.voidreplaceMissingValues(double[] array) Replaces all missing values in the instance with the values contained in the given array.voidsetValue(int attIndex, double value) Sets a specific value in the instance to the given value (internal floating-point format).voidsetValueSparse(int indexOfIndex, double value) Sets a specific value in the instance to the given value (internal floating-point format).double[]Returns the values of each attribute as an array of doubles.toString()Returns the description of one instance in sparse format.doublevalue(int attIndex) Returns an instance's attribute value in internal format.Methods inherited from class weka.core.Instance
attribute, classAttribute, classIndex, classIsMissing, classValue, dataset, deleteAttributeAt, enumerateAttributes, equalHeaders, hasMissingValue, insertAttributeAt, isMissing, isMissingSparse, isMissingValue, missingValue, numClasses, relationalValue, relationalValue, setClassMissing, setClassValue, setClassValue, setDataset, setMissing, setMissing, setValue, setValue, setValue, setWeight, stringValue, stringValue, toString, toString, value, valueSparse, weight
-
Constructor Details
-
SparseInstance
Constructor that generates a sparse instance from the given instance. Reference to the dataset is set to null. (ie. the instance doesn't have access to information about the attribute types)- Parameters:
instance- the instance from which the attribute values and the weight are to be copied
-
SparseInstance
Constructor that copies the info from the given instance. Reference to the dataset is set to null. (ie. the instance doesn't have access to information about the attribute types)- Parameters:
instance- the instance from which the attribute info is to be copied
-
SparseInstance
public SparseInstance(double weight, double[] attValues) Constructor that generates a sparse instance from the given parameters. Reference to the dataset is set to null. (ie. the instance doesn't have access to information about the attribute types)- Parameters:
weight- the instance's weightattValues- a vector of attribute values
-
SparseInstance
public SparseInstance(double weight, double[] attValues, int[] indices, int maxNumValues) Constructor that inititalizes instance variable with given values. Reference to the dataset is set to null. (ie. the instance doesn't have access to information about the attribute types) Note that the indices need to be sorted in ascending order. Otherwise things won't work properly.- Parameters:
weight- the instance's weightattValues- a vector of attribute values (just the ones to be stored)indices- the indices of the given values in the full vector (need to be sorted in ascending order)maxNumValues- the maximium number of values that can be stored
-
SparseInstance
public SparseInstance(int numAttributes) Constructor of an instance that sets weight to one, all values to be missing, and the reference to the dataset to null. (ie. the instance doesn't have access to information about the attribute types)- Parameters:
numAttributes- the size of the instance
-
-
Method Details
-
attributeSparse
Returns the attribute associated with the internal index.- Overrides:
attributeSparsein classInstance- Parameters:
indexOfIndex- the index of the attribute's index- Returns:
- the attribute at the given position
- Throws:
UnassignedDatasetException- if instance doesn't have access to a dataset
-
copy
Produces a shallow copy of this instance. The copy has access to the same dataset. (if you want to make a copy that doesn't have access to the dataset, usenew SparseInstance(instance) -
index
public int index(int position) Returns the index of the attribute stored at the given position. -
isMissing
public boolean isMissing(int attIndex) Tests if a specific value is "missing". -
locateIndex
public int locateIndex(int index) Locates the greatest index that is not greater than the given index.- Returns:
- the internal index of the attribute index. Returns -1 if no index with this property could be found
-
mergeInstance
Merges this instance with the given instance and returns the result. Dataset is set to null.- Overrides:
mergeInstancein classInstance- Parameters:
inst- the instance to be merged with this one- Returns:
- the merged instances
-
numAttributes
public int numAttributes()Returns the number of attributes.- Overrides:
numAttributesin classInstance- Returns:
- the number of attributes as an integer
-
numValues
public int numValues()Returns the number of values in the sparse vector. -
replaceMissingValues
public void replaceMissingValues(double[] array) Replaces all missing values in the instance with the values contained in the given array. A deep copy of the vector of attribute values is performed before the values are replaced.- Overrides:
replaceMissingValuesin classInstance- Parameters:
array- containing the means and modes- Throws:
IllegalArgumentException- if numbers of attributes are unequal
-
setValue
public void setValue(int attIndex, double value) Sets a specific value in the instance to the given value (internal floating-point format). Performs a deep copy of the vector of attribute values before the value is set. -
setValueSparse
public void setValueSparse(int indexOfIndex, double value) Sets a specific value in the instance to the given value (internal floating-point format). Performs a deep copy of the vector of attribute values before the value is set.- Overrides:
setValueSparsein classInstance- Parameters:
indexOfIndex- the index of the attribute's indexvalue- the new attribute value (If the corresponding attribute is nominal (or a string) then this is the new value's index as a double).
-
toDoubleArray
public double[] toDoubleArray()Returns the values of each attribute as an array of doubles.- Overrides:
toDoubleArrayin classInstance- Returns:
- an array containing all the instance attribute values
-
toString
Returns the description of one instance in sparse format. If the instance doesn't have access to a dataset, it returns the internal floating-point values. Quotes string values that contain whitespace characters. -
value
public double value(int attIndex) Returns an instance's attribute value in internal format. -
main
Main method for testing this class. -
getRevision
Returns the revision string.- Specified by:
getRevisionin interfaceRevisionHandler- Overrides:
getRevisionin classInstance- Returns:
- the revision
-