Package

org.apache.spark.ml

classification

Permalink

package classification

Visibility
  1. Public
  2. All

Type Members

  1. class BinaryLogisticRegressionSummary extends LogisticRegressionSummary

    Permalink

    :: Experimental :: Binary Logistic regression results for a given model.

    :: Experimental :: Binary Logistic regression results for a given model.

    Annotations
    @Experimental() @Since( "1.5.0" )
  2. class BinaryLogisticRegressionTrainingSummary extends BinaryLogisticRegressionSummary with LogisticRegressionTrainingSummary

    Permalink

    :: Experimental :: Logistic regression training results.

    :: Experimental :: Logistic regression training results.

    Annotations
    @Experimental() @Since( "1.5.0" )
  3. abstract class ClassificationModel[FeaturesType, M <: ClassificationModel[FeaturesType, M]] extends PredictionModel[FeaturesType, M] with ClassifierParams

    Permalink

    :: DeveloperApi ::

    :: DeveloperApi ::

    Model produced by a Classifier. Classes are indexed {0, 1, ..., numClasses - 1}.

    FeaturesType

    Type of input features. E.g., Vector

    M

    Concrete Model type

    Annotations
    @DeveloperApi()
  4. abstract class Classifier[FeaturesType, E <: Classifier[FeaturesType, E, M], M <: ClassificationModel[FeaturesType, M]] extends Predictor[FeaturesType, E, M] with ClassifierParams

    Permalink

    :: DeveloperApi ::

    :: DeveloperApi ::

    Single-label binary or multiclass classification. Classes are indexed {0, 1, ..., numClasses - 1}.

    FeaturesType

    Type of input features. E.g., Vector

    E

    Concrete Estimator type

    M

    Concrete Model type

    Annotations
    @DeveloperApi()
  5. class DecisionTreeClassificationModel extends ProbabilisticClassificationModel[Vector, DecisionTreeClassificationModel] with DecisionTreeModel with DecisionTreeClassifierParams with MLWritable with Serializable

    Permalink

    Decision tree model (http://en.wikipedia.org/wiki/Decision_tree_learning) for classification.

    Decision tree model (http://en.wikipedia.org/wiki/Decision_tree_learning) for classification. It supports both binary and multiclass labels, as well as both continuous and categorical features.

    Annotations
    @Since( "1.4.0" )
  6. class DecisionTreeClassifier extends ProbabilisticClassifier[Vector, DecisionTreeClassifier, DecisionTreeClassificationModel] with DecisionTreeClassifierParams with DefaultParamsWritable

    Permalink

    Decision tree learning algorithm (http://en.wikipedia.org/wiki/Decision_tree_learning) for classification.

    Decision tree learning algorithm (http://en.wikipedia.org/wiki/Decision_tree_learning) for classification. It supports both binary and multiclass labels, as well as both continuous and categorical features.

    Annotations
    @Since( "1.4.0" )
  7. class GBTClassificationModel extends ProbabilisticClassificationModel[Vector, GBTClassificationModel] with GBTClassifierParams with TreeEnsembleModel[DecisionTreeRegressionModel] with MLWritable with Serializable

    Permalink

    Gradient-Boosted Trees (GBTs) (http://en.wikipedia.org/wiki/Gradient_boosting) model for classification.

    Gradient-Boosted Trees (GBTs) (http://en.wikipedia.org/wiki/Gradient_boosting) model for classification. It supports binary labels, as well as both continuous and categorical features.

    Annotations
    @Since( "1.6.0" )
    Note

    Multiclass labels are not currently supported.

  8. class GBTClassifier extends ProbabilisticClassifier[Vector, GBTClassifier, GBTClassificationModel] with GBTClassifierParams with DefaultParamsWritable with Logging

    Permalink

    Gradient-Boosted Trees (GBTs) (http://en.wikipedia.org/wiki/Gradient_boosting) learning algorithm for classification.

    Gradient-Boosted Trees (GBTs) (http://en.wikipedia.org/wiki/Gradient_boosting) learning algorithm for classification. It supports binary labels, as well as both continuous and categorical features.

    The implementation is based upon: J.H. Friedman. "Stochastic Gradient Boosting." 1999.

    Notes on Gradient Boosting vs. TreeBoost:

    • This implementation is for Stochastic Gradient Boosting, not for TreeBoost.
    • Both algorithms learn tree ensembles by minimizing loss functions.
    • TreeBoost (Friedman, 1999) additionally modifies the outputs at tree leaf nodes based on the loss function, whereas the original gradient boosting method does not.
    • We expect to implement TreeBoost in the future: [https://issues.apache.org/jira/browse/SPARK-4240]
    Annotations
    @Since( "1.4.0" )
    Note

    Multiclass labels are not currently supported.

  9. class LinearSVC extends Classifier[Vector, LinearSVC, LinearSVCModel] with LinearSVCParams with DefaultParamsWritable

    Permalink

    :: Experimental ::

    :: Experimental ::

    Linear SVM Classifier

    This binary classifier optimizes the Hinge Loss using the OWLQN optimizer. Only supports L2 regularization currently.

    Annotations
    @Since( "2.2.0" ) @Experimental()
  10. class LinearSVCModel extends ClassificationModel[Vector, LinearSVCModel] with LinearSVCParams with MLWritable

    Permalink

    :: Experimental :: Linear SVM Model trained by LinearSVC

    :: Experimental :: Linear SVM Model trained by LinearSVC

    Annotations
    @Since( "2.2.0" ) @Experimental()
  11. class LogisticRegression extends ProbabilisticClassifier[Vector, LogisticRegression, LogisticRegressionModel] with LogisticRegressionParams with DefaultParamsWritable with Logging

    Permalink

    Logistic regression.

    Logistic regression. Supports:

    • Multinomial logistic (softmax) regression.
    • Binomial logistic regression.

    This class supports fitting traditional logistic regression model by LBFGS/OWLQN and bound (box) constrained logistic regression model by LBFGSB.

    Annotations
    @Since( "1.2.0" )
  12. class LogisticRegressionModel extends ProbabilisticClassificationModel[Vector, LogisticRegressionModel] with LogisticRegressionParams with MLWritable

    Permalink

    Model produced by LogisticRegression.

    Model produced by LogisticRegression.

    Annotations
    @Since( "1.4.0" )
  13. sealed trait LogisticRegressionSummary extends Serializable

    Permalink

    Abstraction for Logistic Regression Results for a given model.

  14. sealed trait LogisticRegressionTrainingSummary extends LogisticRegressionSummary

    Permalink

    Abstraction for multinomial Logistic Regression Training results.

    Abstraction for multinomial Logistic Regression Training results. Currently, the training summary ignores the training weights except for the objective trace.

  15. class MultilayerPerceptronClassificationModel extends PredictionModel[Vector, MultilayerPerceptronClassificationModel] with Serializable with MLWritable

    Permalink

    Classification model based on the Multilayer Perceptron.

    Classification model based on the Multilayer Perceptron. Each layer has sigmoid activation function, output layer has softmax.

    Annotations
    @Since( "1.5.0" )
  16. class MultilayerPerceptronClassifier extends Predictor[Vector, MultilayerPerceptronClassifier, MultilayerPerceptronClassificationModel] with MultilayerPerceptronParams with DefaultParamsWritable

    Permalink

    Classifier trainer based on the Multilayer Perceptron.

    Classifier trainer based on the Multilayer Perceptron. Each layer has sigmoid activation function, output layer has softmax. Number of inputs has to be equal to the size of feature vectors. Number of outputs has to be equal to the total number of labels.

    Annotations
    @Since( "1.5.0" )
  17. class NaiveBayes extends ProbabilisticClassifier[Vector, NaiveBayes, NaiveBayesModel] with NaiveBayesParams with DefaultParamsWritable

    Permalink

    Naive Bayes Classifiers.

    Naive Bayes Classifiers. It supports Multinomial NB (see here) which can handle finitely supported discrete data. For example, by converting documents into TF-IDF vectors, it can be used for document classification. By making every vector a binary (0/1) data, it can also be used as Bernoulli NB (see here). The input feature values must be nonnegative.

    Annotations
    @Since( "1.5.0" )
  18. class NaiveBayesModel extends ProbabilisticClassificationModel[Vector, NaiveBayesModel] with NaiveBayesParams with MLWritable

    Permalink

    Model produced by NaiveBayes

    Model produced by NaiveBayes

    Annotations
    @Since( "1.5.0" )
  19. final class OneVsRest extends Estimator[OneVsRestModel] with OneVsRestParams with MLWritable

    Permalink

    Reduction of Multiclass Classification to Binary Classification.

    Reduction of Multiclass Classification to Binary Classification. Performs reduction using one against all strategy. For a multiclass classification with k classes, train k models (one per class). Each example is scored against all k models and the model with highest score is picked to label the example.

    Annotations
    @Since( "1.4.0" )
  20. final class OneVsRestModel extends Model[OneVsRestModel] with OneVsRestParams with MLWritable

    Permalink

    Model produced by OneVsRest.

    Model produced by OneVsRest. This stores the models resulting from training k binary classifiers: one for each class. Each example is scored against all k models, and the model with the highest score is picked to label the example.

    Annotations
    @Since( "1.4.0" )
  21. abstract class ProbabilisticClassificationModel[FeaturesType, M <: ProbabilisticClassificationModel[FeaturesType, M]] extends ClassificationModel[FeaturesType, M] with ProbabilisticClassifierParams

    Permalink

    :: DeveloperApi ::

    :: DeveloperApi ::

    Model produced by a ProbabilisticClassifier. Classes are indexed {0, 1, ..., numClasses - 1}.

    FeaturesType

    Type of input features. E.g., Vector

    M

    Concrete Model type

    Annotations
    @DeveloperApi()
  22. abstract class ProbabilisticClassifier[FeaturesType, E <: ProbabilisticClassifier[FeaturesType, E, M], M <: ProbabilisticClassificationModel[FeaturesType, M]] extends Classifier[FeaturesType, E, M] with ProbabilisticClassifierParams

    Permalink

    :: DeveloperApi ::

    :: DeveloperApi ::

    Single-label binary or multiclass classifier which can output class conditional probabilities.

    FeaturesType

    Type of input features. E.g., Vector

    E

    Concrete Estimator type

    M

    Concrete Model type

    Annotations
    @DeveloperApi()
  23. class RandomForestClassificationModel extends ProbabilisticClassificationModel[Vector, RandomForestClassificationModel] with RandomForestClassifierParams with TreeEnsembleModel[DecisionTreeClassificationModel] with MLWritable with Serializable

    Permalink

    Random Forest model for classification.

    Random Forest model for classification. It supports both binary and multiclass labels, as well as both continuous and categorical features.

    Annotations
    @Since( "1.4.0" )
  24. class RandomForestClassifier extends ProbabilisticClassifier[Vector, RandomForestClassifier, RandomForestClassificationModel] with RandomForestClassifierParams with DefaultParamsWritable

    Permalink

    Random Forest learning algorithm for classification.

    Random Forest learning algorithm for classification. It supports both binary and multiclass labels, as well as both continuous and categorical features.

    Annotations
    @Since( "1.4.0" )

Value Members

  1. object DecisionTreeClassificationModel extends MLReadable[DecisionTreeClassificationModel] with Serializable

    Permalink
    Annotations
    @Since( "2.0.0" )
  2. object DecisionTreeClassifier extends DefaultParamsReadable[DecisionTreeClassifier] with Serializable

    Permalink
    Annotations
    @Since( "1.4.0" )
  3. object GBTClassificationModel extends MLReadable[GBTClassificationModel] with Serializable

    Permalink
    Annotations
    @Since( "2.0.0" )
  4. object GBTClassifier extends DefaultParamsReadable[GBTClassifier] with Serializable

    Permalink
    Annotations
    @Since( "1.4.0" )
  5. object LinearSVC extends DefaultParamsReadable[LinearSVC] with Serializable

    Permalink
    Annotations
    @Since( "2.2.0" )
  6. object LinearSVCModel extends MLReadable[LinearSVCModel] with Serializable

    Permalink
    Annotations
    @Since( "2.2.0" )
  7. object LogisticRegression extends DefaultParamsReadable[LogisticRegression] with Serializable

    Permalink
    Annotations
    @Since( "1.6.0" )
  8. object LogisticRegressionModel extends MLReadable[LogisticRegressionModel] with Serializable

    Permalink
    Annotations
    @Since( "1.6.0" )
  9. object MultilayerPerceptronClassificationModel extends MLReadable[MultilayerPerceptronClassificationModel] with Serializable

    Permalink
    Annotations
    @Since( "2.0.0" )
  10. object MultilayerPerceptronClassifier extends DefaultParamsReadable[MultilayerPerceptronClassifier] with Serializable

    Permalink
    Annotations
    @Since( "2.0.0" )
  11. object NaiveBayes extends DefaultParamsReadable[NaiveBayes] with Serializable

    Permalink
    Annotations
    @Since( "1.6.0" )
  12. object NaiveBayesModel extends MLReadable[NaiveBayesModel] with Serializable

    Permalink
    Annotations
    @Since( "1.6.0" )
  13. object OneVsRest extends MLReadable[OneVsRest] with Serializable

    Permalink
    Annotations
    @Since( "2.0.0" )
  14. object OneVsRestModel extends MLReadable[OneVsRestModel] with Serializable

    Permalink
    Annotations
    @Since( "2.0.0" )
  15. object RandomForestClassificationModel extends MLReadable[RandomForestClassificationModel] with Serializable

    Permalink
    Annotations
    @Since( "2.0.0" )
  16. object RandomForestClassifier extends DefaultParamsReadable[RandomForestClassifier] with Serializable

    Permalink
    Annotations
    @Since( "1.4.0" )

Members