QuickstartΒΆ

Here is a quick run-down of how you accomplish common tasks.

Load some a FeatureSet from a file:

from skll import Reader

example_reader = Reader.for_path('myexamples.megam')
train_examples = example_reader.read()

Train a linear svm (assuming we have train_examples):

from skll import Learner

learner = Learner('LinearSVC')
learner.train(train_examples)

Evaluate a trained model:

test_examples = Reader.for_path('test.tsv').read()
conf_matrix, accuracy, prf_dict, model_params, obj_score = learner.evaluate(test_examples)

Perform ten-fold cross-validation with a radial SVM:

learner = Learner('SVC')
fold_result_list, grid_search_scores = learner.cross-validate(train_examples)

fold_result_list in this case is a list of the results returned by learner.evaluate for each fold, and grid_search_scores is the highest objective function value achieved when tuning the model.

Generate predictions from a trained model:

predictions = learner.predict(test_examples)