QuickstartΒΆ
Here is a quick run-down of how you accomplish common tasks.
Load some a FeatureSet from a file:
from skll import Reader
example_reader = Reader.for_path('myexamples.megam')
train_examples = example_reader.read()
Train a linear svm (assuming we have train_examples):
from skll import Learner
learner = Learner('LinearSVC')
learner.train(train_examples)
Evaluate a trained model:
test_examples = Reader.for_path('test.tsv').read()
conf_matrix, accuracy, prf_dict, model_params, obj_score = learner.evaluate(test_examples)
Perform ten-fold cross-validation with a radial SVM:
learner = Learner('SVC')
fold_result_list, grid_search_scores = learner.cross-validate(train_examples)
fold_result_list
in this case is a list of the results returned by
learner.evaluate
for each fold, and grid_search_scores
is the highest
objective function value achieved when tuning the model.
Generate predictions from a trained model:
predictions = learner.predict(test_examples)