In this paper we present TANC, i.e., a tree-augmented naive credal classifier based on imprecise probabilities; it models prior near-ignorance via the Extreme Imprecise Dirichlet Model (EDM) (Cano et al., 2007) and deals conservatively with missing data in the training set, without assuming them to be missing-at-random. The EDM is an approximation of the global Imprecise Dirichlet Model (IDM), which considerably simplifies the computation of upper and lower probabilities; yet, having been only recently introduced, the quality of the provided approximation needs still to be verified. As first contribution, we extensively compare the output of the naive credal classifier (one of the few cases in which the global IDM can be exactly implemented) when learned with the EDM and the global IDM; the output of the classifier appears to be identical in the vast majority of cases, thus supporting the adoption of the EDM in real classification problems. Then, by experiments we show that TANC is more reliable than the precise TAN (learned with uniform prior), and also that it provides better performance compared to a previous (Zaffalon, 2003} TAN model based on imprecise probabilities. TANC treats missing data by considering all possible completions of the training set, but avoiding an exponential increase of the computational times; eventually, we present some preliminary results with missing data.
Keywords. Imprecise Dirichlet Model, Extreme Imprecise Dirichlet Model, Classification, TANC, Credal dominance.
Paper Download
The paper is availabe in the following formats:
Plenary talk : Press here to get the file of the presentation.
Poster : Press here to get the file of the poster.
Authors addresses:
Giorgio Corani
IDSIA
CH-6928 Manno
Lugano
Switzerland
Cassio Campos
IDSIA
Galleria 2
6928 Manno-Lugano
Switzerland
Sun Yi
IDSIA
Manno
Switzerland
E-mail addresses:
Giorgio Corani | giorgio@idsia.ch |
Cassio Campos | cassio@idsia.ch |
Sun Yi | yi@idsia.ch |