PRDisData Contents

PRDisData User Guide

protein

PROTEIN

Dissimilarity dataset.

    D = PROTEIN

The protein data are provided as a 213x213 dissimilarity matrix that compares protein sequences based on the concept of an evolutionary distance. It was used for classification in [Graepel] and for clustering in [Denoeux and Masson]. The objects belong to four classes of globins: heterogeneous globin (G), hemoglobin-A (HA), hemoglobin-B (HB) and myoglobin (M).
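As a rough illustration of classification on a dissimilarity matrix, the sketch below estimates a leave-one-out nearest-neighbour error in plain Matlab. The 4x4 matrix and its labels are made-up toy values, not the protein data, and the variable names (d, nlab, nn, err) are chosen here for illustration only.

```matlab
% Toy symmetric dissimilarity matrix for 4 objects (hypothetical values,
% standing in for the 213x213 protein matrix).
d = [0 1 4 5;
     1 0 4 6;
     4 4 0 2;
     5 6 2 0];
nlab = [1; 1; 2; 2];            % numeric class labels, one per object

n = size(d, 1);
d(1:n+1:end) = inf;             % exclude each object from its own neighbour search
[~, nn] = min(d, [], 2);        % index of the nearest other object in each row
err = mean(nlab(nn) ~= nlab);   % fraction of objects whose nearest neighbour has another label
fprintf('LOO 1-NN error: %.3f\n', err);
```

With PRTools and PRDisData on the path, the same lines could presumably be applied to the real data by replacing the toy values with D = protein; d = +D; nlab = getnlab(D); where the unary plus and getnlab are assumed here to return the data matrix and the numeric labels of the dataset.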

Reference(s)

T. Graepel, R. Herbrich, P. Bollmann-Sdorra, K. Obermayer, Classification on pairwise proximity data. In Advances in Neural Information Processing Systems, vol. 11, 438-444, 1999.

T. Denoeux and M.-H. Masson, EVCLUS: Evidential clustering of proximity data. IEEE Transactions on Systems, Man and Cybernetics, vol. 34, 95-109, 2004.

See also

prtools, datasets, prdisdata

This file has been automatically generated. If it is hard to read, use the help command in Matlab.