Article information

2015 , Volume 20, ¹ 6, p.36-45

Ignatjev N.A.

Cluster analysis and choice of standard objects in supervised pattern recognition problems

Purpose. Search for solutions of the following problems: - to find the minimum cover of a training set E0={S1,…,Sm} by standard objects. The set E0 is divided into l(l≥2) of disjoint subsets (classes) K1,…,Kl. The objects are described by a set of features X(n)=(x1,…,xn);- a reduction of the dimension of the features space by constructing groups from the disjoint sets of features X(k1 ),…,X(kp ),k1+⋯+kp≤n and nonlinear mapping of their values on the real axis in the description of objects. Methodology. The method of partitioning the training set E0 into disjoint groups of objects using the property of their connectivity through the subset of boundary objects (shell) of classes L(E0,ρ) on a metric ρ(x,y) was developed. The set G=Sν1 ,…,Sνc, c≥2, G⊂Kt, v<|Kt| presents an area (group) with the constrained objects in the class Kt, if for any Sνi ,Sνj∈G the pathway Sνi↔Sνk↔⋯↔Sνj will exist. The standard objects of minimum covering of training set E0 are selected for each group G separately. The rule of hierarchical clustering of features offers for the nonlinear mapping of their values on the real axis. The use of this rule allows: - to form a new space from latent features; - to produce the ordered selection of informative features. Findings. It is proved, that the number of standard objects for covering which ensures the correct recognition on precedents in the training set, does not increase monotonously when the number of features with low value of information are reduced. Originality/value. Complex use of methods of cluster analysis for both group objects and features allows to reduce the volume of training sets and to increase the stability of the internally defined logical regularities.

[full text]
Keywords: pattern recognition, logical regularity, cluster analysis of data, shell of classes, standard objects

Author(s):
Ignatjev NikolayA.
Dr. , Associate Professor
Position: Professor
Office: National University of Uzbekistan
Address: 100022, Uzbekistan, Tashkent
Phone Office: (99871) 246-16-27
E-mail: n_ignatev@rambler.ru

References:
[1] Subbotin, S. A. The complex of characteristic and criteria of comparison of training. Matematichni mashini i sistemi. 2010; (1):25–39. (In Russ.)

[2] Zagoruiko, N.G., Kutnenko, O.A., Zyryanov, A.O., Levanov, D.A. Learning to recognition without overfitting. Journal of Machine Learning and Data Analysis. 2014; 1(7):891–901. (in Russ.)

[3] Dyuk, V.A. Creation of knowledge in artificial intelligence systems: geometric approach. Vestnik Akademicheskogo Tekhnicheskogo Tvorchestva. SPb. 1996; (2):46–67. (In Russ.)

[4] Berestneva, O. G., Muratova, E. A., Yankovskaya, A. E. Analysis of the structure of multidimensional data by the local geometry. Bulletin of the Tomsk Polytechnic University. 2003; 306(3):19–23. (In Russ.)

[5] Ignat’ev, N. A. gnat'ev N. A. Obobshchennye otsenki i lokal'nye metriki ob"ektov v intellektual'nom analize dannykh [The generalized estimations and local metrics of objects in data mining]. Tashkent: Universitet; 2014: 72. (In Russ.)

[6] Ignat’ev, N. A., Adilova, F. T., Matlatipov, G. R., Chernysh, P.P. Knowledge discovering from clinical data based on classification tasks solving. Medinfo. Amsterdam: Ios press; 2001:1354–1358.


Bibliography link:
Ignatjev N.A. Cluster analysis and choice of standard objects in supervised pattern recognition problems // Computational technologies. 2015. V. 20. ¹ 6. P. 36-45
Home| Scope| Editorial Board| Content| Search| Subscription| Rules| Contacts
ISSN 1560-7534
© 2024 FRC ICT