Aspects of multi-class nearest hypersphere classification

dc.contributor.advisorLamont, Morné Michael Connellen_ZA
dc.contributor.authorCoetzer, Francesen_ZA
dc.contributor.otherStellenbosch University. Faculty of Economic and Management Sciences. Dept. of Statistics and Actuarial Science.en_ZA
dc.date.accessioned2017-11-14T10:21:32Z
dc.date.accessioned2017-12-11T10:38:27Z
dc.date.available2017-11-14T10:21:32Z
dc.date.available2017-12-11T10:38:27Z
dc.date.issued2017-12
dc.descriptionThesis (MCom)--Stellenbosch University, 2017.en_ZA
dc.description.abstractENGLISH SUMMARY : Using hyperspheres in the analysis of multivariate data is not a common practice in Statistics. However, hyperspheres have some interesting properties which are useful for data analysis in the following areas: domain description (finding a support region), detecting outliers (novelty detection) and the classification of objects into known classes. This thesis demonstrates how a hypersphere is fitted around a single dataset to obtain a support region and an outlier detector. The all-enclosing and 𝜐-soft hyperspheres are derived. The hyperspheres are then extended to multi-class classification, which is called nearest hypersphere classification (NHC). Different aspects of multi-class NHC are investigated. To study the classification performance of NHC we compared it to three other classification techniques. These techniques are support vector machine classification, random forests and penalised linear discriminant analysis. Using NHC requires choosing a kernel function and in this thesis, the Gaussian kernel will be used. NHC also depends on selecting an appropriate kernel hyper-parameter 𝛾 and a tuning parameter 𝐶. The behaviour of the error rate and the fraction of support vectors for different values of 𝛾 and 𝐶 will be investigated. Two methods will be investigated to obtain the optimal 𝛾 value for NHC. The first method uses a differential evolution procedure to find this value. The R function DEoptim() is used to execute this. The second method uses the R function sigest(). The first method is dependent on the classification technique and the second method is executed independently of the classification technique.en_ZA
dc.description.abstractAFRIKAANSE OPSOMMING : Geen opsomming beskikbaar.af_ZA
dc.format.extentxi, 116 pages ; illustrations, includes annexures
dc.identifier.urihttp://hdl.handle.net/10019.1/102662
dc.language.isoen_ZAen_ZA
dc.publisherStellenbosch : Stellenbosch University
dc.rights.holderStellenbosch University
dc.subjectMultivariate analysisen_ZA
dc.subjectDiscriminant analysisen_ZA
dc.subjectNearest neighbor analysis (Statistics)en_ZA
dc.subjectStatistical classificationen_ZA
dc.subjectSupport vector machines -- Classificationen_ZA
dc.subjectMulticlass classificationen_ZA
dc.subjectKernel functions
dc.subjectUCTDen_ZA
dc.titleAspects of multi-class nearest hypersphere classificationen_ZA
dc.typeThesisen_ZA
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
coetzer_aspects_2017.pdf
Size:
2.53 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: