N-gram representations for comment filtering

dc.contributor.author: Brand, Dirk (en_ZA)
dc.contributor.author: Kroon, Steve (en_ZA)
dc.contributor.author: Van der Merwe, Brink (en_ZA)
dc.contributor.author: Cleophas, Loek (en_ZA)
dc.date.accessioned: 2016-02-01T10:38:45Z
dc.date.available: 2016-02-01T10:38:45Z
dc.date.issued: 2015-09
dc.description: CITATION: Brand, D., Kroon, S., Van der Merwe, B. & Cleophas, L. 2015. N-Gram Representations For Comment Filtering in Proceeding SAICSIT '15. Proceedings of the 2015 Annual Research Conference on South African Institute of Computer Scientists and Information Technologists, Article No. 6. STIAS, Wallenberg Centre, Stellenbosch, South Africa. 28-30 September 2015. doi:10.1145/2815782.2815789. (en_ZA)
dc.description: The original publication is available at http://dl.acm.org/authorize.cfm?key=N08849 (en_ZA)
dc.description: SAICSIT '15. Proceedings of the 2015 Annual Research Conference on South African Institute of Computer Scientists and Information Technologists, Article No. 6. September 2015. (en_ZA)
dc.description.abstract: Accurate classifiers for short texts are valuable assets in many applications. Especially in online communities, where users contribute to content in the form of posts and comments, an effective way of automatically categorising posts proves highly valuable. This paper investigates the use of N-grams as features for short text classification, and compares it to manual feature design techniques that have been popular in this domain. We find that the N-gram representations greatly outperform manual feature extraction techniques. (en_ZA)
dc.description.version: Publishers version (en_ZA)
dc.identifier.citation: Brand, D., Kroon, S., Van der Merwe, B. & Cleophas, L. 2015. N-Gram Representations For Comment Filtering in Proceeding SAICSIT '15. Proceedings of the 2015 Annual Research Conference on South African Institute of Computer Scientists and Information Technologists, Article No. 6. STIAS, Wallenberg Centre, Stellenbosch, South Africa. 28-30 September 2015. doi:10.1145/2815782.2815789. (en_ZA)
dc.identifier.isbn: 978-1-4503-3683-3 (en_ZA)
dc.identifier.other: doi:10.1145/2815782.2815789 (en_ZA)
dc.identifier.uri: http://hdl.handle.net/10019.1/98228
dc.language.iso: en_ZA (en_ZA)
dc.publisher: ACM, Inc. (en_ZA)
dc.rights.holder: Authors retain copyright (en_ZA)
dc.subject: N-gram models (en_ZA)
dc.subject: Computational linguistics (en_ZA)
dc.subject: Texts -- Electronic analysis (en_ZA)
dc.subject: Online texts -- Classification (en_ZA)
dc.subject: Information filtering systems (en_ZA)
dc.subject: Vector spaces (en_ZA)
dc.subject: Text mining (en_ZA)
dc.title: N-gram representations for comment filtering (en_ZA)
dc.type: Conference Paper (en_ZA)
Files
Original bundle
Name: kroon_ngram_2015.pdf
Size: 157.79 KB
Format: Adobe Portable Document Format
Description: Download article