Téléchargement | - Voir le manuscrit accepté : Feature space selection and combination for native language identification (PDF, 513 Kio)
|
---|
Auteur | Rechercher : Goutte, Cyril1; Rechercher : Léger, Serge1; Rechercher : Carpuat, Marine1 |
---|
Affiliation | - Conseil national de recherches du Canada. Technologies de l'information et des communications
|
---|
Format | Texte, Article |
---|
Conférence | 8th Workshop on Innovative Use of NLP for Building Educational Applications (BEA8), June 13, 2013, Atlanta, GA |
---|
Résumé | We decribe the submissions made by the National Research Council Canada to the Native Language Identification (NLI) shared task. Our submissions rely on a Support Vector Machine classifier, various feature spaces using a variety of lexical, spelling, and syntactic features, and on a simple model combination strategy relying on a majority vote between classifiers. Somewhat surprisingly, a classifier relying on purely lexical features performed very well and proved difficult to outperform significantly using various combinations of feature spaces. However, the combination of multiple predictors allowed to exploit their different strengths and provided a significant boost in performance. |
---|
Date de publication | 2013-08-01 |
---|
Dans | |
---|
Langue | anglais |
---|
Publications évaluées par des pairs | Oui |
---|
Numéro NPARC | 21270977 |
---|
Exporter la notice | Exporter en format RIS |
---|
Signaler une correction | Signaler une correction (s'ouvre dans un nouvel onglet) |
---|
Identificateur de l’enregistrement | 68cbd15c-c2f6-45b1-8017-ded569f2e8e5 |
---|
Enregistrement créé | 2014-02-20 |
---|
Enregistrement modifié | 2020-06-04 |
---|