[PDF][PDF] Automatic text categorization of mathematical word problems
Twenty-Second International FLAIRS Conference, 2009•cdn.aaai.org
This paper describes a novel application of text categorization for mathematical word
problems, namely Multiplicative Compare and Equal Group problems. The empirical results
and analysis show that common text processing techniques such as stopword removal and
stemming should be selectively used. It is highly beneficial not to remove stopwords and not
to do stemming. Part of speech tagging should also be used to distinguish words in
discriminative parts of speech from the nonYdiscriminative parts of speech which not only …
problems, namely Multiplicative Compare and Equal Group problems. The empirical results
and analysis show that common text processing techniques such as stopword removal and
stemming should be selectively used. It is highly beneficial not to remove stopwords and not
to do stemming. Part of speech tagging should also be used to distinguish words in
discriminative parts of speech from the nonYdiscriminative parts of speech which not only …
Abstract
This paper describes a novel application of text categorization for mathematical word problems, namely Multiplicative Compare and Equal Group problems. The empirical results and analysis show that common text processing techniques such as stopword removal and stemming should be selectively used. It is highly beneficial not to remove stopwords and not to do stemming. Part of speech tagging should also be used to distinguish words in discriminative parts of speech from the nonYdiscriminative parts of speech which not only fail to help but even mislead the categorization decision for mathematical word problems. An SVM classifier with these selectively used text processing techniques outperforms an SVM classifier with a default setting of text processing techniques (ie stopword removal and stemming). Furthermore, a probabilistic meta classifier is proposed to combine the weighted results of two SVM classifiers with different word problem representations generated by different text preprocessing techniques. The empirical results show that the probabilistic meta classifier further improves the categorization accuracy.
cdn.aaai.org
以上显示的是最相近的搜索结果。 查看全部搜索结果