UNIFORM: Automatic Alignment of Open Learning Datasets

L Cagliero, L Canale, L Farinetti - 2020 IEEE 44th Annual Computers, Software, and Applications …, 2020 - ieeexplore.ieee.org
Learning Analytics (LA) aims to support the understanding of learning mechanisms and their effects by means of data-driven strategies. LA approaches commonly face two major challenges. First, for privacy reasons, most of the analyzed data are not in the public domain. Second, the open data collections, which come from diverse learning contexts, are quite heterogeneous. As a result, research findings are not easily reproducible, and the publicly available datasets are often too small to enable further data analytics. To overcome these issues, there is an increasing need to integrate open learning data into unified models. This paper proposes UNIFORM, an open relational database integrating various learning data sources. It also presents a machine-learning-supported approach that automatically extends the integrated dataset as soon as new data sources become available. The proposed approach uses a classifier to predict attribute alignments based on the correlations among the corresponding textual attribute descriptions. The integration phase has reached a promising quality level on most of the analyzed benchmark datasets. Furthermore, the usability of the UNIFORM data model has been demonstrated in a real case study, where the integrated data were used to support the prediction of learners' outcomes. The F1-score achieved on the integrated data is approximately 30% higher than that obtained on the original data.
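
The idea of aligning attributes through their textual descriptions can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not specify the classifier or its features, so a simple Jaccard token overlap with a fixed threshold stands in for the trained classifier here. The schema, attribute names, descriptions, and the `threshold` value are all hypothetical.

```python
import re


def tokens(text):
    """Return the set of lowercase word tokens in an attribute description."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Jaccard overlap between two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def predict_alignment(new_desc, schema, threshold=0.3):
    """Pick the schema attribute whose description best overlaps new_desc.

    Returns (attribute_name, score), or (None, score) when no attribute
    reaches the threshold. The threshold rule is an assumed stand-in for
    a trained classifier.
    """
    new_tokens = tokens(new_desc)
    best_name, best_score = None, 0.0
    for name, desc in schema.items():
        score = jaccard(new_tokens, tokens(desc))
        if score > best_score:
            best_name, best_score = name, score
    if best_score >= threshold:
        return best_name, best_score
    return None, best_score


# Hypothetical attributes of the integrated schema and their descriptions.
schema = {
    "student_id": "unique identifier of the student",
    "final_grade": "final grade obtained by the student in the course",
    "enrollment_date": "date on which the student enrolled in the course",
}

# Description of an attribute from a hypothetical new data source.
match, score = predict_alignment(
    "grade the student obtained at the end of the course", schema
)
print(match, round(score, 3))
```

In a setting closer to the one the abstract describes, similarity scores of this kind would more plausibly serve as input features to a classifier trained on known alignments than as a direct decision rule.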