Abstract
While games are a popular social media for children, there is a real risk that these children are exposed to potential sexual assault. A number of studies have already addressed this issue, however, the data used in previous research did not properly represent the real chats found in multiplayer online games. To address this issue, we obtained real chat data from MovieStarPlanet, a massively multiplayer online game for children. The research described in this paper aimed to detect predatory behaviors in the chats using machine learning methods. In order to achieve a high accuracy on this task, extensive preprocessing was necessary. We describe three different strategies for data selection and preprocessing, and extensively compare the performance of different learning algorithms on the different data sets and features.
Original language | English (US) |
---|---|
Article number | 7091007 |
Pages (from-to) | 220-232 |
Number of pages | 13 |
Journal | IEEE Transactions on Computational Intelligence and AI in Games |
Volume | 7 |
Issue number | 3 |
DOIs | |
State | Published - Sep 1 2015 |
Keywords
- Chat
- data mining
- game data
- natural language processing (NLP)
- preprocessing
- sexual predator
- text classification
ASJC Scopus subject areas
- Software
- Control and Systems Engineering
- Artificial Intelligence
- Electrical and Electronic Engineering