Scientific Research Project 1
RESEARCH ARTICLE
Automatic detection of cyberbullying in social
media text
Cynthia Van HeeID 1☯*, Gilles Jacobs1☯, Chris Emmery2, Bart Desmet1, Els Lefever1,
Ben Verhoeven 2 , Guy De Pauw
2 , Walter Daelemans
2 , Véronique Hoste
1
1 Department of Translation, Interpreting and Communication - Faculty of Arts and Philosophy, Ghent
University, Ghent, Belgium, 2 Department of Linguistics - Faculty of Arts, University of Antwerp, Antwerp,
Belgium
☯ These authors contributed equally to this work. * [email protected]
Abstract
While social media offer great communication opportunities, they also increase the vulnera-
bility of young people to threatening situations online. Recent studies report that cyberbully-
ing constitutes a growing problem among youngsters. Successful prevention depends on
the adequate detection of potentially harmful messages and the information overload on the
Web requires intelligent systems to identify potential risks automatically. The focus of this
paper is on automatic cyberbullying detection in social media text by modelling posts written
by bullies, victims, and bystanders of online bullying. We describe the collection and fine-
grained annotation of a cyberbullying corpus for English and Dutch and perform a series of
binary classification experiments to determine the feasibility of automatic cyberbullying
detection. We make use of linear support vector machines exploiting a rich feature set and
investigate which information sources contribute the most for the task. Experiments on a
hold-out test set reveal promising results for the detection of cyberbullying-related posts.
After optimisation of the hyperparameters, the classifier yields an F1 score of 64% and 61%
for English and Dutch respectively, and considerably outperforms baseline systems.
Introduction
Web 2.0 has had a substantial impact on communication and relationships in today’s society.
Children and teenagers go online more frequently, at younger ages, and in more diverse ways
(e.g. smartphones, laptops and tablets). Although most of teenagers’ Internet use is harmless
and the benefits of digital communication are evident, the freedom and anonymity experi-
enced online makes young people vulnerable with cyberbullying being one of the major threats
[1, 2].
Bullying is not a new phenomenon and cyberbullying has manifested itself as soon as digital
technologies have become primary communication tools. On the positive side, social media
like blogs, social networking sites (e.g. Facebook), and instant messaging platforms (e.g. What-
sApp) make it possible to communicate with anyone and at any time. Moreover, they are a
place where people engage in social interaction, offering the possibility to establish new
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 1 / 22
a1111111111
a1111111111
a1111111111
a1111111111
a1111111111
OPEN ACCESS
Citation: Van Hee C, Jacobs G, Emmery C, Desmet
B, Lefever E, Verhoeven B, et al. (2018) Automatic
detection of cyberbullying in social media text.
PLoS ONE 13(10): e0203794. https://doi.org/
10.1371/journal.pone.0203794
Editor: Hussein Suleman, University of Cape Town,
SOUTH AFRICA
Received: February 6, 2017
Accepted: August 28, 2018
Published: October 8, 2018
Copyright: © 2018 Van Hee et al. This is an open access article distributed under the terms of the
Creative Commons Attribution License, which
permits unrestricted use, distribution, and
reproduction in any medium, provided the original
author and source are credited.
Data Availability Statement: Because the actual
posts in our corpus could contain names or other
identifying information, we cannot share them
publicly in a repository. They can, however be
obtained upon request, for academic purposes
solely and via [email protected] or cynthia.
[email protected]. The replication data are
available through the Open Science Framework
repository https://osf.io/rgqw8/ with DOI 10.17605/
OSF.IO/RGQW8. This replication dataset allows
interested researchers to download 1) the feature
vectors of the corpus underlying the experiments
described in this paper, 2) the indices
relationships and maintain existing friendships [3, 4]. On the negative side however, social
media increase the risk of children being confronted with threatening situations including
grooming or sexually transgressive behaviour, signals of depression and suicidal thoughts, and
cyberbullying. Users are reachable 24/7 and are often able to remain anonymous if desired:
this makes social media a convenient way for bullies to target their victims outside the school
yard.
With regard to cyberbullying, a number of national and international initiatives have been
launched over the past few years to increase children’s online safety. Examples include KiVa (http://www.kivaprogram.net/), a Finnish cyberbullying prevention programme, the ‘Non au harcèlement’ campaign in France, Belgian governmental initiatives and helplines (e.g. clicksafe. be, veiligonline.be, mediawijs.be) that provide information about online safety, and so on.
In spite of these efforts, a lot of undesirable and hurtful content remains online. [2] analysed
a body of quantitative research on cyberbullying and observed cybervictimisation rates among
teenagers between 20% and 40%. [5] focused on 12 to 17 year olds living in the United States
and found that no less than 72% of them had encountered cyberbullying at least once within
the year preceding the questionnaire. [6] surveyed 9 to 26 year olds in the United States, Can-
ada, the United Kingdom and Australia, and found that 29% of the respondents had ever been
victimised online. A study among 2,000 Flemish secondary school students (age 12 to 18)
revealed that 11% of them had been bullied online at least once in the six months preceding
the survey [7]. Finally, the 2014 large-scale EU Kids Online Report [8] published that 20% of
11 to 16 year olds had been exposed to hate messages online. In addition, youngsters were 12%
more likely to be exposed to cyberbullying as compared to 2010, which clearly demonstrates
that cyberbullying is a growing problem.
The prevalence of cybervictimisation depends on the conceptualisation used in describing
cyberbullying, but also on research variables such as location and the number and age span of
the participants. Nevertheless, the above studies demonstrate that online platforms are increas-
ingly used for bullying, which is a cause for concern given its impact. As shown by [9–11],
cyberbullying may negatively impact the victim’s self-esteem, academic achievement and emo-
tional well-being. [12] found that self-reported effects of cyberbullying include negative effects
on school grades and feelings of sadness, anger, fear, and depression. In extreme cases, cyber-
bullying could even lead to self-harm and suicidal thoughts.
These findings demonstrate that cyberbullying is a serious problem the consequences of
which can be dramatic. Early detection of cyberbullying attempts is therefore of key impor-
tance to youngsters’ mental well-being. Successful detection depends on effective monitoring
of online content, but the amount of information on the Web makes it practically unfeasible
for moderators to monitor all user-generated content manually. To tackle this problem, intelli-
gent systems are required that process this information in a fast way and automatically signal
potential threats. This way, moderators can respond quickly and prevent threatening situations
from escalating. According to recent research, teenagers are generally in favour of such auto-
matic monitoring, provided that effective follow-up strategies are formulated, and that privacy
and autonomy are guaranteed [13].
Parental control tools (e.g. NetNanny, https://www.netnanny.com/) already block unsuited or undesirable content and some social networks make use of keyword-based moderation
tools (i.e. using lists of profane and insulting words to flag harmful content). However, such
approaches typically fail to detect implicit and subtle forms of cyberbullying in which no
explicit vocabulary is used. This creates the need for intelligent and self-learning systems that
go beyond keyword spotting and hence improve the recall of cyberbullying detection.
The ultimate goal of this type of research is to develop models that could improve manual
monitoring for cyberbullying on social networks. We explore the automatic detection of
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 2 / 22
corresponding to instances that were kept
separately to test the experimental design (referred
to as the "hold-out test set" in the paper), 3) a
feature mapping dictionary that allows to trace all
indices in the feature vector files back to the
corresponding feature types (e.g. the feature
indices 0 to 14,230 represent word 3-gram
features). We also share the seed terms that were
used to construct the corpora for our topic model
features. Lastly, we provide an Excel spreadsheet
presenting a results overview of all the tested
systems. All of this information is made available
for both the Dutch and English experiments.
Funding: The work presented in this paper was
carried out in the framework of the AMiCA IWT
SBO-project 120007 project to WD and VH, funded
by the government Flanders Innovation &
Entrepreneurship (VLAIO) agency; http://www.
vlaio.be. The funders had no role in study design,
data collection and analysis, decision to publish, or
preparation of the manuscript.
Competing interests: The authors have declared
that no competing interests exist.
textual signals of cyberbullying, in which cyberbulying is approached as a complex phenome-
non that can be realised in various ways (see the Annotation guidelines section for a detailed
overview). While the vast majority of the related research focuses on detecting cyberbullying
‘attacks’ (i.e. verbal aggression), the present study takes different types of cyberbullying into
account, including more implicit posts from the bully, but also posts written by victims and
bystanders. This is a more inclusive conceptualisation for the task of cyberbullying detection
and should aid in moderation and prevention efforts by capturing different and more implicit
signals of bullying.
To tackle this problem, we propose a machine learning method based on a linear SVM
classifier [14, 15] exploiting a rich feature set. The contribution we make is twofold: first, we
develop a complex classifier to detect signals of cyberbullying, which allows us to detect differ- ent types of cyberbullying that are related to different social roles involved in a cyberbullying
event. Second, we demonstrate that the methodology is easily portable to other languages, pro-
vided there is annotated data available, by performing experiments on an English and Dutch
dataset.
The remainder of this paper is structured as follows: the next section presents a definition
of cyberbullying and its participant roles and provides an overview of the state of the art in
cyberbullying detection. The Data collection and annotation section describes the corpus con- struction and annotation. Next, we present the experimental setup and discuss our experimen-
tal results for English and Dutch. Finally, the Conclusion and future research section concludes this paper and provides some perspectives for further research.
Related research
Both offline and online bullying are widely covered in the realm of social sciences and psychol-
ogy, and the increasing number of cyberbullying cases in recent years [16] has stimulated
research efforts to detect cyberbullying automatically. In the following section, we present a
definition of cyberbullying and identify its participant roles and we provide a brief overview of
automatic approaches to cyberbullying detection.
Cyberbullying definition and participant roles
A common starting point for conceptualising cyberbullying are definitions of traditional (i.e.
offline) bullying, one of the most influential ones being formulated by [17]. The researcher described bullying based on three main criteria, including i) intention (i.e. a bully intends to
inflict harm on the victim), ii) repetition (i.e. bullying acts take place repeatedly over time)
and iii) a power imbalance between the bully and the victim (i.e. a more powerful bully attacks
a less powerful victim). With respect to cyberbullying, a number of definitions are based on
the above criteria. A popular definition is that of [18, p. 376], which describes cyberbullying as
“an aggressive, intentional act carried out by a group or individual, using electronic forms of
contact, repeatedly and over time, against a victim who cannot easily defend him or herself”.
However, opinion on the applicability of the above characteristics to cyberbullying is very
much divided [19], and besides theoretical objections, a number of practical limitations have
been observed. Firstly, while [17] claims intention to be inherent to traditional bullying, this is
much harder to ascertain in an online environment. Online conversations lack the signals of a
face-to-face interaction like intonation, facial expressions and gestures, which makes them
more ambiguous than real-life conversations. The receiver may therefore get the wrong
impression that they are being offended or ridiculed [20]. Another criterion for bullying that
might not hold in online situations is the power imbalance between the bully and the victim.
This can be evident in real life (e.g. the bully is taller, stronger or older than the victim), but it
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 3 / 22
is hard to conceptualise or measure online, where power may be related to technological skills,
anonymity or the inability of the victim to escape from the bullying [19, 21]. Also empowering
for the bully are inherent characteristics of the Web: once defamatory or confidential informa-
tion is made public through the Internet, it is hard to remove.
Finally, while arguing that repetition distinguishes bullying from single acts of aggression,
[17] himself states that such a single aggressive action can be considered bullying under certain
circumstances. Accordingly, [21] claim that repetition in cyberbullying is problematic to oper-
ationalise, as it is unclear what the consequences are of a single derogatory message on a public
page. A single act of aggression or humiliation may cause continued distress and humiliation
for the victim if it is shared or liked by a large audience [21]. [22, p. 26] compare this with the
“snowball effect”: one post may be repeated or distributed by other people so that it becomes
out of the control of the initial bully and has larger effects than was originally intended.
Given these arguments, a number of less ‘strict’ definitions of cyberbullying were proposed
by among others [2, 5, 6], where a power imbalance and repetition are not deemed necessary
conditions for cyberbullying.
The above paragraphs demonstrate that defining cyberbullying is far from trivial, and vary-
ing prevalence rates (see the Introduction section) confirm that a univocal definition of the
phenomenon is still lacking in the literature [2]. Based on existing conceptualisations, we
define cyberbullying as content that is published online by an individual and that is aggressive or hurtful against a victim. Based on this definition, an annotation scheme was developed [23] to signal textual characteristics of cyberbullying, including posts from bullies, as well as reactions
from victims and bystanders.
Cyberbullying research also involves the identification of its participant roles. [24] were
among the first to define the roles in a bullying situation. Based on surveys among teenagers
involved in real-life bullying situations, they defined six participant roles: victims (i.e. who are
the target of repeated harassment), bullies (i.e. who are the initiative-taking perpetrators),
assistants of the bully (i.e. who encourage the bullying), reinforcers of the bully (i.e. who rein-
force the bullying), defenders (i.e. who comfort the victim, take their side or try to stop the bul-
lying) and outsiders (i.e. who ignore or distance themselves from the situation). In sum, in
addition to the bully and victim, the researchers distinguish four bystanders (i.e. assistants,
reinforcers, defenders and outsiders). [25], however, do not distinguish between reinforcers
and assistants of the bullying. Their typology includes victims, bullies and three types of
bystanders: i) bystanders who participate in the bullying, ii) bystanders who help or support
the victim and iii) bystanders who ignore the bullying. The cyberbullying roles that are identi-
fied in our annotation scheme are based on existing bullying role typologies, given that tradi-
tional bullying roles are applicable to cyberbullying as well [26, 27]. More details about the
different roles that we take into account are provided in the Data collection and annotation
section.
Bystanders and -to a lesser extent- victims are often overlooked in the related research. As a
result, these studies can be better characterised as verbal aggression detection concerned with
retrieving bully attacks. By taking bystanders into account, we capture different and more sub-
tle signals of a bullying episode. Note that while in this work we did not include classification
of the participant roles as such, they are essential to the conceptualisation of the current detec-
tion task.
Detecting and preventing cyberbullying
As mentioned earlier, although research on cyberbullying detection is more limited than social
studies on the phenomenon, some important advances have been made in recent years. In
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 4 / 22
what follows, we present a brief overview of the most important natural language processing
approaches to cyberbullying detection, but we refer to the survey paper by [28] for a more
detailed overview.
Although some studies have investigated the effectiveness of rule-based modelling [29], the
dominant approach to cyberbullying detection involves machine learning. Most machine
learning approaches are based on supervised [30, 30–32] or semi-supervised learning [33]. The
former involves the construction of a classifier based on labelled training data, whereas semi-
supervised approaches rely on classifiers that are built from a training corpus containing a
small set of labelled and a large set of unlabelled instances. Semi-supervised methods are often
used to handle data sparsity, a typical issue in cyberbullying research. As cyberbullying detec-
tion essentially involves the distinction between bullying and non-bullying posts, the problem
is generally approached as a binary classification task where the positive class is represented
by instances containing (textual) cyberbullying, while the negative class is devoid of bullying
signals.
A key challenge in cyberbullying research is the availability of suitable data, which is neces-
sary to develop models that characterise cyberbullying. In recent years, only a few datasets
have become publicly available for this particular task, such as the training sets provided in
the context of the CAW 2.0 workshop (http://caw2.barcelonamedia.org), a MySpace (https://
myspace.com) [34] and Formspring (http://www.formspring.me) cyberbullying corpus anno-
tated with the help of Mechanical Turk [29], and more recently, the Twitter Bullying Traces
dataset [35]. Many studies have therefore constructed their own corpus from social media
websites that are prone to bullying content, such as YouTube [30, 32], Twitter [36, 37], Insta-
gram [38], MySpace [31, 34], FormSpring [29, 39], Kaggle [40] and ASKfm [41]. Despite the
bottleneck of data availability, cyberbullying detection approaches have been successfully
implemented over the past years and the relevance of automatic text analysis techniques to
ensure child safety online has been recognised [42].
Among the first studies on cyberbullying detection are [29–31], who explored the predic-
tive power of n-grams (with and without tf-idf weighting), part-of-speech information (e.g. first and second pronouns), and sentiment information based on (polarity and profanity)
lexicons for this task. Similar features were not only exploited for coarse-grained cyberbully-
ing detection, but also for the detection of more fine-grained cyberbullying categories [41].
Despite their apparent simplicity, content-based features (i.e. lexical, syntactic and sentiment
information) are very often exploited in recent approaches to cyberbullying detection [33,
43]. In fact, as observed by [28], more than 41 papers have approached cyberbullying detec-
tion using content-based features, which confirms that this type of information is crucial for
the task.
More and more, however, content-based features are combined with semantic features
derived from topic model information [44], word embeddings and representation learning
[43, 45]. More recent studies have also demonstrated the added value of user-based informa-
tion for the task, more specifically by including users’ activities (i.e. the number of posts) on a
social network, their age, gender, location, number of friends and followers, and so on [32, 33,
46, 47]. A final feature type that gains increasing popularity in cyberbullying detection are net-
work-based features, whose application is motivated by the frequent use of social media data
for the task. By using network information, researchers aim to capture social relations between
participants in a conversation (e.g. bully versus victim), and other relevant information such as
the popularity of a person (i.e. which can indicate the power of a potential bully) on a social
network, the number of (historical) interactions between two people, and so on. [48] for
instance used network-based features to take the behavioural history of a potential bully into
account. [49] detected cyberbullying in tweets and included network features inspired by
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 5 / 22
Olweus’ [17] bullying conditions (see supra). More specifically, they measured the power
imbalance between a bully and victim, as well as the bully’s popularity based on interaction
graphs and the bully’s position in the network.
As mentioned earlier, social media are a commonly used genre for this type of tasks. More
recently, researchers have investigated cyberbullying detection in multi-modal data offered by
specific platforms. For instance [38] explored cyberbullying detection using multi-modal data
extracted from the social network Instagram. More precisely, they combined textual features
derived from the posts themselves with user metadata and image features and showed that
integrating the latter enhanced the classification performance. [37] also detected cyberbullying
in different data genres, including ASKfm, Twitter, and Instagram. They took role information
into account by integrating bully and victim scores as features, based on the occurrence of
bully-related keywords in their sent or received posts.
With respect to the datasets used in cyberbullying research, it can be observed that corpora
are often composed by keyword search (e.g. [43, 44]), which produces a biased dataset of posi-
tive (i.e. bullying) instances. To balance these corpora, negative data are often added from a
background corpus or data resampling [50] techniques are adopted [33, 47]. For this research,
data were randomly crawled across ASKfm and no keyword search was used to collect bullying
data. Instead, all instances were manually annotated for the presence of bullying. As a result,
our corpus contains a realistic distribution of bullying instances.
When looking at the performance of automatic cyberbullying, we see that scores vary
greatly and do not only depend on the implemented algorithm and parameter settings, but
also on a number of other variables. These include the metrics that are used to evaluate the sys-
tem (i.e. micro- or macro-averaged F1, precision, recall, AUC, etc.), the corpus genre (i.e. Face-
book, Twitter, ASKfm, Instagram) and class distribution (i.e. balanced or unbalanced), the
annotation method (i.e. automatic annotations or manual annotations using crowdsourcing
or by experts) and, perhaps the most important distinguishing factor, the conceptualisation of
cyberbullying that is used. More concretely, while some approaches identify sensitive topics
[30] or insulting language [29], others propose a more comprehensive approach by capturing
different types of cyberbullying [41] or by modelling the bully-victim communications
involved in a cyberbullying incident [37].
The studies discussed in this section demonstrated the variety of approaches that have been
used to tackle cyberbullying detection. However, most of them focused on cyberbullying
‘attacks’, or posts written by a bully. Moreover, it is not entirely clear if different forms of
cyberbullying were taken into account (e.g. sexual intimidation or harassment, or psychologi-
cal threats), in addition to derogatory language or insults. In the present study, cyberbullying
is considered a complex phenomenon comprising different forms of harmful online behav-
iour, which are described in more detail in our annotation scheme [23]. Purposing to facilitate
manual monitoring efforts on social networks, we developed a system that automatically
detects signals of cyberbullying, including attacks from bullies, as well as victim and bystander
reactions, the latter of which are generally overlooked in related research.
Most similar to this research is the work by [44], [43, 45], who investigated bullying traces
posted by different author roles (e.g. bully, victim, bystander, assistant, defender, reporter,
accuser, reinforcer). However, they collected tweets using the keywords bully, bullied and bully- ing. As a result, their corpus contained many reports or testimonials of cyberbullying (example 1), instead of actual cyberbullying. Moreover, their method implies that cyberbullying signals
that are devoid of such keywords are not included in the training corpus.
1. “Some tweens got violent on the n train, the one boy got off after blows 2 the chest. . . Saw him cryin as he walkd away: (bullying not cool” [44, p. 658]
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 6 / 22
What clearly distinguishes these works from the present is that their conceptualisation of
cyberbullying is not explained. It is, in other words, not clear which type of posts are consid-
ered bullying and which are not. In the present research, we identify different types of bullying
and all are included in the positive class of our experimental corpus.
For this research, English and Dutch social media data were annotated for fine-grained
forms of cyberbullying, based on the actors involved in a cyberbullying incident. After prelimi-
nary experiments for Dutch [41, 51], we currently present an optimised cyberbullying detec-
tion method for English and Dutch and hereby show that the proposed methodology can
easily be applied to different languages, provided that annotated data are available.
Data collection and annotation
To be able to build representative models for cyberbullying, a suitable dataset is required. This
section describes the construction of two corpora, English and Dutch, containing social media
posts that are manually annotated for cyberbullying according to our fine-grained annotation
scheme. This allows us to cover different forms and participants (or roles) involved in a cyber- bullying event.
Data collection
Two corpora were constructed by collecting data from the social networking site ASKfm,
where users can create profiles and ask or answer questions, with the option of doing so anon-
ymously. ASKfm data typically consists of question-answer pairs published on a user’s profile.
The data were retrieved by crawling a number of seed profiles using the GNU Wget software
(http://www.gnu.org/software/wget/) in April and October, 2013. After language filtering
(i.e. non-English or non-Dutch content was removed), the experimental corpora comprised
113,698 and 78,387 posts for English and Dutch, respectively.
Data annotation
Cyberbullying has been a widely covered research topic recently and studies have shed light on
direct and indirect types of cyberbullying, implicit and explicit forms, verbal and non-verbal
cyberbullying, and so on. This is important from a sociolinguistic point of view, but knowing
what cyberbullying involves is also crucial to build models for automatic cyberbullying detec-
tion. In the following paragraphs, we present our data annotation guidelines [23] and focus on
different types and roles related to the phenomenon.
Types of cyberbullying
Cyberbullying research is mainly centered around the conceptualisation, occurrence and pre-
vention of the phenomenon [1, 52, 53]. Sociolinguistic studies have identified different types
of cyberbullying [12, 54, 55] and compared these types with forms of traditional or offline
bullying [20]. Like traditional bullying, direct and indirect forms of cyberbullying have been
identified. Direct cyberbullying refers to actions in which the victim is directly involved (e.g.
sending a virus-infected file, excluding someone from an online group, insulting and threaten-
ing), whereas indirect cyberbullying can take place without awareness of the victim (e.g. outing or publishing confidential information, spreading gossip, creating a hate page on social net-
working sites) [20].
The present annotation scheme describes some specific textual categories related to cyber-
bullying, including threats, insults, defensive statements from a victim, encouragements to the
harasser, etc. (see the Data collection and annotation section for a complete overview). All of
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 7 / 22
these forms were inspired by social studies on cyberbullying [7, 20] and manual inspection of
cyberbullying examples.
Roles in cyberbullying
Similarly to traditional bullying, cyberbullying involves a number of participants that adopt
well-defined roles. Researchers have identified several roles in (cyber)bullying interactions.
Although traditional studies on bullying have mainly concentrated on bullies and victims [24],
the importance of bystanders in a bullying episode has been acknowledged [56, 57]. Bystanders
can support the victim and mitigate the negative effects caused by the bullying [57], especially
on social networking sites, where they hold higher intentions to help the victim than in real life
conversations [58]. [25] distinguish three main types of bystanders: i) bystanders who partici-
pate in the bullying, ii) who help or support the victim and iii) those who ignore the bullying.
Given that passive bystanders are hard to recognise in online text, only the former two are
included in our annotation scheme.
Annotation guidelines
To operationalise the task of automatic cyberbullying detection, we elaborated a detailed anno-
tation scheme for cyberbullying that is strongly embedded in the literature and applied it to
our corpora. The applicability of the scheme was iteratively tested. Our final guidelines for the
fine-grained annotation of cyberbullying are described in a technical report [23]. The objective
of the scheme was to indicate several types of textual cyberbullying and verbal aggression, their
severity, and the author participant roles. The scheme is formulated to be generic and is not
limited to a specific social media platform. All messages were annotated in context (i.e. pre-
sented within their original content or conversation event) when available.
Essentially, the annotation scheme describes two levels of annotation. Firstly, the annotators
were asked to indicate, at the message or post level, whether the text under investigation was
related to cyberbullying. If the message was considered harmful and thus contained indica-
tions of cyberbullying, annotators identified the author’s participant role. Based on the litera-
ture on role-allocation in cyberbullying episodes [25, 59], four roles are distinguished in the
annotation scheme, including victim, bully, and two types of bystanders.
1. Harasser or bully: person who initiates the bullying.
2. Victim: person who is harassed.
3. Bystander-defender: person who helps the victim and discourages the harasser from con-
tinuing his actions.
4. Bystander-assistant: person who does not initiate, but helps or encourages the harasser.
Secondly, at the sub-sentence level, the annotators were tasked with the identification of
fine-grained text categories related to cyberbullying. In the literature, different forms of cyber-
bullying are identified [12, 54, 55] and compared with traditional bullying [20]. Based on these
forms, the annotation scheme describes a number of textual categories that are often inherent
to a cyberbullying event, such as threats, insults, defensive statements from a victim, encour-
agements to the harasser, etc. Most of the categories are related to direct forms of cyberbullying
(as defined by [25]), while one is related to outing [25], an indirect form of cyberbullying, namely defamation. Additionally, a number of subcategories were defined to make the annota- tion scheme as concrete and distinctive as possible (e.g., discrimination as a subcategory of insult). All cyberbullying-related categories in the scheme are listed below, and an example post for each category is presented in Table 1.
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 8 / 22
• Threat/blackmail: expressions containing physical or psychological threats or indications of
blackmail.
• Insult: expressions meant to hurt or offend the victim.
• General insult: general expressions containing abusive, degrading or offensive language
that are meant to insult the addressee.
• Attacking relatives: insulting expressions towards relatives or friends of the victim.
• Discrimination: expressions of unjust or prejudicial treatment of the victim. Two types of
discrimination are distinguished (i.e. sexism and racism). Other forms of discrimination
should be categorised as general insults.
• Curse/exclusion: expressions of a wish that some form of adversity or misfortune will befall
the victim and expressions that exclude the victim from a conversation or a social group.
• Defamation: expressions that reveal confident or defamatory information about the victim
to a large public.
• Sexual Talk: expressions with a sexual meaning or connotation. A distinction is made
between innocent sexual talk and sexual harassment.
• Defense: expressions in support of the victim, expressed by the victim himself or by a
bystander.
• Bystander defense: expressions by which a bystander shows support for the victim or dis-
courages the harasser from continuing his actions.
• Victim defense: assertive or powerless reactions from the victim.
• Encouragement to the harasser: expressions in support of the harasser.
• Other: expressions that contain any other form of cyberbullying-related behaviour than the
ones described here.
It is important to note that the categories were always indicated in text, even if the post
in which they occurred was not considered harmful, for instance in the post “hi bitches, in
for a movie?”, “bitches” was annotated as an insult while the post itself was not considered
cyberbullying.
Table 1. Definitions and brat annotation examples of more fine-grained text categories related to cyberbullying.
Annotation
category
Annotation example
Threat/blackmail [I am going to find out who you are & I swear you are going to regret it.] THREAT
Insult [Kill yourself] CURS
[you fucking mc slut!!!!] GEN. INSULT
[NO ONE LIKES YOU!!!!!] GEN. INSULT
[You
are an ugly useless little whore!!!!] GEN. INSULT
Curse/Exclusion [Fuck you.] GEN. INSULT
[Now shush I don’t wanna hear anything.] CURSE OR EXCLUSION
Defamation [She slept with her ex behind his girlfiends back and she and him had broken up.] DEFAMATION
Sexual Talk [Naked pic of you now.] SEXUAL HARASSMENT
Defense [I would appreciate if you dindn’t talk shit about my bestfriend.] GEN. VICTIM DEFENSE
He has enough
to deal with already.
Encour. to har. [She is a massive slut] GEN. INSULT
[i agree with you @user she is!] ENCOUR. HARASSER
[LOL AT HER
mate, im on your side] ENCOUR. HARASSER
https://doi.org/10.1371/journal.pone.0203794.t001
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 9 / 22
To provide the annotators with some context, all posts were presented within their original
conversation when possible. All annotations were done using the brat rapid annotation tool
[60], some examples of which are presented in Table 1.
As can be deduced from the examples in the table, there were no restrictions as to what
form the annotations could take. They could be adjectives, noun phrases, verb phrases, and so
on. The only condition was that the annotation could not span more than one sentence and
less than one word. Posts that were (primarily) written in another language than the corpus
language (i.e. Dutch and English) were marked as such and required no further annotations.
We examined the validity of our guidelines and the annotations with an inter-annotator
agreement experiment that is described in the following section.
Annotation statistics
The English and Dutch corpora were independently annotated for cyberbullying by trained
linguists. All were Dutch native speakers and English second-language speakers. To demon-
strate the validity of our guidelines, inter-annotator agreement scores were calculated using
Kappa on a subset of each corpus. Inter-rater agreement for Dutch (2 raters) is calculated
using Cohen’s Kappa [61]. Fleiss’ Kappa [62] is used for the English corpus (> 2 raters). Kappa
scores for the identification of cyberbullying are κ = 0.69 (Dutch) and κ = 0.59 (English). As shown in Table 2, inter-annotator agreement for the identification of the more fine-
grained categories for English varies from fair to substantial [63], except for defamation, which appears to be more difficult to recognise. No encouragements to the harasser were present in
this subset of the corpus. For Dutch, the inter-annotator agreement is fair to substantial, except
for curse and defamation. Analysis revealed that one of both annotators often annotated the lat- ter as an insult, and in some cases even did not consider it as cyberbullying-related.
In short, the inter-rater reliability study shows that the annotation of cyberbullying is not
trivial and that more fine-grained categories like defamation, curse and encouragements are sometimes hard to recognise. It appears that defamations were sometimes hard to distinguish
from insults, whereas curses and exclusions were sometimes considered insults or threats.
The analysis further reveals that encouragements to the harasser are subject to interpretation.
Some are straightforward (e.g. “I agree we should send her hate”), whereas others are subject
to the annotator’s judgment and interpretation (e.g. “hahaha”, “LOL”).
Experimental setup
In this paper, we explore the feasibility of automatically recognising signals of cyberbullying. A
crucial difference with related research is that we do not only model bully ‘attacks’, but also
more implicit forms of cyberbullying and reactions from victims and bystanders (i.e. all under
one binary label ‘signals of cyberbullying’), since these could likewise indicate that cyberbully-
ing is going on. The experiments described in this paper focus on the automatic detection of
such cyberbullying signals that need to be further investigated by human moderators when
applied in a real-life moderation loop.
The English and Dutch corpus contain 113,698 and 78,387 posts, respectively. As shown in
Table 3, the experimental corpus features a heavily imbalanced class distribution with the
Table 2. Inter-annotator agreement on the fine-grained categories related to cyberbullying.
Threat Insult Defense Sexual talk Curse/exclusion Defamation Encouragements to the harasser
English 0.65 0.63 0.45 0.38 0.58 0.15 N/A
Dutch 0.52 0.66 0.63 0.53 0.19 0.00 0.21
https://doi.org/10.1371/journal.pone.0203794.t002
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 10 / 22
large majority of posts not being part of cyberbullying. In classification, this class imbalance
can lead to decreased performance. We apply cost-sensitive SVM as a possible hyperparameter
in optimisation to counter this. The cost-sensitive SVM reweighs the penalty parameter C of the error term by the inverse class-ratio. This means that misclassifications of the minority
positive class are penalised more than classification errors on the majority negative class.
Other pre-processing methods to handle data imbalance in classification include feature filter-
ing metrics and data resampling [64]. These methods were omitted as they were found to be
too computationally expensive given our high-dimensional dataset.
For the automatic detection of cyberbullying, we performed binary classification experi-
ments using a linear kernel support vector machine (SVM) implemented in LIBLINEAR [65]
by making use of Scikit-learn [66], a machine learning library for Python. The motivation
behind this is twofold: i) support vector machines (SVMs) have proven to work well for tasks
similar to the ones under investigation [67] and ii) LIBLINEAR allows fast training of large-
scale data which allow for a linear mapping (which was confirmed after a series of preliminary
experiments using LIBSVM with linear, RBF and polynomial kernels).
The classifier was optimised for feature type (see the Pre-processing and feature engineering
section) and hyperparameter combinations (see Table 4). Model selection was done using
10-fold cross validation in grid search over all possible feature types (i.e. groups of similar fea-
tures, like different orders of n-gram bag-of-words features) and hyperparameter configura- tions. The best performing hyperparameters are selected by F1 score on the positive class. The
winning model is then retrained on all held-in data and subsequently tested on a hold-out test
set to assess whether the classifier is over- or under-fitting. The hold-out set represents a ran-
dom sample (10%) of all data. The folds were randomly stratified splits over the hold-in class
distribution. Testing all feature type combinations is a rudimentary form of feature selection
and provides insight into which types of features work best for this particular task.
Feature selection over all individual features was not performed because of the large feature
space (NL: 795,072 and EN: 871,296 individual features). [68], among other researchers, dem-
onstrated the importance of joint optimisation, where feature selection and hyperparameter
optimisation are performed simultaneously, since the techniques mutually influence each
other.
The optimised models are evaluated against two baseline systems: i) an unoptimised lin-
ear-kernel SVM (configured with default parameter settings) based on word n-grams only and, ii) a keyword-based system that marks posts as positive for cyberbullying if they contain a
word from existing vocabulary lists composed by aggressive language and profanity terms.
Table 3. Statistics of the English and Dutch cyberbullying corpus.
Corpus size Number(ratio) of bullying posts
English 113,698 5,375(4.73%)
Dutch 78,387 5,106(6.97%)
https://doi.org/10.1371/journal.pone.0203794.t003
Table 4. Hyperparameters in grid-search model selection.
Hyperparameter Values
Penalty of error term C 1e{−3, −2, . . ., 2,3}
Loss function Hinge, squared hinge
Penalty: norm used in penalisation ‘l1’ (‘least absolute deviations’) or ‘l2’ (‘least squares’)
Class weight (sets penalty C of class i to weight�C)
None or ‘balanced’, i.e. weight inversely proportional to class
frequencies
https://doi.org/10.1371/journal.pone.0203794.t004
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 11 / 22
Pre-processing and feature engineering
As pre-processing, we applied tokenisation, PoS-tagging and lemmatisation to the data using
the LeTs Preprocess Toolkit [69]. In supervised learning, a machine learning algorithm takes a
set of training instances (of which the label is known) and seeks to build a model that generates
a desired prediction for an unseen instance. To enable the model construction, all instances
are represented as a vector of features (i.e. inherent characteristics of the data) that contain
information that is potentially useful to distinguish cyberbullying from non-cyberbullying
content.
We experimentally tested whether cyberbullying events can be recognised automatically by
lexical markers in a post. To this end, all posts were represented by a number of information
sources (or features) including lexical features like bags-of-words, sentiment lexicon features and topic model features, which are described in more detail below. Prior to feature extraction,
some data cleaning steps were executed, such as the replacement of hyperlinks and @-replies,
removal of superfluous white spaces, and the replacement of abbreviations by their full form
(based on an existing mapping dictionary: http://www.chatslang.com/terms/abbreviations/).
Additionally, tokenisation was applied before n-gram extraction and sentiment lexicon match- ing, and stemming was applied prior to extracting topic model features.
After pre-processing of the corpus, the following feature types were extracted:
• Word n-gram bag-of-words: binary features indicating the presence of word unigrams, bigrams and trigrams.
• Character n-gram bag-of-words: binary features indicating the presence of character bigrams, trigrams and fourgrams (without crossing word boundaries). Character n-grams provide some abstraction from the word level and provide robustness to the spelling varia-
tion that characterises social media data.
• Term lists: one binary feature derived for each one out of six lists, indicating the presence of
an item from the list in a post:
• proper names: a gazetteer of named entities collected from several resources (e.g. Wikipedia).
• ‘allness’ indicators (e.g. “always”, “everybody”): forms which indicate rhetorical superlativ-
ity [70] which can be helpful in identifying the often hyperbolic bullying language.
• diminishers (e.g. “slightly”, “relatively”): diminishers, intensifiers and negation words were
all obtained from an English grammar describing these lexical classes and existing senti-
ment lexicons (see further).
• intensifiers (e.g. “absolutely”, “amazingly”)
• negation words
• aggressive language and profanity words: for English, we used the Google Profanity list
(https://code.google.com/archive/p/badwordslist/downloads). For Dutch, a public profan-
ity lexicon was consulted (http://scheldwoorden.goedbegin.nl).
Person alternation is a binary feature indicating whether the combination of a first and
second person pronoun occurs in order to capture interpersonal intent.
• Subjectivity lexicon features: positive and negative opinion word ratios, as well as the over-
all post polarity were calculated using existing sentiment lexicons. For Dutch, we made use
of the Duoman [71] and Pattern [72] lexicons. For English, we included the Liu and Hu
opinion lexicon [73], the MPQA lexicon [74], the General Inquirer Sentiment Lexicon [75],
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 12 / 22
AFINN [76], and MSOL [77]. For both languages, we included the relative frequency of all
68 psychometric categories in the Linguistic Inquiry and Word Count (LIWC) dictionary
for English [78] and Dutch [79].
• Topic model features: by making use of the Gensim topic modelling library [80], several
LDA [81] and LSI [82] topic models with varying granularity (k = 20, 50, 100 and 200) were trained on data corresponding to each fine-grained category of a cyberbullying event (e.g.
threats, defamations, insults, defenses). The topic models were based on a background cor-
pus (EN: ± 1,200,000 tokens, NL: ± 1,400,000 tokens) scraped with the BootCaT [83] web- corpus toolkit. BootCaT collected ASKfm user profiles using lists of manually determined
seed words that are characteristic of the cyberbullying categories.
When applied to the training data, this resulted in 871,296 and 795,072 features for English
and Dutch, respectively.
Results
In this section, we present the results of our experiments to automatically detect cyberbullying
signals in an English and Dutch corpus of ASKfm posts. Ten-fold cross-validation was per-
formed in exhaustive grid search over different feature type and hyperparameter combinations
(see the Experimental setup section). The unoptimised word n-gram-based classifier and key- word-matching system serve as baselines for comparison. Precision, Recall and F1 performance metrics were calculated on the positive class. We also report Area Under the Receiver Operator
Curve (AUROC) scores, a performance metric that is more robust to data imbalance than pre-
cision, recall and F score [84].
Table 5 gives us an indication of which feature type combinations score best and hence con-
tribute most to this task. It presents the cross-validation and hold-out scores of a set of feature
Table 5. Cross-validated and hold-out scores (%) according to different metrics (F1, precision, recall, accuracy and area under the curve) for the English and Dutch
three best and worst combined feature type systems.
Feature combination Cross-validation scores Hold-out scores
F1 P R Acc AUROC F1 P R Acc AUROC
English
Best three B + C + D + E 64.26 73.32 57.19 96.97 78.07 63.69 74.13 55.82 97.21 77.47
A + B + C 64.24 73.22 57.23 96.96 78.09 64.32 74.08 56.83 97.24 77.96
A + C + E 63.84 73.21 56.59 96.94 77.78 62.94 72.82 55.42 97.14 77.24
Worst three D 40.48 38.98 42.12 94.10 69.41 39.56 39.56 39.56 94.71 68.39
A + D + E 38.95 31.47 51.10 92.37 72.76 40.71 33.87 51.00 93.49 73.22
E 17.35 9.73 79.91 63.72 71.41 15.70 8.72 78.51 63.07 70.44
Baseline word n-gram 58.17 67.55 51.07 96.54 74.93 59.63 69.57 52.17 96.57 75.50 profanity 17.17 9.61 80.14 63.73 71.53 17.61 9.90 78.51 63.79 71.34
Dutch
Best three A + B + C + E 61.20 56.76 66.40 94.47 81.42 58.13 54.03 62.90 94.58 79.75
A + B + C + D + E 61.03 71.55 53.20 95.53 75.86 58.72 67.40 52.03 95.62 75.21
A + C + E 60.82 71.66 52.84 95.53 75.68 58.15 67.71 50.96 95.61 74.71
Worst three D + B 32.90 29.23 37.63 89.91 65.61 30.16 34.72 26.65 92.61 61.73
D 28.65 19.36 55.10 81.97 69.48 25.13 16.73 50.53 81.99 67.26
B 24.74 21.24 29.61 88.16 60.94 17.99 23.15 14.71 91.98 55.80
Baseline word n-gram 50.39 67.80 40.09 94.81 69.38 49.54 64.29 40.30 95.09 69.44 profanity 28.46 19.24 54.66 81.99 69.28 25.13 16.73 50.53 81.99 67.26
https://doi.org/10.1371/journal.pone.0203794.t005
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 13 / 22
combinations, which are explained in the feature groups legend (Table 6). A total of 31 feature
type combinations, each with 28 different hyperparameter sets have been tested. Table 5 shows
the results for the three best scoring systems by included feature types with optimised hyper-
parameters. The maximum obtained F1 score in cross-validation is 64.26% for English and
61.20% for Dutch and shows that the classifier benefits from a variety of feature types. The
results on the hold-out test set show that the trained systems generalise well on unseen data,
indicating little under- or overfitting. The simple keyword-matching baseline system has the
lowest performance for both languages even though it obtains high recall for both languages,
especially for English (80.14%), suggesting that profane language characterises many cyberbul-
lying-related posts. Feature group and hyperparameter optimisation provides a considerable
performance increase over the unoptimised word n-gram baseline system. The top-scoring systems for each language do not differ a lot in performance, except the best system for Dutch,
which trades recall for precision when compared to the runner-ups.
Table 7 presents the scores of the (hyperparameter-optimised) single feature type systems,
to gain insight into the performance of these feature types when used individually. Analysis of
the combined and single feature type sets reveals that word n-grams, character n-grams, and subjectivity lexicons prove to be strong features for this task. In effect, adding character n- grams always improved classification performance for both languages. They are likely to pro-
vide robustness to lexical variation in social media text, as compared to word n-grams. While subjectivity lexicons appear to be discriminative features, term lists perform badly on their
own as well as in combinations for both languages. This shows once again (see the profanity
baseline) that cyberbullying detection requires more sophisticated information sources than
profanity lists. Topic models seem to do badly for both languages on their own, but in
Table 6. Feature group mapping (Table 5).
A word n-grams B subjectivity lexicons
C character n-grams D term lists
E topic models
https://doi.org/10.1371/journal.pone.0203794.t006
Table 7. Cross-validated and hold-out scores (%) according to different metrics (F1, precision, recall, accuracy and area under the ROC curve) for English and
Dutch single feature type systems.
Feature type Cross-validation scores hold-out scores
F1 P R Acc AUROC F1 P R Acc AUROC
English
word n-grams 60.09 60.49 59.69 96.22 78.87 58.35 57.12 59.64 96.27 78.79 subjectivity lexicons 56.82 73.32 46.38 96.64 72.77 56.16 72.61 45.78 96.87 72.50
character n-grams 52.69 58.70 47.80 95.91 73.06 53.33 62.37 46.59 96.43 72.65 term lists 40.48 38.98 42.12 94.10 69.41 39.56 39.56 39.56 94.71 68.39
topic models 17.35 9.73 79.91 63.72 71.41 15.70 8.72 78.51 63.07 70.44
Dutch
word n-grams 55.53 72.64 44.94 95.27 71.88 54.99 70.20 45.20 95.57 71.99 subjectivity lexicons 54.34 54.12 54.56 93.97 75.65 51.82 50.61 53.09 94.09 74.90
character n-grams 51.70 67.58 41.86 94.86 70.22 50.46 65.20 41.15 95.17 69.88 term lists 28.65 19.36 55.10 81.97 69.48 25.13 16.73 50.53 81.99 67.26
topic models 24.74 21.24 29.61 88.16 60.94 17.99 23.15 14.71 91.98 55.80
https://doi.org/10.1371/journal.pone.0203794.t007
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 14 / 22
combination with other features, they improve Dutch performance consistently. A possible
explanation for their varying performance in both languages would be that the topic models
trained on the Dutch background corpus are of better quality than the English ones. In effect,
a random selection of background corpus texts reveals that the English scrape contains more
noisy data (i.e. low word-count posts and non-English posts) compared to the Dutch scraped
corpus.
A shallow qualitative analysis of the classification output provided insight into some of the
classification mistakes.
Table 8 gives an overview of the error rates per cyberbullying category of the best perform-
ing and baseline systems. This could give an indication of the types of bullying are hard to
detect by the current classifier. All categories are always considered positive for cyberbullying
(i.e. the error rate equals the false negative rate), except for Sexual and Insult which can also be negative (in case of harmless sexual talk and ‘socially acceptable’ insulting language like “hi
bitches, in for a movie?” the corresponding category was indicated, but the post itself was not
annotated as cyberbullying) and Not cyberbullying, which is always negative. Error rates often being lowest for the profanity baseline confirms that it performs particularly well in terms of
recall (at the expense of precision, see Table 5). When looking at the best system for both lan-
guages, we see that Defense is the hardest category to classify. This should not be a surprise as the category comprises defensive posts from bystanders and victims, which contain less aggres-
sive language than cyberbullying attacks and are often shorter in length than the latter. Asser-
tive defensive posts (i.e. a subcategory of Defense) which attack the bully are, however, more often correctly classified. There are not sufficient instances of the Encouragement class for either language in the hold-out set to be representative. In both languages, threats, curses and
incidences of sexual harassment are most easily recognisable, showing (far) lower error rates
than the categories Defamation, Defense, Encouragements to the harasser, and Insult. A qualitative error analysis of the English and Dutch predictions reveals that false positives
often contain aggressive language directed at a second person, often denoting personal flaws
Table 8. Error rates (%) per cyberbullying subcategory on hold-out for English and Dutch systems.
Category Nr. occurrences in hold-out Profanity baseline Word n-gram baseline Best system
English
Curse n = 109 14.68 30.28 24.77 Defamation n = 21 23.81 47.62 38.10 Defense n = 165 22.42 52.12 43.64 Encouragement n = 1 0.00 100.00 100.00 Insult n = 345 26.67 41.74 35.94 Sexual n = 165 63.80 21.47 21,47 Threat n = 12 8.33 41.67 25.00 Not cyberbullying n = 10,714 36.94 1.10 0.76
Dutch
Curse n = 96 39.58 50.00 22.92 Defamation n = 6 100.00 66.67 33.33 Defense n = 200 52.50 63.50 46.00 Encouragement n = 5 40.00 60.00 40.00 Insult n = 355 43.38 47.89 28.17 Sexual n = 37 37.84 21.62 27.03 Threat n = 15 33.33 46.67 20.00 Not cyberbullying n = 7,295 15.63 1.23 3.07
https://doi.org/10.1371/journal.pone.0203794.t008
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 15 / 22
or containing sexual and profanity words. We see that misclassifications are often short posts
containing just a few words and that false negatives often lack explicit verbal signs of cyberbul-
lying (e.g. insulting or profane words) or are ironic (examples 2 and 3). Additionally, we see
that cyberbullying posts containing misspellings or grammatical errors and incomplete words
are also hard to recognise as such (examples 4 and 5). The Dutch and English corpus are over-
all similar with respect to qualitative properties of classification errors.
2. You might want to do some sports ahah x
3. Look who is there… my thousandth anonymous hater, congratulations!
4. ivegot 1 word foryou… yknow whatit is? ! slut
5. One word for you: G—A—…
In short, the experiments show that our classifier clearly outperforms both a keyword-
based and word n-gram baseline. However, analysis of the classifier output reveals that false negatives often lack explicit clues that cyberbullying is going on, indicating that our system
might benefit from irony recognition and integrating world knowledge to capture such
implicit realisations of cyberbullying.
Our annotation scheme allowed to indicate different author roles, which provides better
insight into the realisation of cyberbullying. Table 9 presents the error rates of our classifier for
the different author roles, being harasser, victim, and two types of bystanders. We observe that
the error rates are high for bystander assistant and victim, but there are not sufficient instances in the hold-out set of the former role for either language to be representative. Error rates for
the victim class of 50.39% and 54% in English and Dutch respectively indicate that the role is hard to recognise by the classifier. A possible explanation for this could be that victim posts
in our corpus either expressed powerlessness facing the bully (example 6) or either contained
explicit aggressive language as well (example 7).
6. Your the one going round saying im a cunt and a twat and im ugly. tbh all im doing is stick- ing up for myself.
7. You’re fucked up saying I smell from sweat, because unlike some other people I shower every day BITCH
According to the figures, the most straightforward roles in detection are bystander defender and harasser.
Table 9. Error rates (%) per cyberbullying participant role on hold-out for English and Dutch systems.
Participant role Nr. occurrences in hold-out Profanity baseline Word n-gram baseline Best system
English
Harasser n = 328 20.43 48.48 43.60 Bystander assistant n = 2 50.00 100.00 100.00 Bystander defender n = 39 7.69 38.46 25.64 Victim n = 129 27.91 57.36 50.39 Not cyberbullying n = 10872 37.64 1.24 0.89
Dutch
Harasser n = 261 47.13 56.70 29.89 Bystander assistant n = 6 50.00 66.67 50.00 Bystander defender n = 52 25.00 38.46 23.08 Victim n = 150 62.00 72.00 54.00 Not cyberbullying n = 7370 16.01 1.42 3.41
https://doi.org/10.1371/journal.pone.0203794.t009
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 16 / 22
In the light of comparison with state-of-the-art approaches to cyberbullying, we observe
that competitive results are obtained with regard to [30–32, 41]. However, the fundamental
differences with respect to data collection, sources, and conceptualisations of bullying hardly
allow for direct comparison. Table 10 presents the experimental results obtained by [43–45]
who, like the current study, approach the task as detecting posts from bullies as well as from
victims and bystanders. Given their experimental setup (i.e. task description, data genre and
classifier), their work can be considered most similar to ours so their results might function
as benchmarks. Also here, a number of crucial differences with the current approach can be
observed: Firstly, their corpora were collected using the keywords “bully”, “bullying” and “bul-
lied”, which may bias the dataset towards the positive class and ensures that many explicit lexi-
calisations are present in the positive class. Second, it is not clear which types of cyberbullying
(i.e. explicit and implicit bullying, threats, insults, sexual harassment) are included in the posi-
tive class. Furthermore, as can be deduced from Table 10, the datasets are considerably smaller
than ours and show a more balanced class distribution (respectively 39% cyberbullying posts
in [43] and [44], and 29%/26% in [45]) than the ratio of bullying posts in our corpus (see
Table 3: 5% for English, 7% for Dutch). Hence, any comparison should be made with caution
due to these differences.
These studies obtain higher scores on similar task but vastly different datasets. Notably,
[45] shows a great improvement in classification performance using deep representational
learning with a semantic-enhanced marginalized denoising auto-encoder over traditional n- gram and topic modelling features.
Conclusions and future research
The goal of the current research was to investigate the automatic detection of cyberbullying-
related posts on social media. Given the information overload on the web, manual monitoring
for cyberbullying has become unfeasible. Automatic detection of signals of cyberbullying
would enhance moderation and allow to respond quickly when necessary.
Cyberbullying research has often focused on detecting cyberbullying ‘attacks’ and hence
overlook other or more implicit forms of cyberbullying and posts written by victims and
bystanders. However, these posts could just as well indicate that cyberbullying is going on. The
main contribution of this paper is that it presents a system to automatically detect signals of
cyberbullying on social media, including different types of cyberbullying, covering posts from
bullies, victims and bystanders. We evaluated our system on a manually annotated cyberbully-
ing corpus for English and Dutch and hereby demonstrated that our approach can easily be
applied to different languages, provided that annotated data for these languages are available.
A set of binary classification experiments were conducted to explore the feasibility of auto-
matic cyberbullying detection on social media. In addition, we sought to determine which
information sources contribute most to the task. Two classifiers were trained on an English
and Dutch ASKfm corpus and evaluated on a hold-out test of the same genre. Our experiments
reveal that the current approach is a promising strategy for detecting signals of cyberbullying
Table 10. Overview of the most related cyberbullying detection approaches.
Reference Classifier Corpus Bully rate F1 score
[44] SVM 1,762 tweets 39% 77%
[43] wvec+SVM 1,762 tweets 39% 78%
[45] smSDA+SVM 7,321 tweets 29% 72%
[45] smSDA+SVM 1,539 MySpace posts 26% 78%
https://doi.org/10.1371/journal.pone.0203794.t010
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 17 / 22
on social media automatically. After feature and hyperparameter optimisation of our models, a
maximum F1 score of 64.32% and 58.72% was obtained for English and Dutch, respectively.
The classifiers hereby significantly outperformed a keyword and an (unoptimised) n-gram baseline. A qualitative analysis of the results revealed that false positives often include implicit
cyberbullying or offenses through irony, the challenge of which will constitute an important
area for future work. Error rates on the different author roles in our corpus revealed that espe-
cially victims are hard to recognise, as they react differently in our corpus, showing either pow-
erlessness facing the bully or reacting in an assertive and sometimes even aggressive way.
As shown in [45] deep representation learning is a promising avenue for this task. We
therefore intent to apply deep learning techniques to improve classifier performance.
Another interesting direction for future work would be the detection of fine-grained cyber-
bullying categories such as threats, curses and expressions of racism and hate. When applied in
a cascaded model, the system could find severe cases of cyberbullying with high precision. This
would be particularly interesting for monitoring purposes. Additionally, our dataset allows for
detection of participant roles typically involved in cyberbullying. When applied as moderation
support on online platforms, such a system enables feedback in function of the recipient (i.e. a
bully, victim, or bystander).
Author Contributions
Conceptualization: Cynthia Van Hee, Gilles Jacobs, Chris Emmery, Bart Desmet, Els Lefever,
Ben Verhoeven, Guy De Pauw, Walter Daelemans, Véronique Hoste.
Data curation: Cynthia Van Hee, Gilles Jacobs, Els Lefever, Ben Verhoeven, Guy De Pauw.
Funding acquisition: Walter Daelemans, Véronique Hoste.
Methodology: Cynthia Van Hee, Gilles Jacobs, Bart Desmet, Els Lefever, Walter Daelemans,
Véronique Hoste.
Project administration: Walter Daelemans, Véronique Hoste.
Resources: Cynthia Van Hee, Gilles Jacobs, Els Lefever.
Software: Cynthia Van Hee, Gilles Jacobs, Bart Desmet.
Supervision: Walter Daelemans, Véronique Hoste.
Writing – original draft: Cynthia Van Hee, Gilles Jacobs.
Writing – review & editing: Cynthia Van Hee, Gilles Jacobs, Chris Emmery, Bart Desmet, Els
Lefever, Ben Verhoeven, Guy De Pauw, Walter Daelemans, Véronique Hoste.
References 1. Livingstone S, Haddon L, Görzig A, Ólafsson K. Risks and safety on the internet: The perspective of
European children. Initial Findings. London: EU Kids Online; 2010.
2. Tokunaga RS. Following You Home from School: A Critical Review and Synthesis of Research on
Cyberbullying Victimization. Computers in Human Behavior. 2010; 26(3):277–287. https://doi.org/10.
1016/j.chb.2009.11.014
3. Mckenna KY, Bargh JA. Plan 9 From Cyberspace: The Implications of the Internet for Personality and
Social Psychology. Personality & Social Psychology Review. 1999; 4(1):57–75. https://doi.org/10.1207/
S15327957PSPR0401_6
4. Gross EF, Juvonen J, Gable SL. Internet Use and Well-Being in Adolescence. Journal of Social Issues.
2002; 58(1):75–90. https://doi.org/10.1111/1540-4560.00249
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 18 / 22
5. Juvonen J, Gross EF. Extending the school grounds?—Bullying experiences in cyberspace. Journal of
School Health. 2008; 78(9):496–505. https://doi.org/10.1111/j.1746-1561.2008.00335.x PMID:
18786042
6. Hinduja S, Patchin JW. Bullies Move Beyond the Schoolyard: A Preliminary Look at Cyberbullying.
Youth Violence And Juvenile Justice. 2006; 4(2):148–169. https://doi.org/10.1177/1541204006286288
7. Van Cleemput K, Bastiaensens S, Vandebosch H, Poels K, Deboutte G, DeSmet A, et al. Zes jaar
onderzoek naar cyberpesten in Vlaanderen, België en daarbuiten: een overzicht van de bevindingen.
(Six years of research on cyberbullying in Flanders, Belgium and beyond: an overview of the findings.)
(White Paper). University of Antwerp & Ghent University; 2013.
8. Livingstone S, Haddon L, Vincent J, Giovanna M, Ólafsson K. Net Children Go Mobile: The Uk report;
2014. Available from: http://netchildrengomobile.eu/reports. [Accessed 30th March 2018].
9. O’Moore M, Kirkham C. Self-esteem and its relationship to bullying behaviour. Aggressive Behavior.
2001; 27(4):269–283. https://doi.org/10.1002/ab.1010
10. Fekkes M, Pijpers FIM, Fredriks AM, Vogels T, Verloove-Vanhorick SP. Do Bullied Children Get Ill, or
Do Ill Children Get Bullied? A Prospective Cohort Study on the Relationship Between Bullying and
Health-Related Symptoms. Pediatrics. 2006; 117(5):1568–1574. https://doi.org/10.1542/peds.2005-
0187 PMID: 16651310
11. Cowie H. Cyberbullying and its impact on young people’s emotional health and well-being. The Psychia-
trist. 2013; 37(5):167–170. https://doi.org/10.1192/pb.bp.112.040840
12. Price M, Dalgleish J. Cyberbullying: Experiences, Impacts and Coping Strategies as Described by Aus-
tralian Young People. Youth Studies Australia. 2010; 29(2):51–59.
13. Van Royen K, Poels K, Daelemans W, Vandebosch H. Automatic monitoring of cyberbullying on social
networking sites: From technological feasibility to desirability. Telematics and Informatics. 2014;.
14. Cortes C, Vapnik V. Support-Vector Networks. Machine Learning. 1995; 20(3):273–297. https://doi.org/
10.1007/BF00994018
15. Chang CC, Lin CJ. LIBSVM: A Library for Support Vector Machines. ACM Transactions on Intelligent
Systems and Technology (TIST). 2011; 2(3):27:1–27:27.
16. Dalla Pozza V, Di Pietro A, Morel S, Emma P. Cyberbullying among Young People. European Parlia-
ment; 2016.
17. Olweus D. Bullying at School: What We Know and What We Can Do. 2nd ed. Wiley; 1993.
18. Smith PK, Mahdavi J, Carvalho M, Fisher S, Russell S, Tippett N. Cyberbullying: its nature and impact
in secondary school pupils. Journal of Child Psychology and Psychiatry. 2008; 49(4):376–385. https://
doi.org/10.1111/j.1469-7610.2007.01846.x PMID: 18363945
19. Vandebosch H, Van Cleemput K. Defining cyberbullying: a qualitative research into the perceptions of
youngsters. Cyberpsychology and behavior: the impact of the Internet, multimedia and virtual reality on
behavior and society. 2008; 11(4):499–503. https://doi.org/10.1089/cpb.2007.0042
20. Vandebosch H, Van Cleemput K. Cyberbullying among youngsters: profiles of bullies and victims. New
Media & Society. 2009; 11(8):1349–1371. https://doi.org/10.1177/1461444809341263
21. Dooley JJ, Cross D. Cyberbullying versus face-to-face bullying: A review of the similarities and differ-
ences. Journal of Psychology. 2009; 217:182–188.
22. Slonje R, Smith PK, Frisén A. The Nature of Cyberbullying, and Strategies for Prevention. Compututers
in Human Behavior. 2013; 29(1):26–32. https://doi.org/10.1016/j.chb.2012.05.024
23. Van Hee C, Verhoeven B, Lefever E, De Pauw G, Daelemans W, Hoste V. Guidelines for the Fine-
Grained Analysis of Cyberbullying, version 1.0. LT3, Language and Translation Technology Team–
Ghent University; 2015. LT3 15-01.
24. Salmivalli C, Lagerspetz K, Björkqvist K, Österman K, Kaukiainen A. Bullying as a group process: Par-
ticipant roles and their relations to social status within the group. Aggressive Behavior. 1996; 22(1):1–
15. https://doi.org/10.1002/(SICI)1098-2337(1996)22:1%3C1::AID-AB1%3E3.0.CO;2-T
25. Vandebosch H, Van Cleemput K, Mortelmans D, Walrave M. Cyberpesten bij jongeren in Vlaanderen:
Een studie in opdracht van het viWTA (Cyberbullying among youngsters in Flanders: a study commis-
soned by the viWTA). Brussels: viWTA; 2006. Available from: https://wise.vub.ac.be/fattac/mios/
Eindrapport%20cyberpesten%20viwta%202006.pdf. [Accessed 30th March 2018].
26. Dehue F, Bolman C, Völlink T. Cyberbullying: Youngsters’ Experiences and Parental Perception.
Cyberpsychology and Behavior. 2008; 11(2):217–223. https://doi.org/10.1089/cpb.2007.0008 PMID:
18422417
27. Salmivalli C, Pöoyhönen V. In: Cyberbullying in Finland. 2nd ed. Wiley-Blackwell; 2012. p. 57–72.
28. Salawu S, he Y, Lumsden J. Approaches to Automated Detection of Cyberbullying: A Survey. IEEE
Transactions on Affective Computing. forthcoming;p. 1.
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 19 / 22
29. Reynolds K, Kontostathis A, Edwards L. Using Machine Learning to Detect Cyberbullying. In: Proceed-
ings of the 2011 10th International Conference on Machine Learning and Applications and Workshops.
ICMLA’11. Washington, DC, USA: IEEE Computer Society; 2011. p. 241–244.
30. Dinakar K, Reichart R, Lieberman H. Modeling the Detection of Textual Cyberbullying. In: The Social
Mobile Web. vol. WS-11-02 of AAAI Workshops. AAAI; 2011. p. 11–17.
31. Yin D, Davison BD, Xue Z, Hong L, Kontostathis A, Edwards L. Detection of Harassment on Web 2.0.
In: Proceedings of the Content Analysis in the Web 2.0 (CAW2.0). Madrid, Spain; 2009.
32. Dadvar M. Experts and machines united against cyberbullying [PhD thesis]. University of Twente; 2014.
33. Nahar V, Al-Maskari S, Li X, Pang C. Semi-supervised Learning for Cyberbullying Detection in Social
Networks. In: ADC.Databases Theory and Applications. Springer International Publishing; 2014. p.
160–171.
34. Bayzick J, Kontostathis A, Edwards L. Detecting the Presence of Cyberbullying Using Computer Soft-
ware. In: Proceedings of the 3rd International Web Science Conference. WebSci11; 2011.
35. Sui J. Understanding and Fighting Bullying with Machine Learning [PhD thesis]. Department of Com-
puter Sciences, University of Wisconsin-Madison; 2015.
36. Galán-Garcı́a P, Puerta JGdl, Gómez CL, Santos I, Bringas PG. Supervised machine learning for the
detection of troll profiles in twitter social network: application to a real case of cyberbullying. Logic Jour-
nal of the IGPL. 2016; 24(1):42–53. Available from: http://dx.doi.org/10.1093/jigpal/jzv048.
37. Raisi E, Huang B. Cyberbullying Detection with Weakly Supervised Machine Learning. In: Proceedings
of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
2017. ASONAM’17. New York, NY, USA: ACM; 2017. p. 409–416.
38. Hosseinmardi H. Survey of Computational Methods in Cyberbullying Research. In: Proceedings of the
First International Workshop on Computational Methods for CyberSafety. CyberSafety’16. New York,
NY, USA: ACM; 2016. p. 4–4.
39. Nandhini B Sri, Sheeba JI. Online Social Network Bullying Detection Using Intelligence Techniques.
Procedia Computer Science. 2015; 45:485–492. https://doi.org/10.1016/j.procs.2015.03.085
40. Chavan VS, S SS. Machine learning approach for detection of cyber-aggressive comments by peers on
social media network. In: 2015 International Conference on Advances in Computing, Communications
and Informatics (ICACCI); 2015. p. 2354–2358.
41. Van Hee C, Lefever E, Verhoeven B, Mennes J, Desmet B, De Pauw G, et al. Detection and fine-
grained classification of cyberbullying events. In: Angelova G, Bontcheva K, Mitkov R, editors. Proceed-
ings of Recent Advances in Natural Language Processing, Proceedings; 2015. p. 672–680.
42. Van Royen K, Poels K, Vandebosch H. Harmonizing freedom and protection: Adolescents’ voices on
automatic monitoring of social networking sites. Children and Youth Services Review. 2016; 64:35–41.
https://doi.org/10.1016/j.childyouth.2016.02.024
43. Zhao R, Zhou A, Mao K. Automatic Detection of Cyberbullying on Social Networks Based on Bullying
Features. In: Proceedings of the 17th International Conference on Distributed Computing and Network-
ing. No. 43 in ICDCN’16. New York, NY, USA: ACM; 2016. p. 43:1–43:6.
44. Xu JM, Jun KS, Zhu X, Bellmore A. Learning from Bullying Traces in Social Media. In: Proceedings of
the 2012 Conference of the North American Chapter of the Association for Computational Linguistics:
Human Language Technologies. NAACL HLT’12. Stroudsburg, PA, USA: Association for Computa-
tional Linguistics; 2012. p. 656–666.
45. Zhao R, Mao K. Cyberbullying Detection Based on Semantic-Enhanced Marginalized Denoising Auto-
Encoder. EEE Transactions on Affective Computing. 2017; 8(3):328–339. https://doi.org/10.1109/
TAFFC.2016.2531682
46. Huang Q, Singh VK, Atrey PK. Cyber Bullying Detection Using Social and Textual Analysis. In: Pro-
ceedings of the 3rd International Workshop on Socially-Aware Multimedia. SAM’14. New York, NY,
USA: ACM; 2014. p. 3–6.
47. Al-garadi MA, Dewi Varathan K, Devi Ravana S. Cybercrime detection in online communications: The
experimental case of cyberbullying detection in the Twitter network. Computers in Human Behavior.
2016; 63:433–443. https://doi.org/10.1016/j.chb.2016.05.051
48. Squicciarini A, Rajtmajer S, Liu Yh, Griffin C. Identification and Characterization of Cyberbullying
Dynamics in an Online Social Network. In: Proceedings of the 2015 IEEE/ACM International Confer-
ence on Advances in Social Networks Analysis and Mining 2015. ASONAM’15. New York, NY, USA:
ACM; 2015. p. 280–285.
49. Chatzakou D, Kourtellis N, Blackburn J, De Cristofaro E, Stringhini G, Vakali A. Mean Birds: Detecting
Aggression and Bullying on Twitter. In: Proceedings of the 2017 ACM on Web Science Conference.
WebSci’17. New York, NY, USA: ACM; 2017. p. 13–22.
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 20 / 22
50. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer PW. SMOTE: Synthetic Minority Over-sampling Tech-
nique. Journal of Artificial Intelligence Research (JAIR). 2002; 16:321–357. https://doi.org/10.1613/jair.
953
51. Van Hee C, Lefever E, Verhoeven B, Mennes J, Desmet B, De Pauw G, et al. Automatic detection and
prevention of cyberbullying. In: Lorenz P, Bourret C, editors. International Conference on Human and
Social Analytics, Proceedings. IARIA; 2015. p. 13–18.
52. Hinduja S, Patchin JW. Cyberbullying: Neither an epidemic nor a rarity. European Journal of Develop-
mental Psychology. 2012; 9(5):539–543. https://doi.org/10.1080/17405629.2012.706448
53. Slonje R, Smith PK. Cyberbullying: Another main type of bullying? Scandinavian Journal of Psychology.
2008; 49(2):147–154. https://doi.org/10.1111/j.1467-9450.2007.00611.x PMID: 18352984
54. O’Sullivan PB, Flanagin AJ. Reconceptualizing ‘flaming’ and other problematic messages. New Media
& Society. 2003; 5(1):69–94. https://doi.org/10.1177/1461444803005001908
55. Willard NE. Cyberbullying and Cyberthreats: Responding to the Challenge of Online Social Aggression,
Threats, and Distress. 2nd ed. Research Publishers LLC; 2007.
56. Bastiaensens S, Vandebosch H, Poels K, Van Cleemput K, DeSmet A, De Bourdeaudhuij I. Cyberbully-
ing on social network sites. An experimental study into bystanders’ behavioural intentions to help the
victim or reinforce the bully. Computers in Human Behavior. 2014; 31:259–271. https://doi.org/10.1016/
j.chb.2013.10.036
57. Salmivalli C. Bullying and the peer group: A review. Aggression and Violent Behavior. 2010; 15(2):112–
120. https://doi.org/10.1016/j.avb.2009.08.007
58. Bastiaensens S, Vandebosch H, Poels K, Van Cleemput K, DeSmet A, De Bourdeaudhuij I. ‘Can I
afford to help?’ How affordances of communication modalities guide bystanders’ helping intentions
towards harassment on social network sites. Behaviour & Information Technology. 2015; 34(4):425–
435. https://doi.org/10.1080/0144929X.2014.983979
59. Salmivalli C, Voeten M, Poskiparta E. Bystanders Matter: Associations Between Reinforcing, Defend-
ing, and the Frequency of Bullying Behavior in Classrooms. Journal of Clinical Child & Adolescent Psy-
chology. 2011; 40(5):668–676. https://doi.org/10.1080/15374416.2011.597090
60. Stenetorp P, Pyysalo S, Topić G, Ohta T, Ananiadou S, Tsujii J. brat: a Web-based Tool for NLP-Assis- ted Text Annotation. In: Proceedings of the Demonstrations Session at EACL 2012. Avignon, France;
2012. p. 102–107.
61. Cohen J. A coefficient of agreement for nominal scales. Educational and Psychological Measurement.
1960; 20(1):37–46. https://doi.org/10.1177/001316446002000104
62. Fleiss JL. Measuring nominal scale agreement among many raters. Psychological Bulletin. 1971; 76
(5):378–382. https://doi.org/10.1037/h0031619
63. McHugh ML. Interrater reliability: the kappa statistic. Biochemia Medica. 2012; 22(3):276–282. https://
doi.org/10.11613/BM.2012.031 PMID: 23092060
64. He H, Garcia EA. Learning from imbalanced data. IEEE Transactions on knowledge and data engineer-
ing. 2009; 21(9):1263–1284. https://doi.org/10.1109/TKDE.2008.239
65. Fan RE, Chang KW, Hsieh CJ, Wang XR, Lin CJ. LIBLINEAR: A Library for Large Linear Classification.
Journal of Machine Learning Research. 2008; 9:1871–1874.
66. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine
Learning in Python. Journal of Machine Learning Research. 2011; 12:2825–2830.
67. Desmet B. Finding the online cry for help: automatic text classification for suicide prevention [PhD the-
sis]. Ghent University; 2014.
68. Hoste V. Optimization Issues in Machine Learning of Coreference Resolution [PhD thesis]. Antwerp
University; 2005.
69. van de Kauter M, Coorman G, Lefever E, Desmet B, Macken L, Hoste V. LeTs Preprocess: The multilin-
gual LT3 linguistic preprocessing toolkit. Computational Linguistics in the Netherlands Journal. 2013;
3:103–120.
70. Osgood CE, Walker EG. Motivation and language behavior: A content analysis of suicide notes. The
Journal of Abnormal and Social Psychology. 1959; 59(1):58. https://doi.org/10.1037/h0047078
71. Jijkoun V, Hofmann K. Generating a Non-English Subjectivity Lexicon: Relations That Matter. In: Pro-
ceedings of the 12th Conference of the European Chapter of the Association for Computational Linguis-
tics. Stroudsburg, PA, USA; 2009. p. 398–405.
72. De Smedt T, Daelemans W. “Vreselijk mooi!” (“Terribly Beautiful!”): A Subjectivity Lexicon for Dutch
Adjectives. In: Proceedings of the Eight International Conference on Language Resources and Evalua-
tion. LREC’12. Istanbul, Turkey; 2012. p. 3568–3572.
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 21 / 22
73. Hu M, Liu B. Mining and summarizing customer reviews. In: Proceedings of the 10th ACM SIGKDD
international conference on Knowledge discovery and data mining. KDD04. ACM; 2004. p. 168–177.
74. Wilson T, Wiebe J, Hoffmann P. Recognizing Contextual Polarity in Phrase-level Sentiment Analysis.
In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural
Language Processing. HLT’05. Association for Computational Linguistics; 2005. p. 347–354.
75. Stone PJ, Dunphy DCD, Smith MS, Ogilvie DM. The General Inquirer: A Computer Approach to Content
Analysis. The MIT Press; 1966.
76. Nielsen FÅ. A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs. In: Rowe M, Stankovic M, Dadzie AS, Hardey M, editors. Proceedings of the ESWC2011 Workshop on ‘Making
Sense of Microposts’: Big things come in small packages. vol. 718 of CEUR Workshop Proceedings.
CEUR-WS.org; 2011. p. 93–98.
77. Mohammad S, Dunne C, Dorr B. Generating High-coverage Semantic Orientation Lexicons from
Overtly Marked Words and a Thesaurus. In: Proceedings of the 2009 Conference on Empirical Methods
in Natural Language Processing: Volume 2. EMNLP’09. Stroudsburg, PA, USA: Association for Compu-
tational Linguistics; 2009. p. 599–608.
78. Pennebaker JW, Francis ME, Booth RJ. Linguistic Inquiry and Word Count: LIWC 2001. Mahwah, NJ:
Lawrence Erlbaum Associates; 2001.
79. Zijlstra H, Van Meerveld T, Van Middendorp H, Pennebaker JW, Geenen R. De Nederlandse versie van
de ‘linguistic inquiry and word count’ (LIWC). Gedrag Gezond. 2004; 32:271–281.
80. Rehurek R, Sojka P. Software framework for topic modelling with large corpora. In: The LREC 2010
Workshop on new Challenges for NLP Frameworks. University of Malta; 2010. p. 45–50.
81. Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation. Journal of Machine Learning Research. 2003;
3:993–1022.
82. Deerwester S, Dumais ST, Furnas GW, Landauer TK, Harshman R. Indexing by latent semantic analy-
sis. Journal of the American Society for Information Science. 1990; 41:391–407. https://doi.org/10.
1002/(SICI)1097-4571(199009)41:6%3C391::AID-ASI1%3E3.0.CO;2-9
83. Baroni M, Bernardini S. BootCaT: Bootstrapping Corpora and Terms from the Web. In: Proceedings of
the Fourth International Conference on Language Resources and Evaluation. LREC’04; 2004. p. 1313–
1316.
84. Fawcett T. An introduction to ROC analysis. Pattern recognition letters. 2006; 27(8):861–874. https://
doi.org/10.1016/j.patrec.2005.10.010
Automatic detection of cyberbullying in social media text
PLOS ONE | https://doi.org/10.1371/journal.pone.0203794 October 8, 2018 22 / 22
Copyright of PLoS ONE is the property of Public Library of Science and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use.