This colloquium consists of two talks:
The blessing and the curse of lexical information in author profiling (Malvina Nissim)
Automatic Misogyny Identification (AMI) in Twitter (Paolo Rosso)
Below you can find more detailed information about the presentations and the speakers.
The blessing and the curse of lexical information in author profiling
State-of-the-art performance on author profiling (e.g., gender and age) for English in social media is around 80%. How realistic is this figure? Although it is known that function words are good predictors for this task, most best performing systems are trained on word and character n-grams, thus heavily leveraging lexical information, too. And it works. But because of this, shifts of topic or domain affect performance seriously. If we create conditions to reduce the influence of lexical information, we might be able to identify features that are indeed more relevant for profiling. I will describe experiments where we get rid of lexical information still preserving some linguistic structure and see how well we can do with what's left. We stretch this to see whether free of lexical information we can perform cross-language profiling, and we also compare our models to human performance in quite an interesting way!
Malvina Nissim is Associate Professor in Language Technology at the University of Groningen. She has extensive experience in sentiment analysis and author identification and profiling, as well as in modelling the interplay of lexical semantics and pragmatics, especially regarding the computational treatment of figurative language and modality. She is the author of 100+ publications in international venues, is a member of the main associations in the field, and annually reviews for the major conferences and journals. She has recently co-chaired the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017), and she is the general chair of the Seventh Joint Conference on Lexical and Computational Semantics (*SEM 2018). She is also active in the field of resource and system evaluation, as both organizer and participant of shared tasks, and is interested in the philosophy behind them. She graduated in Linguistics from the University of Pisa and obtained her PhD in Linguistics from the University of Pavia. Before joining the University of Groningen, she was a tenured researcher at the University of Bologna (2006-2014), and a post-doc at the Institute for Cognitive Science and Technology of the National Research Council in Rome (2006) and at the University of Edinburgh (2001-2005). In 2017, she was elected as the 2016 University of Groningen Lecturer of the Year.
Automatic Misogyny Identification (AMI) in Twitter
Hate speech may take different forms in online social media. Most of the investigations in the literature are focused on detecting abusive language in discussions about religion, immigration and sexual orientation. In this talk, I will address the problem of automatic misogyny identification (AMI) in Twitter, which entails identifying cases of aggressiveness and hate speech towards women. Moreover, I will also present the results obtained at two shared tasks on exactly this topic: AMI-IberEval-2018 (https://amiibereval2018.wordpress.com/) on Spanish and English Twitter data, and AMI-EvalIta-2018 (https://amievalita2018.wordpress.com/) on English and Italian Twitter data.
Paolo Rosso is an Associate Professor at the Technical University of Valencia. He obtained a PhD in Computer Science from Trinity College University in Dublin in 1999. His work centers on Natural Language Processing and Information Retrieval in social media and he has extensive experience in the field of author profiling and opinion mining. He has actively contributed to the field of controlled system evaluation and shared tasks as Deputy Steering Committee Chair for the Conference at the CLEF Intitiative and by chairing the PAN shared task collective for the past nine editions (2009-2018). He was recently part of the Organizing Committee for the European Chapter of the Association for Computational Linguistics (EACL 2017) and has taken up roles as editor and reviewer of many notable conferences and journals in computational linguistics.