AJOU Central Library Repository: Explainable Hate Speech Detection through Masked Rationale Prediction

BROWSE

Graduate School of Ajou University Department of Artificial Intelligence 3. Theses(Master)

Explainable Hate Speech Detection through Masked Rationale Prediction

DC Field	Value	Language
dc.contributor.advisor	손경아	-
dc.contributor.author	김지윤	-
dc.date.accessioned	2022-11-29T03:01:24Z	-
dc.date.available	2022-11-29T03:01:24Z	-
dc.date.issued	2022-08	-
dc.identifier.other	32114	-
dc.identifier.uri	https://dspace.ajou.ac.kr/handle/2018.oak/21125	-
dc.description	학위논문(석사)--아주대학교 일반대학원 :인공지능학과,2022. 8	-
dc.description.tableofcontents	제1장 Introduction 1 제2장 Related Works 5 제1절 Hate Speech Detection 5 제2절 Pre-finetuning on an intermediate task 5 제3절 Explainable NLP and rationale 6 제3장 Method 7 제1절 Task 7 제2절 Masked rationale prediction 8 제3절 Hate speech detection 10 제4장 Experiments 11 제1절 Dataset 11 제2절 Metrics 12 제3절 Models and Experimental settings 14 제4절 Comparisons of results 15 제5절 Qualitative results 19 제5장 Conclusion 21 제6장 References 22	-
dc.language.iso	eng	-
dc.publisher	The Graduate School, Ajou University	-
dc.rights	아주대학교 논문은 저작권에 의해 보호받습니다.	-
dc.title	Explainable Hate Speech Detection through Masked Rationale Prediction	-
dc.type	Thesis	-
dc.contributor.affiliation	아주대학교 일반대학원	-
dc.contributor.department	일반대학원 인공지능학과	-
dc.date.awarded	2022. 8	-
dc.description.degree	Master	-
dc.identifier.localId	1254222	-
dc.identifier.uci	I804:41038-000000032114	-
dc.identifier.url	https://dcoll.ajou.ac.kr/dcollection/common/orgView/000000032114	-
dc.subject.keyword	Explainable NLP	-
dc.subject.keyword	Hate speech detection	-
dc.subject.keyword	Rationale	-
dc.description.alternativeAbstract	Hate speech detection is important in that the spread of hate speech strengthens critical social discrimination against its target social group not only online but also in the real world. We propose Masked Rationale Prediction (MRP) to improve the performance of hate speech detection considering two important aspects—the model bias and explainability. Understanding the context of hate speech is important for hate speech detection. Hate speech cannot be identified based solely on the presence of specific words considered hateful. However, existing models are easily biased on the specific expressions and make wrong detection results. Even though they correctly predict, the model rationale is often not explained in a convincing manner. Thus, to implement a hate speech detection model, bias and explainability should be considered. MRP is a task to predict the masked human rationales—snippets of a sentence that are grounds for human judgment—by referring to surrounding tokens combined with their unmasked rationales. the human rationales are randomly masked and inputted into the model by being combined with each of the tokens. We pre-finetune a pre-trained model on MRP as an intermediate task and then finetune on hate speech detection. As the model learns its reasoning ability based on rationales by MRP, it performs hate speech detection robustly in terms of bias and explainability. The proposed method generally achieves state-of-the-art performance in various metrics, demonstrating its effectiveness for hate speech detection.	-

Appears in Collections:: Graduate School of Ajou University > Department of Artificial Intelligence > 3. Theses(Master)

Files in This Item:: There are no files associated with this item.

Show simple item record

qrcode

트윗하기

License

STATISTICS: Total Visit :3,725,162; Total Download :1,818; Today View :2,820

AJOU Central Library Repository는 국립중앙도서관 OAK 보급사업으로 구축되었습니다.

BROWSE

Browse