GenAI Reliability in Content Analysis: Assessing Agreement Between LLMs in Measuring Discursive Violence

This study investigates the reliability of three leading large language models (LLMs), ChatGPT 4.5, Claude 3.7 Sonnet, and Gemini 2.0 Flash, in measuring discursive violence against women in Eminem's lyrics. Through a three-phase experimental design, we assessed both inter-coder reliability bet...

Celý popis

Uložené v:

Podrobná bibliografia
Vydané v:	International Conference on Control Systems and Computer Science (Online) s. 604 - 611
Hlavní autori:	Rughinis, Cosima, Dascalu, Mihai, Rasnayake, Susantha
Médium:	Konferenčný príspevok..
Jazyk:	English
Vydavateľské údaje:	IEEE 27.05.2025
Predmet:	AI-assisted research Artificial intelligence Chatbots Content management Correlation Discursive violence Engines Generative AI Generative content analysis Inter-coder reliability Large language models Measurement fidelity Particle measurements Reliability Reliability engineering
ISSN:	2379-0482
On-line prístup:	Získať plný text
Tagy:	Pridať tag Žiadne tagy, Buďte prvý, kto otaguje tento záznam!

Buďte prvý, kto okomentuje tento záznam!