nits_teja_srikar at GenAI Detection Task 2: Distinguishing Human and AI-Generated Essays Using Machine Learning and Transformer Models
Sai Teja Lekkala, Annepaka Yadagiri, Mangadoddi Srikar Vardhan, Partha Pakray
Abstract
This paper presents models to differentiate between human-written and AI-generated essays, addressing challenges posed by advanced AI models like ChatGPT and Claude. Using a structured dataset, we fine-tune multiple machine learning models, including XGBoost and Logistic Regression, along with ensemble learning and k-fold cross-validation. The dataset is processed through TF-IDF vectorization, followed by text cleaning, lemmatization, stemming, and part-of-speech tagging before training. Our team nits_teja_srikar achieves high accuracy, with DistilBERT performing at 77.3% accuracy, standing at 20th position for English, and XLM-RoBERTa excelling in Arabic at 92.2%, standing at 14th position in the official leaderboard, demonstrating the model’s potential for real-world applications.- Anthology ID:
- 2025.genaidetect-1.30
- Volume:
- Proceedings of the 1stWorkshop on GenAI Content Detection (GenAIDetect)
- Month:
- January
- Year:
- 2025
- Address:
- Abu Dhabi, UAE
- Editors:
- Firoj Alam, Preslav Nakov, Nizar Habash, Iryna Gurevych, Shammur Chowdhury, Artem Shelmanov, Yuxia Wang, Ekaterina Artemova, Mucahid Kutlu, George Mikros
- Venues:
- GenAIDetect | WS
- SIG:
- Publisher:
- International Conference on Computational Linguistics
- Note:
- Pages:
- 278–283
- Language:
- URL:
- https://preview.aclanthology.org/Author-page-Marten-During-lu/2025.genaidetect-1.30/
- DOI:
- Cite (ACL):
- Sai Teja Lekkala, Annepaka Yadagiri, Mangadoddi Srikar Vardhan, and Partha Pakray. 2025. nits_teja_srikar at GenAI Detection Task 2: Distinguishing Human and AI-Generated Essays Using Machine Learning and Transformer Models. In Proceedings of the 1stWorkshop on GenAI Content Detection (GenAIDetect), pages 278–283, Abu Dhabi, UAE. International Conference on Computational Linguistics.
- Cite (Informal):
- nits_teja_srikar at GenAI Detection Task 2: Distinguishing Human and AI-Generated Essays Using Machine Learning and Transformer Models (Lekkala et al., GenAIDetect 2025)
- PDF:
- https://preview.aclanthology.org/Author-page-Marten-During-lu/2025.genaidetect-1.30.pdf