Research Article Open Access Double-Blind Peer Review

CUSTOMIZED WEB APPLICATION FOR ADDRESSING LANGUAGE MODEL MISALIGNMENT THROUGH REINFORCEMENT LEARNING FROM HUMAN FEEDBACK

Chinonso Emmanuel Okeke·Emmanuel Joseph Essien
Published 17 January 2025
Vol. 12, No. 1 (2024)
pp. 30-39
CC BY 4.0
  1. 1
    Chinonso Emmanuel Okeke
    Department of Technical Education, Ignatius Ajuru University of Education, Port Harcourt Rivers State, Nigeria
    NG
  2. 2
    Emmanuel Joseph Essien
    Department of Computer Science, Akwa Ibom State University, Mkpat Enin, Nigeria.
    NG

Recent years have seen tremendous progress in the field of artificial intelligence, which has sparked the creation of cutting-edge tools like OpenAI ChatGPT. The OpenAI GPT -3 family of big language models serves as the foundation for ChatGPT, which is enhanced through the use of supervised and reinforcement learning methodologies. Its goal is to produce text that can't be distinguished from human-written information. It can hold conversations with users in a way that is surprisingly clear-cut and uncomplicated. Reinforcement Learning from Human Feedback (RLHF) is the technique employed. Human input and machine learning methods (Supervised Learning) are used to train the model. It is employed in the training phases to reduce biased, damaging, and false outputs. The resulting InstructGPT models are much better at following instructions than GPT-3. Above all, customized ChatGPT web application that can fine-tune a given input and generate text that is of high quality, harmless, truthful and appropriate, without biased outputs. A key motivation for our work is to increase helpfulness and truthfulness output while mitigating the harms and biases of language models. In conclusion, our results show that reinforcement learning from human feedback (RLHF) techniques is effective at significantly improving the alignment of general-purpose AI systems with human intentions

JournalCurrent Research and Innovations Journal
ISSN3065-0712
Volume / IssueVol. 12, No. 1 (2024)
Pages30-39
Published17 January 2025
DOI10.5281/zenodo.14677340
Access Open Access
LicenseCC BY 4.0 — reuse with attribution
PublisherKeith Publications
Okeke , C., Essien, E. (2025). CUSTOMIZED WEB APPLICATION FOR ADDRESSING LANGUAGE MODEL MISALIGNMENT THROUGH REINFORCEMENT LEARNING FROM HUMAN FEEDBACK. Current Research and Innovations Journal, Vol. 12 No. 1, pp. 30-39. DOI: https://doi.org/10.5281/zenodo.14677340

 Submit Your Research to Current Research and Innovations Journal

We invite original research articles, review papers, and case studies. Benefit from rigorous double-blind peer review, rapid decision within 4–8 weeks, DOI for every article, and worldwide open-access distribution.