CUSTOMIZED WEB APPLICATION FOR ADDRESSING LANGUAGE MODEL MISALIGNMENT THROUGH REINFORCEMENT LEARNING FROM HUMAN FEEDBACK

Okeke, Chinonso Emmanuel; Essien, Emmanuel Joseph

doi:10.5281/zenodo.14677340

Research Article Open Access Double-Blind Peer Review

Published 17 January 2025

Vol. 12, No. 1 (2024)

pp. 30-39

CC BY 4.0

Authors & Affiliations

1. Chinonso Emmanuel Okeke
Department of Technical Education, Ignatius Ajuru University of Education, Port Harcourt, Rivers State, Nigeria

2. Emmanuel Joseph Essien
Department of Computer Science, Akwa Ibom State University, Mkpat Enin, Nigeria

Abstract

Recent years have seen tremendous progress in artificial intelligence, which has led to cutting-edge tools such as OpenAI's ChatGPT. ChatGPT is built on the OpenAI GPT-3 family of large language models and is enhanced with supervised and reinforcement learning methods. Its goal is to produce text that cannot be distinguished from human-written text, and it can hold conversations with users in a clear and uncomplicated way. The technique employed is Reinforcement Learning from Human Feedback (RLHF), in which the model is trained using human input together with supervised learning. RLHF is applied during training to reduce biased, harmful, and false outputs. The resulting InstructGPT models are much better at following instructions than GPT-3. Above all, we present a customized ChatGPT web application that can fine-tune a given input and generate text that is high-quality, harmless, truthful, and appropriate, without biased outputs. A key motivation for our work is to increase the helpfulness and truthfulness of outputs while mitigating the harms and biases of language models. In conclusion, our results show that RLHF techniques are effective at significantly improving the alignment of general-purpose AI systems with human intentions.
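The abstract does not spell out how human feedback enters training. As a minimal sketch, assuming the commonly used RLHF recipe in which a reward model is fitted to pairwise human preferences before the reinforcement learning step, the preference loss could look like the following. The function names and the example scores are hypothetical and are not taken from the article.

```python
import math


def preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Bradley-Terry style loss: -log(sigmoid(r_chosen - r_rejected)).

    The loss is small when the reward model scores the human-preferred
    response higher than the rejected one, and large otherwise.
    """
    margin = reward_chosen - reward_rejected
    # -log(sigmoid(m)) equals log(1 + exp(-m)); branch to avoid overflow.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


def mean_preference_loss(pairs):
    """Average the loss over (reward_chosen, reward_rejected) pairs."""
    pairs = list(pairs)
    if not pairs:
        raise ValueError("at least one preference pair is required")
    return sum(preference_loss(c, r) for c, r in pairs) / len(pairs)


if __name__ == "__main__":
    # Hypothetical reward-model scores for three human-labelled comparisons.
    scored_pairs = [(1.2, 0.3), (0.1, 0.4), (2.0, -1.0)]
    print(mean_preference_loss(scored_pairs))
```

In this sketch, minimizing the loss pushes the reward model to agree with the human rankings; the fitted reward would then serve as the training signal for the reinforcement learning stage.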

Article Information

Journal	Current Research and Innovations Journal
ISSN	3065-0712
Volume / Issue	Vol. 12, No. 1 (2024)
Pages	30-39
Published	17 January 2025
DOI	10.5281/zenodo.14677340
Access	Open Access
License	CC BY 4.0 — reuse with attribution
Publisher	Keith Publications

How to Cite

Okeke, C., & Essien, E. (2025). CUSTOMIZED WEB APPLICATION FOR ADDRESSING LANGUAGE MODEL MISALIGNMENT THROUGH REINFORCEMENT LEARNING FROM HUMAN FEEDBACK. Current Research and Innovations Journal, Vol. 12, No. 1, pp. 30-39. DOI: https://doi.org/10.5281/zenodo.14677340
