Pinned
-
LSLM-Listening-while-Speaking-Language-Model
LSLM implements full-duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances human-computer interaction through real-time spoken dialogue…
-
Self-Correcting-LLM--Reinforcement-Learning-
This is my attempt to create a self-correcting LLM based on the paper "Training Language Models to Self-Correct via Reinforcement Learning" by Google.
-
THINKING-LLMS
This is based on the paper "Thinking LLMs: General Instruction Following with Thought Generation". I might add new material that is not related to the paper.
-
Self-Taught-Evaluator
This is based on the paper "Self-Taught Evaluators".