#

preference-learning

Here are 36 public repositories matching this topic...

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

preference-learning rlhf

Updated Oct 23, 2024
Python

tournesol-app / tournesol

Free and open source code of the https://tournesol.app platform. Meet the community on Discord https://discord.gg/WvcSG55Bf3

python youtube django reactjs django-rest-framework dataset recommendation-engine preference-learning social-choice ai-ethics bradley-terry-model golden-ratio-optimization preference-aggregation

Updated Nov 14, 2024
Python

qxcv / magical

The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)

reinforcement-learning imitation-learning preference-learning reinforcement-learning-environments

Updated Dec 5, 2023
Python

metis

JanoschMenke / metis

Python-based GUI to collect Feedback of Chemist in Molecules

machine-learning drug-discovery human-in-the-loop preference-learning de-novo-drug-design generative-ai

Updated Oct 15, 2024
Python

SMARTlab-Purdue / SAN-NaviSTAR

This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Transformer and Preference Learning". For more details, please refer to our project website at https://sites.google.com/view/san-navistar.

machine-learning reinforcement-learning transformer preference-learning robot-navigation socially-aware-navigation

Updated Nov 9, 2024
Python

IAAR-Shanghai / ICSFSurvey

Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.

decoding self-improvement knowledge-distillation data-augmentation reasoning self-consistency preference-learning hallucination self-correction attention-head large-language-models chain-of-thought large-language-model internal-consistency self-feedback self-refine self-correct

Updated Nov 15, 2024
Jupyter Notebook

SMARTlab-Purdue / SAN-FAPL

This repository contains the source code for our paper: "Feedback-efficient Active Preference Learning for Socially Aware Robot Navigation", accepted to IROS-2022. For more details, please refer to our project website at https://sites.google.com/view/san-fapl.

machine-learning reinforcement-learning learning-from-demonstration preference-learning robot-navigation socially-aware-navigation

Updated Oct 17, 2022
Python

jimparr19 / pypbl

Python library for preference based learning

recommendation-engine bayesian-inference preference-learning

Updated Jun 25, 2021
Python

Intelligent-Systems-Group / jpl-framework

Java framework for Preference Learning

machine-learning collaborative-filtering ranking preference-learning label-ranking object-ranking

Updated Mar 5, 2018
Java

vicgalle / configurable-safety-tuning

Data and models for the paper "Configurable Safety Tuning of Language Models with Synthetic Preference Data"

alignment safety preference-learning dpo llm

Updated Jul 27, 2024
Python

albiboni / User-RecSys

Code for the project: "Analysis of Recommendation-systems based on User Preferences".

preferences booking user preferences-learning reccomender booking-system user-preferences preference-learning reccommendation reccomendersystem

Updated Mar 6, 2018
Python

sail-sg / dice

Official implementation of Bootstrapping Language Models via DPO Implicit Rewards

alignment preference-learning large-language-models rlhf

Updated Jul 29, 2024
Python

julilien / PLDepth

Code for "Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce Model" as published at CVPR 2021.

machine-learning deep-learning learning-to-rank cvpr weakly-supervised-learning preference-learning monocular-depth monocular-depth-estimation plackett-luce cvpr2021 relative-depth

Updated Feb 3, 2024
Python

afiliot / Preference-Learning-And-Movie-Reviews

Project on preference learning - ENSAE ParisTech

ensae preference-learning regret-minimization label-learning movie-recommender preference-graph instance-preference label-preference

Updated Apr 7, 2023
Python

makgyver / PRL

[P]reference and [R]ule [L]earning algorithm implementation for Python 3 (https://arxiv.org/abs/1812.07895)

machine-learning algorithm game-theory preference-learning

Updated Mar 17, 2019
Python

typoverflow / WiseRL

PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms

reinforcement-learning pytorch preference-learning

Updated Nov 17, 2024
Python

CJReinforce / RIME_ICML2024

Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)

reinforcement-learning deep-learning robotics artificial-intelligence manipulation locomotion preference-learning reinforcement-learning-from-human-feedback

Updated Oct 15, 2024
Python

GAN-Assisted-Preference-Based-Learning

98k-bot / GAN-Assisted-Preference-Based-Learning

A paper under AAAI-20 review

gan reinforcement-learning-algorithms preference-learning

Updated Aug 27, 2019
Python

BSBT

rowlandseymour / BSBT

Bayesian Spatial Bradley--Terry

bayesian-inference preference-learning bradley-terry comparative-judgement

Updated Nov 24, 2023
R

BARUDA-AI / Awesome-Preference-Optimization

Survey of preference alignment algorithms

alignment direct preference-learning rlhf preference-alignment

Updated Feb 25, 2024

Improve this page

Add a description, image, and links to the preference-learning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the preference-learning topic, visit your repo's landing page and select "manage topics."