faithfulness

Here are 9 public repositories matching this topic...

pkuserc / ChatGPT_for_IE

Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

performance evaluation information-extraction calibration named-entity-recognition event-detection event-extraction relation-extraction entity-typing relation-classification explainability large-language-models chatgpt faithfulness

Updated Aug 17, 2024
Python

khuangaf / CHOCOLATE

Star

Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"

factuality faithfulness large-vision-language-models chart-understanding chart-captioning chart-summarization

Updated Jun 5, 2024
Jupyter Notebook

MinhVuong2000 / LLMReasonCert

Star

Official Implementation of ACL2024 paper "Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs"(https://arxiv.org/abs/2402.11199).

framework evaluation knowledge-graph reasoning evaluation-framework llms faithfulness

Updated Jul 27, 2024
Python

YisongMiao / DiSQ-Score

Star

The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understanding of Discourse Relations> @ ACL 2024

evaluation discourse language-model faithfulness socratic-method

Updated Aug 7, 2024
Python

About The corresponding code from our paper " Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning" . Do not hesitate to open an issue if you run into any trouble!

nlp reasoning faithfulness chain-of-thought-reasoning

Updated Sep 6, 2024

vggls / medical_xai

Star

On the evaluation of deep learning interpretability methods for medical images under the scope of faithfulness

computer-vision grad-cam haas x-ray digital-pathology explainable-ai medical-ai aopc hirescam faithfulness max-sensitivity

Updated Aug 19, 2024
Jupyter Notebook

KomeijiForce / Active_Passive_Constraint_Koishiday_2024

Star

[NeurIPS 2024] An advanced persona-driven role-playing system with global faithfulness quantification and optimization. In memory of the Koishi's Day of 2024.

role-playing metrics global-optimization quantification factuality-checking faithfulness komeiji ai-character