MedAdapter

This is the code for our paper "MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning" in EMNLP 2024.

Framework

Data

You need to first download the raw data from the following sources: MedMCQA, MedQA, MMLU-Med, PubMedQA, BioASQ, MedNLI, MediQA-RQE, PubHealth, MediQA, and CORD19. Please download the data and save in the ./data/<DATASET_NAME>/ directory.

Experiments

Candidate Solution Generation

Before training the model, we need to first leverage the LLM to generate candidate solution for the following reward model training. We need to generate the candidates on both training and test sets. For OpenAI commercial models (e.g., gpt-3.5-turbo, gpt-4), we need to call main-openai.py to initiate the generation process:

python main-openai.py --debug generation --config configs/bioasq/bioasq-gen-test.yaml
python main-openai.py --debug generation --config configs/bioasq/bioasq-gen-train.yaml

For the open-sourced LLMs, we need to leverage the vLLM as the inference model engine and need to run main.py to initiate the generation process:

CUDA_VISIBLE_DEVICES=1,2 accelerate launch --mixed_precision fp16 --main_process_port 29650 main.py --debug generation --config configs/bioasq/bioasq-gen-test.yaml
CUDA_VISIBLE_DEVICES=1,2 accelerate launch --mixed_precision fp16 --main_process_port 29650 main.py --debug generation --config configs/bioasq/bioasq-gen-train.yaml

The --config indicate the generation configuration during inference, containing important hyperparameters. The configuration files should be stored in the directory ./configs/<Dataset_Name>/<Model_Name>/.

Outcome-Supervised Adapter Training

To train the model, we need to run the main.py entry program:

CUDA_VISIBLE_DEVICES=0 accelerate launch --mixed_precision fp16 --main_process_port 29666 main.py --debug reward --config configs/bioasq/bioasq-reward.yaml

The configuration file of training the adapter should also be saved under ./configs/<Dataset_Name>/<Model_Name>/ directory.

Best-of-K Inference

Same as the previous two stages, we need to run the entry program main.py again to inference on the previously generated candidates on test set:

CUDA_VISIBLE_DEVICES=0 accelerate launch --mixed_precision fp16 --main_process_port 29666 main.py --debug reward_guide --config configs/bioasq/bioasq-guide.yaml

The configuration file of training the adapter should also be saved under ./configs/<Dataset_Name>/<Model_Name>/ directory.

Running Scripts

We also offer several examples of running commands in the directory ./scripts.

Citation

If you find this repository valuable for your research, we kindly request that you acknowledge our paper by citing the follwing paper. We appreciate your consideration.

@misc{shi2024medadapterefficienttesttimeadaptation,
      title={MedAdapter: Efficient Test-Time Adaptation of Large Language Models towards Medical Reasoning}, 
      author={Wenqi Shi and Ran Xu and Yuchen Zhuang and Yue Yu and Haotian Sun and Hang Wu and Carl Yang and May D. Wang},
      year={2024},
      eprint={2405.03000},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2405.03000}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
configs/bioasq/gpt-35		configs/bioasq/gpt-35
eval_fn		eval_fn
generator		generator
inference		inference
models		models
reward_model/orm		reward_model/orm
scripts		scripts
utils		utils
README.md		README.md
evaluate_score.py		evaluate_score.py
main-openai.py		main-openai.py
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MedAdapter

Framework

Data

Experiments

Candidate Solution Generation

Outcome-Supervised Adapter Training

Best-of-K Inference

Running Scripts

Citation

About

Releases

Packages

Languages

wshi83/MedAdapter

Folders and files

Latest commit

History

Repository files navigation

MedAdapter

Framework

Data

Experiments

Candidate Solution Generation

Outcome-Supervised Adapter Training

Best-of-K Inference

Running Scripts

Citation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages