Question 1

How do I train a medical LLM from scratch using MedicalGPT?

Accepted Answer

Start by preparing your data in supported formats, then run the provided shell scripts like run_pt.sh for pretraining and run_sft.sh for fine-tuning. The project includes Colab notebooks for a step-by-step pipeline, but expect to handle GPU resource management and configuration tweaks.

Question 2

What's the difference between MedicalGPT and just using Hugging Face Transformers for medical fine-tuning?

Accepted Answer

MedicalGPT offers a specialized, end-to-end pipeline with pre-configured stages for medical domain adaptation, including DPO and ORPO alignment, whereas Hugging Face Transformers provides general tools requiring more manual setup for multi-stage training and healthcare-specific optimizations.

Question 3

How much GPU memory is needed to fine-tune a 13B model with LoRA in MedicalGPT?

Accepted Answer

According to the hardware table, fine-tuning a 13B model with LoRA at 16-bit precision requires about 32GB of VRAM. You can reduce this with QLoRA at lower bits, but performance may trade off.

Question 4

Does MedicalGPT support retrieval-augmented generation (RAG) for medical Q&A?

Accepted Answer

Yes, it includes a ChatPDF demo for RAG-based file问答, allowing you to combine fine-tuned LLMs with knowledge bases. However, this is an add-on feature and requires additional setup for document processing.

Question 5

Can I use my own medical dataset with MedicalGPT, and what format does it need?

Accepted Answer

Yes, you can use custom datasets by converting them to JSONL formats similar to shibing624/medical. The tools directory has scripts like convert_dataset.py to help with formatting, but you may need to adjust for specific data structures.

Question 6

Which alignment method should I choose: DPO, ORPO, or RLHF in MedicalGPT?

Accepted Answer

DPO and ORPO are easier to implement and train without complex reinforcement learning, suitable for quick preference alignment. RLHF with reward modeling is more resource-intensive but can yield fine-grained control. The choice depends on your dataset size and computational budget.

MedicalGPT

What is MedicalGPT?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions