llama-factory fine-tuning-1
data preparation
For LLaMA-Factory fine-tuning, here are the instructions for preparing a custom dataset.
dataset classification
alpaca
The stanford_alpaca dataset is a famous example of instruction data: it was used to fine-tune LLaMA into the Alpaca model. Its structure is as follows.
[ { "instruction": "user instruction (required)", "input": "user input (optional)", "output": "model response (required)", "history": [ ["user instruction in the first round (optional)", "model response in the first round (optional)"], ["user instruction in the second round (optional)", "model response in the second round (optional)"] ] } ]
The diagram below shows how the Alpaca model was obtained:

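A dataset in this format also has to be registered in data/dataset_info.json before it can be referred to by name via --dataset. A hedged sketch of such an entry, assuming the file is saved as data/alpaca_medical_en.json (the dataset name matches the one used in the PPO command below; the file name and column mapping follow the usual LLaMA-Factory convention and should be checked against the version in use):

"alpaca_medical_en": {
  "file_name": "alpaca_medical_en.json",
  "columns": {
    "prompt": "instruction",
    "query": "input",
    "response": "output",
    "history": "history"
  }
}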
sharegpt
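The sharegpt format stores each sample as a multi-turn conversation rather than a single instruction/response pair. A minimal sketch of this layout as commonly supported by LLaMA-Factory (the exact field names follow the usual sharegpt convention and should be verified against your version):

[
  {
    "conversations": [
      { "from": "human", "value": "user instruction in the first round" },
      { "from": "gpt", "value": "model response in the first round" },
      { "from": "human", "value": "user instruction in the second round" },
      { "from": "gpt", "value": "model response in the second round" }
    ]
  }
]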
reward model training
command
CUDA_VISIBLE_DEVICES=3 python src/train_bash.py \
    --stage rm \
    --model_name_or_path ../llama/models_hf/7B \
    --do_train \
    --dataset comparison_gpt4_en \
    --template default \
    --finetuning_type lora \
    --lora_target q_proj,v_proj \
    --resume_lora_training False \
    --checkpoint_dir ./FINE/llama2-7b-medical_single/checkpoint-70000 \
    --output_dir ./Reward/medical \
    --per_device_train_batch_size 2 \
    --gradient_accumulation_steps 4 \
    --lr_scheduler_type cosine \
    --logging_steps 10 \
    --save_steps 1000 \
    --learning_rate 1e-6 \
    --num_train_epochs 1.0 \
    --plot_loss \
    --fp16
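The rm stage trains on pairwise comparison data. In LLaMA-Factory's alpaca-style layout this is usually expressed by making output a two-element list, preferred response first and rejected response second, which is roughly how datasets like comparison_gpt4_en are organized. A hedged sketch of a single record:

[
  {
    "instruction": "user instruction",
    "input": "user input (optional)",
    "output": [
      "preferred (chosen) model response",
      "rejected model response"
    ]
  }
]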
loss diagram

evaluation
2000 steps

4000 steps

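The exact evaluation command is not shown above; one way to score a reward-model checkpoint on comparison data with the same entry point is sketched below. This is a hedged sketch: it assumes --do_eval is supported for the rm stage in the LLaMA-Factory version used, and the checkpoint and output paths (here the 2000-step checkpoint) are illustrative.

CUDA_VISIBLE_DEVICES=3 python src/train_bash.py \
    --stage rm \
    --model_name_or_path ../llama/models_hf/7B \
    --do_eval \
    --dataset comparison_gpt4_en \
    --template default \
    --finetuning_type lora \
    --checkpoint_dir ./Reward/medical/checkpoint-2000 \
    --output_dir ./Reward/medical/eval-2000 \
    --per_device_eval_batch_size 2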
PPO training
command (do not use --fp16, otherwise training fails with: ValueError: Attempting to unscale FP16 gradients)
CUDA_VISIBLE_DEVICES=1 python src/train_bash.py \
    --stage ppo \
    --model_name_or_path ../llama/models_hf/7B \
    --do_train \
    --dataset alpaca_medical_en \
    --template default \
    --finetuning_type lora \
    --lora_target q_proj,v_proj \
    --resume_lora_training False \
    --checkpoint_dir ./FINE/llama2-7b-medical_single/checkpoint-70000 \
    --reward_model ./Reward/medical/checkpoint-4000 \
    --output_dir ./PPO/medical/medical_gpt4 \
    --per_device_train_batch_size 2 \
    --gradient_accumulation_steps 4 \
    --lr_scheduler_type cosine \
    --top_k 0 \
    --top_p 0.9 \
    --logging_steps 10 \
    --save_steps 1000 \
    --learning_rate 1e-5 \
    --num_train_epochs 1.0 \
    --plot_loss
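After PPO finishes, one way to try the tuned adapter interactively is the repository's CLI chat script. A hedged sketch, assuming src/cli_demo.py exists in the LLaMA-Factory version used and accepts the same model/template/checkpoint flags as train_bash.py:

CUDA_VISIBLE_DEVICES=1 python src/cli_demo.py \
    --model_name_or_path ../llama/models_hf/7B \
    --template default \
    --finetuning_type lora \
    --checkpoint_dir ./PPO/medical/medical_gpt4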