HdLM (Hierarchical Decoding Layer by Layer: Uncovering Multiscale Abstraction in Language Models)

This repository provides codes of HdLM for hierarchical classification and generation tasks based Llama3-8B, demonstrating the experiment on the Web of Science (WoS) and ESConv datasets. The implementation supports easy transfer to other datasets like DBPedia, EmpatheticDialogues, and AQuA.

📂 For WoS Dataset

dataset preparation

1. Obtain the Dataset

The Web of Science (WoS) dataset contains 46,985 documents across 134 hierarchical categories. For details, see WoS Dataset | Papers With Code.

2. Preprocessing Steps

# Navigate to WoS data directory
cd data/Depth2/WoS

# Create required directories
mkdir -p alpaca_datasets splited_jsondata

# Process data
python data_processor.py
python generate_dataset.py

Processed datasets will be available in:

data/Depth2/WoS/alpaca_datasets/
├── wos_train_data.json
├── wos_test_data.json
└── wos_valid_data.json  # Optional validation set

🚀 Training Configuration

1. Register Dataset

Update dataset registry in train/LLaMA-Factory/data/dataset_info.json:

{
  "WoS-HTC":{
        "file_name": "/your/wos_dataset/path/",
        "formatting": "alpaca",
        "add_thought": true,
        "has_subtask": true,
        "columns":{
            "prompt": "system",
            "query": "input",
            "thought": "think",
            "subtask": "subtask",
            "response": "assistant"
        }
    }
}

2. Configure Training Script

Edit train/LLaMA-Factory/training_scripts/wos_train.sh, replace the model addresses in the script, including the base model and the fine-tuned output model.

3. Start Training

cd train/LLaMA-Factory
bash training_scripts/wos_train.sh

🔍 Evaluate

Inference can be performed using specific commands to run the model and obtain predictions, as outlined below:

layer_index=25
python infer/WoS/infer.py --model_path /YOUR/MODEL/PATH/ --intermediate_layer_index $layer_index

After running the above command, you will be able to see the Macro-F1 and Micro-F1 score results for the WoS dataset.

📂 For ESconv Dataset

🚀 Training Configuration

1. Register Dataset

Update dataset registry in train/LLaMA-Factory/data/dataset_info.json:

{
  "ESC-Depth2":{
        "file_name": "/your/ESC_dataset/path/",
        "formatting": "alpaca",
        "add_thought": true,
        "has_subtask": true,
        "columns":{
            "prompt": "system",
            "query": "input",
            "thought": "think",
            "subtask": "subtask",
            "response": "assistant"
        }
    }
}

2. Configure Training Script

Edit train/LLaMA-Factory/training_scripts/esc_d2_train.sh, replace the model addresses in the script, including the base model and the fine-tuned output model.

3. Start Training

cd train/LLaMA-Factory
bash  training_scripts/esc_d2_train.sh

Inference

Inference can be performed using specific commands to run the model and obtain predictions, as outlined below:

layer_index=28
python infer/ESC/infer.py --model_path /YOUR/MODEL/PATH/ --intermediate_layer_index $layer_index

After running the above command, you will be able to see the Macro-F1 and Micro-F1 score results for the ESConv dataset.

Evaluation

Refer to the following command to run the evaluation script:

cd eval
bash ESC/run.sh

This command evaluates the inference results against the correct answers, calculating the following metrics:

Dist-2: Measures the diversity of the generated text; a higher value indicates richer and more varied text.
CIDEr: A metric used to evaluate the similarity between generated text and reference text. It is commonly used in image description generation tasks and can also be employed here to assess the quality of generated responses.

By comparing the results of our method with those of conventional baselines, We demonstrate that our approach achieves performance that not only matches but also exceeds that of conventional baseline methods, showcasing its superiority. Moreover, under comparable performance, our method offers the advantage of significantly reduced computational demands during both training and inference.

⚠️ Notes

If encountering syntax error in Linux:

vim train/LLaMA-Factory/training_scripts/wos_train.sh +":set fileformat=unix" +wq

Here, WoS and ESC are taken as examples. Other datasets provide cases. The original datasets can be preprocessed according to demo_cases.json

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
eval		eval
infer		infer
train/LLaMA-Factory		train/LLaMA-Factory
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HdLM (Hierarchical Decoding Layer by Layer: Uncovering Multiscale Abstraction in Language Models)

📂 For WoS Dataset

dataset preparation

1. Obtain the Dataset

2. Preprocessing Steps

🚀 Training Configuration

1. Register Dataset

2. Configure Training Script

3. Start Training

🔍 Evaluate

📂 For ESconv Dataset

🚀 Training Configuration

1. Register Dataset

2. Configure Training Script

3. Start Training

Inference

Evaluation

⚠️ Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

HdLM (Hierarchical Decoding Layer by Layer: Uncovering Multiscale Abstraction in Language Models)

📂 For WoS Dataset

dataset preparation

1. Obtain the Dataset

2. Preprocessing Steps

🚀 Training Configuration

1. Register Dataset

2. Configure Training Script

3. Start Training

🔍 Evaluate

📂 For ESconv Dataset

🚀 Training Configuration

1. Register Dataset

2. Configure Training Script

3. Start Training

Inference

Evaluation

⚠️ Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages