ChetKao
/

Bohdi-Llama-3.1-8B-Instruct

@@ -1,9 +1,11 @@
 ---
 license: mit
 ---
 <div align="center">
 <div style="display: flex; justify-content: center; align-items: center; text-align: center;">
   <div style="display: flex; align-items: center; margin: auto;">
     <strong style="margin-left: 0px; font-size: 24px;">Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration</strong>
@@ -29,7 +31,7 @@ _<sup>†</sup> Corresponding Author_
 ### 📄 Introduction
 Bohdi is a novel framework for heterogeneous Large Language Model (LLM) fusion that integrates the strengths of multiple source LLMs into a target LLM through adaptive knowledge exploration and automatic data generation. Unlike existing methods that rely on real data from limited domains and use fixed data allocation proportions, Bohdi dynamically adjusts sampling based on the target LLM's performance and generates data automatically through a hierarchical knowledge tree structure. This ensures comprehensive domain coverage and balanced capability enhancement without the need for real data. Our github page is [Bohdi](https://github.com/gjq100/Bohdi).
 ### ✨ Features
@@ -54,7 +56,7 @@ conda env create -f opencompass_env.yaml
 ```bash
 # The version we used: opencompass 0.3.4
 git clone https://github.com/open-compass/opencompass opencompass
-cd opencompass
 pip install -e .
 ```
@@ -64,7 +66,7 @@ pip install -e .
 To train the target LLM using Bohdi, follow these steps:
 1. **Prepare Source LLMs**: Ensure you have access to the source LLMs you want to fuse. If you want to follow our setup, please download the following models:
-   ```Python
    # Source Models
    Qwen/Qwen2.5-14B-Instruct
    mistralai/Mistral-Small-24B-Instruct-2501
@@ -79,7 +81,7 @@ To train the target LLM using Bohdi, follow these steps:
 Please first configure the relevant paths in `run_bohdi.sh` according to your actual paths, and then run:
    ```bash
    source activate bohdi
-   cd your project path
    bash run_bohdi.sh
    ```
@@ -87,10 +89,20 @@ Please first configure the relevant paths in `run_bohdi.sh` according to your ac
 We use <a href="https://github.com/open-compass/opencompass/tree/main">OpenCompass</a> for evaluation and perform inference based on VLLM. To evaluate your model, please configure the relevant paths in `eval_opencompass.sh` according to your actual paths, and then run:
 ```bash
 source activate opencompass
-cd your project path
 bash eval_opencompass.sh
 ```
 ### 📚 Citation
 ```
 @article{gao2025bohdi,

 ---
 license: mit
+library_name: transformers
+pipeline_tag: text-generation
 ---
 <div align="center">
 <div style="display: flex; justify-content: center; align-items: center; text-align: center;">
   <div style="display: flex; align-items: center; margin: auto;">
     <strong style="margin-left: 0px; font-size: 24px;">Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration</strong>
 ### 📄 Introduction
 Bohdi is a novel framework for heterogeneous Large Language Model (LLM) fusion that integrates the strengths of multiple source LLMs into a target LLM through adaptive knowledge exploration and automatic data generation. Unlike existing methods that rely on real data from limited domains and use fixed data allocation proportions, Bohdi dynamically adjusts sampling based on the target LLM's performance and generates data automatically through a hierarchical knowledge tree structure. This ensures comprehensive domain coverage and balanced capability enhancement without the need for real data. Our github page is [Bohdi](https://github.com/gjq100/Bohdi).
+We release model weights of the resulting LLMs which are finetuned with Bohdi.
 ### ✨ Features
 ```bash
 # The version we used: opencompass 0.3.4
 git clone https://github.com/open-compass/opencompass opencompass
+cd [your project path]/opencompass
 pip install -e .
 ```
 To train the target LLM using Bohdi, follow these steps:
 1. **Prepare Source LLMs**: Ensure you have access to the source LLMs you want to fuse. If you want to follow our setup, please download the following models:
+   ```python
    # Source Models
    Qwen/Qwen2.5-14B-Instruct
    mistralai/Mistral-Small-24B-Instruct-2501
 Please first configure the relevant paths in `run_bohdi.sh` according to your actual paths, and then run:
    ```bash
    source activate bohdi
+   cd [your project path]/Bohdi
    bash run_bohdi.sh
    ```
 We use <a href="https://github.com/open-compass/opencompass/tree/main">OpenCompass</a> for evaluation and perform inference based on VLLM. To evaluate your model, please configure the relevant paths in `eval_opencompass.sh` according to your actual paths, and then run:
 ```bash
 source activate opencompass
+cd [your project path]/opencompass
 bash eval_opencompass.sh
 ```
+### Direct Download and Usage
+If you would like to directly use the distilled models for evaluation, our distilled models can be found directly on Hugging Face:
+```python
+ChetKao/Bohdi-Llama-3.2-3B-Instruct
+ChetKao/Bohdi-Llama-3.1-8B-Instruct
+ChetKao/Bohdi-Qwen2.5-7B-Instruct
+ChetKao/Bohdi-gemma-2-9b-it
+```
 ### 📚 Citation
 ```
 @article{gao2025bohdi,