NLPMed-Engine

The NLP backend for NLPMed-Portal.

NLPMed-Engine is a robust and extensible natural language processing engine tailored for medical text. It supports a range of NLP tasks commonly used in clinical and biomedical applications.

⚠️ Important: This software is intended for research use only. It must not be used in real-world medical or clinical decision-making settings.

Installation

Option 1 - Install nlpmed_engine Python package:

git clone https://github.com/ang-li-lab/NLPMed-Engine.git
cd NLPMed-Engine

Install based on your environment:

CPU:
```
pip install -e .
```
Apple GPU (MPS):
```
pip install -e ".[gpu_apple]"
```
Apple CUDA 11:
```
pip install -e ".[gpu_cuda11]"
```
Apple CUDA 12:
```
pip install -e ".[gpu_cuda12]"
```

Option 2 - Install dependencies only:

git clone https://github.com/ang-li-lab/NLPMed-Engine.git
cd NLPMed-Engine

Install dependencies based on your environment:

CPU:
```
pip install -r requirements/cpu.txt
```

Apple GPU (MPS):

pip install -r requirements/gpu_apple.txt

Apple CUDA 11:

pip install -r requirements/gpu_guda11.txt

Apple CUDA 12:

pip install -r requirements/gpu_guda12.txt

Usage

Run REST API

Create a .env file with the following template:

API_ML_MODEL_NAMES=VTE,BLEED

API_ML_VTE_MULTICLASS_DEVICE=cpu
API_ML_VTE_MULTICLASS_MODEL_PATH=/Users/model
API_ML_VTE_MULTICLASS_TOKENIZER_PATH=/Users/tokenizer
API_ML_VTE_MULTICLASS_MAX_LENGTH=512

API_ML_BLEED_BINARY_DEVICE=cuda:0
API_ML_BLEED_BINARY_MODEL_PATH=/Users/model
API_ML_BLEED_BINARY_TOKENIZER_PATH=/Users/tokenizer
API_ML_BLEED_BINARY_MAX_LENGTH=512

API_HOST=127.0.0.1
API_PORT=10010
API_WORKERS=1

Run the API:
```
python scripts/run_api.py
```

Run Single or Batch Pipelines

Instead of using the API, you can directly use the SinglePipeline or BatchPipeline classes in your Python code.

Sample Jupyter notebooks are provided under the notebooks/ directory.
Note: Since BatchPipeline uses parallel processing, the output order may differ from the input. Always use patient_id, note_id when merging results back with the input data.

Resources

Demo: Visit our demo site
VTE-BERT Model: Our fine-tuned model optimized for VTE classification is available under gated access on Hugging Face.
Publication: Development and Validation of VTE-BERT Natural Language Processing Model for Venous Thromboembolism (Open Access)

Documentation

See documentation for full API and module reference (generated with Sphinx).

Terms of Use

This project includes software, models, or a federated learning framework that are governed by additional terms beyond the AGPLv3 license.

By using this software or model, you agree to the Terms & Conditions.

Citation

If you use NLPMed-Engine in your research or applications, please cite our paper:

@article{jafaridevelopment,
  title={Development and Validation of VTE-BERT Natural Language Processing Model for Venous Thromboembolism},
  author={Jafari, Omid and Ma, Shengling and Lam, Barbara D and Jiang, Jun Y and Zhou, Emily and Ranjan, Mrinal and Ryu, Justine and Bandyo, Raka and Maghsoudi, Arash and Peng, Bo and others},
  journal={Journal of Thrombosis and Haemostasis},
  publisher={Elsevier},
  year={2025},
  doi={10.1016/j.jtha.2025.07.021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.github/workflows		.github/workflows
.reuse/templates		.reuse/templates
.vscode		.vscode
docs		docs
nlpmed_engine		nlpmed_engine
notebooks		notebooks
requirements		requirements
scripts		scripts
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
CONTRIBUTORS.txt		CONTRIBUTORS.txt
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
TERMS.md		TERMS.md
nlpmed_engine.service		nlpmed_engine.service
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLPMed-Engine

Installation

Option 1 - Install nlpmed_engine Python package:

Option 2 - Install dependencies only:

Usage

Run REST API

Run Single or Batch Pipelines

Resources

Documentation

Terms of Use

Citation

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NLPMed-Engine

Installation

Option 1 - Install nlpmed_engine Python package:

Option 2 - Install dependencies only:

Usage

Run REST API

Run Single or Batch Pipelines

Resources

Documentation

Terms of Use

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages