MD-informed Self-Attention :octocat:

We introduce Medical Evidence Dependency (MD)-informed Self-Attention, a neuro-symbolic model for understanding free-text medical evidence in the literature. We hypothesize that this method gets the best of both worlds: the high capacity of neural networks and the rigor, semantic clarity, and reusability of symbolic logic.

  • Citation: Kang, T., Turfah, A., Kim, J., Perotte, A., and Weng, C. (2021). A Neuro-Symbolic Method for Understanding Free-text Medical Evidence. Journal of the American Medical Informatics Association (in press).
  • Contact: Tian Kang (tk2624@cumc.columbia.edu)
  • Affiliation: Department of Biomedical Informatics, Columbia University (Dr. Chunhua Weng's lab)

Repository

MDAtt.py: generates the Medical Evidence Dependency-informed attention head (see the sketch below).
MED_modeling.py: modified from bert/modeling.py (the attention_layer, transformer, and BERT classes).
run_MDAttBert.py: runs BERT with the Medical Evidence Dependency-informed attention head.
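To give a rough idea of the kind of structure MDAtt.py works from, here is a minimal sketch that builds a binary MD matrix from symbolic evidence-dependency arcs over a tokenized passage. The arc format, the function name, and the toy arcs are illustrative assumptions, not the repository's actual implementation.

```python
import numpy as np

def build_md_matrix(num_tokens, md_arcs):
    """Hypothetical helper: mark token pairs linked by a Medical Evidence
    Dependency arc, given arcs as (head_index, dependent_index) pairs."""
    md = np.zeros((num_tokens, num_tokens), dtype=np.float32)
    for head, dep in md_arcs:
        md[head, dep] = 1.0  # link the pair in both directions
        md[dep, head] = 1.0
    np.fill_diagonal(md, 1.0)  # every token may attend to itself
    return md

# Toy passage: "aspirin reduced mortality versus placebo"
# with made-up PICO-style dependencies (intervention-outcome, etc.)
arcs = [(0, 2), (4, 2), (0, 4)]
print(build_md_matrix(5, arcs))
```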

Model description

Model: We develop a symbolic compositional representation called Medical Evidence Dependency (MD) to represent the basic medical evidence entities and relations, following the PICO framework widely adopted by clinicians for searching evidence. We use the Transformer as the backbone and train one head in the multi-head self-attention to attend to MD and to pass linguistic and domain knowledge on to later layers (MD-informed). We integrate MD-informed Attention into BioBERT and evaluate it on two public machine reading comprehension (MRC) benchmarks for medical evidence from the literature: Evidence Inference 2.0 and PubMedQA.
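A minimal sketch of this mechanism in plain numpy, under simplifying assumptions (a single layer, no output projection, and hard masking rather than the paper's exact weighting scheme): one designated head attends only over MD-linked token pairs, while the remaining heads stay standard scaled dot-product attention. The actual integration in MED_modeling.py differs in detail.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def md_informed_attention(x, wq, wk, wv, md_matrix, num_heads=4):
    """Multi-head self-attention where head 0 is restricted to MD-linked
    token pairs (a simplified assumption, not the exact published scheme)."""
    seq_len, d_model = x.shape
    d_head = d_model // num_heads
    q, k, v = x @ wq, x @ wk, x @ wv
    heads = []
    for h in range(num_heads):
        cols = slice(h * d_head, (h + 1) * d_head)
        scores = q[:, cols] @ k[:, cols].T / np.sqrt(d_head)
        if h == 0:
            # MD-informed head: mask out pairs with no MD dependency
            scores = np.where(md_matrix > 0, scores, -1e9)
        heads.append(softmax(scores) @ v[:, cols])
    return np.concatenate(heads, axis=-1)

# Toy demo: 5 tokens, model width 8, one hand-built MD link
rng = np.random.default_rng(0)
x = rng.normal(size=(5, 8))
wq, wk, wv = (rng.normal(size=(8, 8)) for _ in range(3))
md = np.eye(5)
md[0, 2] = md[2, 0] = 1.0  # e.g. an intervention-outcome dependency
print(md_informed_attention(x, wq, wk, wv, md).shape)  # (5, 8)
```

The design point the sketch illustrates: the symbolic MD structure enters only through one head's attention pattern, so the rest of the pretrained model is left untouched.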

Medical Evidence Dependency (MD) and Proposition
[figure]

Medical Evidence Dependency (MD) Matrix
[figure]

Medical Evidence Dependency (MD)-informed Self-Attention
[figure]

Results: Integrating the MD-informed Attention head improves BioBERT substantially on both benchmarks, by as much as +30% in F1 score, and achieves new state-of-the-art performance on Evidence Inference 2.0. By visualizing the weights learned by the MD-informed Attention head, we find the model can capture clinically meaningful relations even when they are separated by long passages of text.
