SyntaxShap is a model agnostic attribution-based explainability method for text generation. It accounts for the syntax in the text data by extending Shapley values to consider parsing-based syntactic dependencies. We adopt a game-theoretic approach to compute the contributions of each token in the input text to the model's prediction. We scope ourselves to explaining text generation by autoregressive language models. We compare SyntaxShap to the random baseline and popular explainers for text data, including LIME, SHAP, Partition. In addition, we also extend our method to a weighted variant SyntaxShap-w.
Abstract
Citation
@article{amara2022SyntaxShap, title=SyntaxShap: Syntax-aware Explainability Method for Text Generation}, author={Amara, Kenza and Sevastjanova, Rita and El-Assady, Mennatallah }, journal={arXiv preprint https://arxiv.org/pdf/2402.09259}, year={2024} }
Contribute to SyntaxShap!