Explaining Large Language Model Decisions Using Shapley Values

  • This doesn't replicate using gpt-4o-mini, which always picks Flight B even when Flight A is made somewhat more attractive.

    Source: I just ran it on 0-20 newlines with 100 trials apiece, raising the temperature and using different random seeds to prevent any prompt caching.
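
    A minimal sketch of that kind of rerun, assuming the official openai Python client and a placeholder flight prompt (the paper's exact prompt and option wording would be substituted), not the exact script used:

      # Sketch only: loop over 0-20 inserted newlines, 100 trials apiece,
      # varying temperature and seed as described in the comment above.
      import random
      from collections import Counter
      from openai import OpenAI

      client = OpenAI()

      # Placeholder stand-in for the paper's flight-choice prompt.
      PROMPT_BODY = (
          "You must choose one flight.\n"
          "Flight A: $400, one stop, 6h total.\n"
          "Flight B: $450, nonstop, 4h total.\n"
          "Answer with exactly 'Flight A' or 'Flight B'."
      )

      def pick(newlines: int) -> str:
          """One trial: pad the prompt with blank lines and ask gpt-4o-mini."""
          prompt = "\n" * newlines + PROMPT_BODY
          resp = client.chat.completions.create(
              model="gpt-4o-mini",
              messages=[{"role": "user", "content": prompt}],
              temperature=1.0,                    # raised temperature
              seed=random.randint(0, 2**31 - 1),  # fresh seed each call
              max_tokens=5,
          )
          return resp.choices[0].message.content.strip()

      for n in range(21):                                  # 0-20 newlines
          counts = Counter(pick(n) for _ in range(100))    # 100 trials apiece
          print(n, dict(counts))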

  • While I love XAI and am always happy to see more work in this area, I wonder if other people use the same heuristics as me when judging a random arXiv link. This paper has a single author, wasn't written in LaTeX, and has no comment referencing a peer-reviewed venue. Do other people in this field look at these same signals and pre-judge the paper negatively?

    I did attempt to check my bias and skim the paper; it does seem well written and takes a decent shot at understanding LLMs. However, I am not a fan of black-box explanations, so I didn't read much (I really like sparse autoencoders). Has anyone else read the paper? How is the quality?

  • explainable AI just ain't there yet.

    I wonder if the author took a class with Lipton, since he's at CMU. We literally had a lecture about Shapley Values "explaining" AI. It's BS.
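
    For context, the Shapley attribution under discussion can be brute-forced exactly over a handful of prompt components. A minimal sketch, with a made-up value function standing in for the model's probability of answering "Flight B" (not the paper's numbers or API calls):

      # Exact Shapley values by enumerating every coalition of prompt components.
      # TOY_V is an illustrative stand-in, NOT model calls or the paper's data.
      from itertools import combinations
      from math import factorial

      COMPONENTS = ["price info", "duration info", "trailing newlines"]

      # Toy "probability of answering 'Flight B'" for each subset of components.
      TOY_V = {
          frozenset(): 0.50,
          frozenset({"price info"}): 0.40,
          frozenset({"duration info"}): 0.70,
          frozenset({"trailing newlines"}): 0.55,
          frozenset({"price info", "duration info"}): 0.65,
          frozenset({"price info", "trailing newlines"}): 0.45,
          frozenset({"duration info", "trailing newlines"}): 0.80,
          frozenset(COMPONENTS): 0.75,
      }

      def shapley(i: str) -> float:
          """phi_i = sum over S not containing i of |S|!(n-|S|-1)!/n! * (v(S+i) - v(S))."""
          others = [c for c in COMPONENTS if c != i]
          n = len(COMPONENTS)
          total = 0.0
          for k in range(len(others) + 1):
              for subset in combinations(others, k):
                  S = frozenset(subset)
                  weight = factorial(len(S)) * factorial(n - len(S) - 1) / factorial(n)
                  total += weight * (TOY_V[S | {i}] - TOY_V[S])
          return total

      for c in COMPONENTS:
          print(c, round(shapley(c), 3))
      # The attributions sum to v(all) - v(none); whether that counts as an
      # "explanation" is exactly the disagreement in this thread.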
