Proof of the Gumbel Max Trick
#Statement
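Assume that a discrete random variable $X$ and noise variables $G_1, \dots, G_n$ satisfy

$$\Pr(X = k) = \pi_k, \qquad G_k \overset{\text{i.i.d.}}{\sim} \mathrm{Gumbel}(0, 1), \qquad k = 1, \dots, n,$$

where $\pi_k > 0$, $\sum_{k=1}^{n} \pi_k = 1$, and $\mathrm{Gumbel}(0, 1)$ has CDF $F(g) = \exp(-e^{-g})$. Then

$$\arg\max_{k} \left( \log \pi_k + G_k \right) \overset{d}{=} X.$$

#Proof
Set $Z_k = \log \pi_k + G_k$ and $Y = \arg\max_k Z_k$. Each $Z_k$ is a Gumbel variable shifted by $\log \pi_k$, so its CDF and density are

$$F_k(z) = \exp\left(-e^{-(z - \log \pi_k)}\right) = \exp\left(-\pi_k e^{-z}\right), \qquad f_k(z) = \pi_k e^{-z} \exp\left(-\pi_k e^{-z}\right).$$

The $Z_k$ are independent and continuous, so ties occur with probability zero, and $Y = k$ exactly when $Z_j < Z_k$ for all $j \neq k$. Conditioning on the value of $Z_k$ gives

$$
\begin{aligned}
\Pr(Y = k) &= \int_{-\infty}^{\infty} f_k(z) \prod_{j \neq k} F_j(z) \, dz \\
&= \int_{-\infty}^{\infty} \pi_k e^{-z} \exp\left(-\pi_k e^{-z}\right) \exp\left(-(1 - \pi_k) e^{-z}\right) dz \\
&= \pi_k \int_{-\infty}^{\infty} e^{-z} \exp\left(-e^{-z}\right) dz \\
&= \pi_k \left[ \exp\left(-e^{-z}\right) \right]_{-\infty}^{\infty} = \pi_k,
\end{aligned}
$$

where the second line uses $\sum_{j \neq k} \pi_j = 1 - \pi_k$. Hence $Y$ and $X$ have the same distribution.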
#Application
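The trick is commonly used in deep learning to sample from a discrete distribution in a reparameterized form: all randomness is isolated in the Gumbel noise, and replacing the non-differentiable $\arg\max$ with a temperature-controlled softmax (the Gumbel-softmax relaxation) makes the sampling step differentiable with respect to the distribution's parameters.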
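As a rough numerical illustration, the sketch below adds independent Gumbel(0, 1) noise to log-probabilities and takes the argmax, then compares the empirical frequencies with the target probabilities. It also shows a softmax relaxation of the argmax (usually called Gumbel-softmax), which is one common way to obtain a differentiable surrogate. The function names, the example probabilities, and the temperature value are assumptions made for this example and do not come from the original post.

```python
import numpy as np


def sample_gumbel(shape, rng):
    """Draw i.i.d. Gumbel(0, 1) noise via the inverse CDF: G = -log(-log(U))."""
    # Keep U strictly positive so that log(U) is finite.
    u = rng.uniform(low=np.finfo(np.float64).tiny, high=1.0, size=shape)
    return -np.log(-np.log(u))


def gumbel_max_sample(log_probs, num_samples, rng):
    """Draw categorical samples as argmax_k (log pi_k + G_k)."""
    g = sample_gumbel((num_samples, log_probs.shape[0]), rng)
    return np.argmax(log_probs + g, axis=1)


def gumbel_softmax_relaxed(log_probs, temperature, rng):
    """Relax the argmax to a softmax with the given temperature (hypothetical helper)."""
    g = sample_gumbel(log_probs.shape, rng)
    y = (log_probs + g) / temperature
    y = y - y.max()  # subtract the max for numerical stability
    e = np.exp(y)
    return e / e.sum()


# Example probabilities chosen only for illustration.
probs = np.array([0.1, 0.2, 0.3, 0.4])
rng = np.random.default_rng(0)

samples = gumbel_max_sample(np.log(probs), 200_000, rng)
freqs = np.bincount(samples, minlength=probs.size) / samples.size
print("empirical frequencies:", freqs)

soft = gumbel_softmax_relaxed(np.log(probs), temperature=0.5, rng=rng)
print("one relaxed sample:", soft)
```

The empirical frequencies should land close to `probs`, up to sampling noise. The relaxed sample is a point on the probability simplex rather than a hard index; lowering the temperature pushes it toward a one-hot vector, at the cost of higher-variance gradients when used inside an autodiff framework.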
#References
Author: hsfzxjy.
Link: https://i.hsfzxjy.site/proof-of-gumbel-max-trick/.
License: CC BY-NC-ND 4.0.
All rights reserved by the author.
Commercial use of this post in any form is NOT permitted.
Non-commercial use of this post should be attributed with this block of text.