An alternative construction of Shannon entropy – Rich Pang
TL;DR: Shannon’s entropy formula is usually justified by showing it satisfies key mathematical criteria, or by computing how much space is needed to encode a variable. But one can also construct Shannon’s formula starting purely from the simpler notion of entropy as a (logarithm of a) count—of how many different ways a distribution could have emerged from a sequence of samples.
Entropy of a probability distribution
The underpinning of information theory is Shannon’s formula for the entropy H of ...
Read more at rkp.science