Hello @YodaEmbedding,
I was reading your answer in this previously posted issue:
> For a single fixed encoding distribution $p$, the average rate cost for encoding a single symbol that is drawn from the same distribution $p$ is:
>
> $$R = \sum_t - p(t) \, \log p(t)$$
>
> But this is not what we're doing. What we're actually interested in is the cross-entropy, that is, the average rate cost for encoding a single symbol drawn from the true distribution $\hat{p}$:
>
> $$R = \sum_t - \hat{p}(t) \, \log p(t)$$
>
> To be consistent with our notation above, we should also sprinkle in some $i$s:
>
> $$R_i = \sum_t - \hat{p}_i(t) \, \log p_i(t)$$
>
> In our case, we know exactly what $\hat{p}$ is...
>
> $$\hat{p}_i(t) = \delta[t - \hat{y}_i] = \begin{cases} 1 & \text{if } t = \hat{y}_i \\ 0 & \text{otherwise} \end{cases}$$
>
> If we plug this into the earlier equation, the rate cost for encoding the $i$-th element becomes:
>
> $$R_i = -\log p_i(\hat{y}_i)$$

_Originally posted by @YodaEmbedding in #314_
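To make sure I follow the final step of this derivation, I checked it numerically. Below is a minimal sketch in plain NumPy; the encoding distribution `p` and the observed symbol index `y_hat` are made-up values for illustration:

```python
import numpy as np

# Made-up encoding distribution p_i over 5 possible symbol values.
p = np.array([0.1, 0.2, 0.4, 0.2, 0.1])

# Suppose the symbol actually observed is t = 2 (i.e. y_hat_i = 2).
y_hat = 2

# "True" distribution: a delta that puts all its mass on the observed symbol.
p_true = np.zeros_like(p)
p_true[y_hat] = 1.0

# Cross-entropy: R_i = -sum_t p_true(t) * log p_i(t).
# Terms where p_true(t) == 0 contribute nothing, so mask them out.
cross_entropy = -np.sum(np.where(p_true > 0, p_true * np.log(p), 0.0))

# Direct form from the derivation: R_i = -log p_i(y_hat_i).
direct = -np.log(p[y_hat])

print(cross_entropy, direct)  # both ≈ 0.916 nats
assert np.isclose(cross_entropy, direct)
```

The two numbers agree, so the algebra itself makes sense to me.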
I’m trying to better understand the reasoning behind this formulation, and I have two main questions:
- Why is the true distribution $\hat{p}_i$ considered different from the encoding distribution $p_i$? What does it mean to refer to a “true” distribution in this context?
- Why is $\hat{p}_i$ modeled as a delta function? This seems to imply that there is no uncertainty at all: only one possible symbol, $\hat{y}_i$, with probability 1. If that’s the case, what motivates using a probabilistic framework at all?
Thanks in advance for any clarification!