Just a quick note, I find it disappointing that so many point-wise estimators are used in general. If we have a given metric for error, $d\left(y,\hat{y}\right)$, and a prior and likelihood, it seems intuitive that optimization should focus on solving:$$\underset{\hat{y}}{\mathrm{argmin}} \int d\left(y,\hat{y}\right) p\left(x|y\right)p\left(y\right)dy$$But usually you don't see it cast like this, which is a shame because you can end up with the wrong peak and bad overfit. Plus, you can put all sorts of fun regularizers in the prior, such as variation minimizers. Anyway, bad post, just was living on my mind.
Subscribe to:
Post Comments (Atom)
March thoughts
Lets start by taking any system of ordinary differential equations. We can of course convert this to a first order system by creating stand-...
-
A couple days ago I encountered a really neat solution to creating a basis using a Cholesky decomposition. Particularly, when we have a set...
-
Given a vector $\vec{x}$ with unit magnitude, we want to find a $m \times n$ matrix B such that $BB^T = I$, with first row $\vec{B}_0 = \vec...
-
Boring setup: Suppose we have a probability mass function which we choose to represent as a vector $\vec{p} = \{ p_0, p_1, p_2, \cdots p_...
No comments:
Post a Comment