Add subset selection application

author: Thibaut Horel <thibaut.horel@gmail.com> 2015-04-02 12:59:58 -0400
committer: Thibaut Horel <thibaut.horel@gmail.com> 2015-04-02 12:59:58 -0400
commit: 4ad2f2ed18bacf270db38f4aaf5c35e13615a98e (patch)
tree: 57500b3312c9e5297250c4c5c56dd9d2686dfcc1
parent: b07276072fd182c9228e6a6b800f0390672d05f1 (diff)
download: learn-optimize-4ad2f2ed18bacf270db38f4aaf5c35e13615a98e.tar.gz
1 files changed, 9 insertions, 0 deletions
diff --git a/results.tex b/results.tex
index 8e6a186..ce5368d 100644
--- a/results.tex
+++ b/results.tex
@@ -209,6 +209,15 @@ sets are the sets of size at most one).
         This can be written as multivariate concave over modular
         (\textbf{TODO:} I think multivariate concave over modular is not
         submodular in general, it is for $\log\det$. Understand this better).
+    \item \emph{data subset selection/summarization:} in statistical machine
+        translation, Bilmes used a sum of concave over modular functions:
+        \begin{displaymath}
+            f(S) = \sum_{u} \lambda_u \phi\left(\sum_{e\in S}w_u(e)\right)
+        \end{displaymath}
+        where each $u$ indexes a feature, $w_u(e)$ measures how much of feature
+        $u$ element $e$ has, and the concave function $\phi$ captures the
+        diminishing marginal gain once $S$ already has a lot of that feature.
+        Facility location functions are also commonly used for subset selection.
 \end{itemize}
 
 \section{Passive Optimization}
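
The sum of concave over modular objective added in this hunk can be evaluated on a toy instance to make the "decreasing marginal gain" remark concrete. The following is a minimal sketch, not code from the repository: the helper names (`concave_over_modular`, `marginal_gain`), the toy weights, and the choice $\phi = \sqrt{\cdot}$ are all assumptions made for illustration, and the weights are assumed non-negative.

```python
import math


def concave_over_modular(S, w, lam, phi=math.sqrt):
    """Evaluate sum_k lam[k] * phi(sum_{e in S} w[k][e]).

    S   : set of selected elements
    w   : list of dicts, w[k][e] = amount of feature k in element e
    lam : list of feature weights (assumed non-negative)
    phi : concave function (sqrt is an assumed, illustrative choice)
    """
    return sum(
        lam_k * phi(sum(w_k[e] for e in S))
        for lam_k, w_k in zip(lam, w)
    )


# Hypothetical toy data: 2 features, 3 elements.
w = [
    {"a": 1.0, "b": 1.0, "c": 0.0},  # feature 0
    {"a": 0.0, "b": 0.0, "c": 4.0},  # feature 1
]
lam = [1.0, 1.0]


def marginal_gain(e, S):
    """Gain in objective value from adding element e to the set S."""
    return (concave_over_modular(S | {e}, w, lam)
            - concave_over_modular(S, w, lam))


# Adding "b" to the empty set versus to a set that already covers feature 0.
gain_small = marginal_gain("b", set())
gain_large = marginal_gain("b", {"a"})
```

Here `gain_small` compares $\sqrt{1}$ against $\sqrt{0}$, while `gain_large` compares $\sqrt{2}$ against $\sqrt{1}$, so the second gain is smaller: element `b` contributes less once `a` has already supplied the same feature.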