1 files changed, 91 insertions, 65 deletions
diff --git a/problem.tex b/problem.tex
index a8b89f4..5631f7a 100644
--- a/problem.tex
+++ b/problem.tex
@@ -1,4 +1,4 @@
-\subsection{Optimal Experimental Design}
+\subsection{Experimental Design}
 
 The theory of experimental design \cite{pukelsheim2006optimal,atkinson2007optimum}  studies how an experimenter should select the parameters of a set of experiments she is about to conduct. In general, the optimality of a particular design depends on the purpose of the experiment, \emph{i.e.}, the quantity the experimenter is trying to learn or the hypothesis she is trying to validate. Due to their ubiquity in statistical analysis, a large literature on the subject focuses on learning \emph{linear models}, whereby the experimenter wishes to  fit  a linear map to the data she has collected.
 
@@ -8,7 +8,7 @@ More precisely, putting cost considerations aside, suppose that an experimenter
 \end{align}
 where $\beta$ a vector in $\reals^d$, commonly referred to as the \emph{model}, and $\varepsilon_i$ (the \emph{measurement noise}) are independent, normally distributed random  variables with zero mean and variance $\sigma^2$. 
 
-The purpose of these experiments is to allow the experimenter to estimate the model $\beta$. In particular, assuming Gaussian noise, the maximum likelihood estimator of $\beta$ is the \emph{least squares} estimator: for $X_S=[x_i]_{i\in S}\in \reals^{|S|\times d}$ the matrix of experiment features and
+The purpose of these experiments is to allow the experimenter to estimate the model $\beta$. In particular, under \eqref{model}, the maximum likelihood estimator of $\beta$ is the \emph{least squares} estimator: for $X_S=[x_i]_{i\in S}\in \reals^{|S|\times d}$ the matrix of experiment features and
 $y_S=[y_i]_{i\in S}\in\reals^{|S|}$ the observed measurements, 
 \begin{align} \hat{\beta} &=\max_{\beta\in\reals^d}\prob(y_S;\beta) =\argmin_{\beta\in\reals^d } \sum_{i\in S}(\T{\beta}x_i-y_i)^2 \nonumber\\
 & = (\T{X_S}X_S)^{-1}X_S^Ty_S\label{leastsquares}\end{align} 
@@ -20,9 +20,9 @@ the noise terms $\varepsilon_i$). In particular, $\hat{\beta}$ has mean $\beta$
 (\emph{i.e.}, it is an \emph{unbiased estimator}) and covariance
 $(\T{X_S}X_S)^{-1}$.
 
-Let $V:2^\mathcal{N}\to\reals$ be a value function, quantifying how informative a set of experiments $S$ is in estimating $\beta$. The standard optimal experimental design problem amounts to finding a set $S$ that maximizes $V(S)$ subject to the constraint $|S|\leq k$. 
+Let $V:2^\mathcal{N}\to\reals$ be a \emph{value function}, quantifying how informative a set of experiments $S$ is in estimating $\beta$. The standard optimal experimental design problem amounts to finding a set $S$ that maximizes $V(S)$ subject to the constraint $|S|\leq k$. 
 
-A variety of different value functions are used in experimental design\cite{pukelsheim2006optimal}; almost all make use of the covariance $(\T{X_S}X_S)^{-1}$ of the estimator $\hat{\beta}$. A value functioned preferred because of its relationship to entropy is the \emph{$D$-optimality criterion}: %which yields the following optimization problem
+A variety of different value functions are used in experimental design~\cite{pukelsheim2006optimal}; almost all make use of the covariance $(\T{X_S}X_S)^{-1}$ of the estimator $\hat{\beta}$. A value function preferred because of its relationship to entropy is the \emph{$D$-optimality criterion}: %which yields the following optimization problem
 \begin{align}
  V(S) &= \frac{1}{2}\log\det \T{X_S}X_S \label{dcrit} %\\
 \end{align}
@@ -38,97 +38,123 @@ the uncertainty on $\beta$, as captured by the entropy of its estimator.
 %\end{align}
 %There are several  reasons
 
-Value function \eqref{dcrit} has several appealing properties. To begin with, it is a submodular set function (see Lemma~\ref{...} and Thm.~\ref{...}). In addition, the maximization of convex relaxations of this function is a  well-studied problem \cite{boyd}. Note that \eqref{dcrit} is undefined when $\mathrm{rank}(\T{X_S}X_S)<d$; in this case, we take $V(S)=-\infty$ (so that $V$ takes values in the extended reals).
-
+ Value function \eqref{dcrit} is undefined when $\mathrm{rank}(\T{X_S}X_S)<d$; in this case, we take $V(S)=-\infty$ (so that $V$ takes values in the extended reals).
+Note that \eqref{dcrit} is a submodular set function, \emph{i.e.}, 
+$V(S)+V(T)\geq V(S\cup T)+V(S\cap T)$ for all $S,T\subseteq \mathcal{N}$; it is also monotone, \emph{i.e.}, $V(S)\leq V(T)$ for all $S\subset T$. %In addition, the maximization of convex relaxations of this function is a  well-studied problem \cite{boyd}.
 
-\subsection{Budget Feasible Mechanism Design}
-In this paper, we approach the problem of optimal experimental design from the
-perspective of \emph{a budget feasible reverse auction}
-\cite{singer-mechanisms}. In particular, we assume that each experiment $i\in
-\mathcal{N}$ is associated with a cost $c_i$, that the experimenter needs to
-pay in order to conduct the experiment.  The experimenter has a budget
-$B\in\reals_+$. In the \emph{full information case}, the costs are common
-knowledge; optimal design in this context amounts to selecting a set $S$
+\subsection{Budget Feasible Reverse Auctions}
+A \emph{budget feasible reverse auction}
+\cite{singer-mechanisms} comprises a set of items $\mathcal{N}=\{1,\ldots,n\}$ as well as a single buyer. Each item has a cost $c_i\in\reals_+$. Moreover, the buyer has a positive value function  $V:2^{\mathcal{N}}\to \reals_+$, as well as a budget $B\in \reals_+$. In the \emph{full information case}, the costs $c_i$ are common
+knowledge; the objective of the buyer in this context is to select a set $S$
 maximizing the value $V(S)$ subject to the constraint $\sum_{i\in S} c_i\leq
-B$. We write:
+B$.We write:
 \begin{equation}\label{eq:non-strategic}
     OPT(V,\mathcal{N}, B) = \max_{S\subseteq\mathcal{N}} \left\{V(S) \mid
     \sum_{i\in S}c_i\leq B\right\}
 \end{equation}
-the best value we can reach under the budget constraint $B$ when selecting
-experiments from the set $\mathcal{N}$.
+for the optimal value achievable in the full-information case. \stratis{Should be $OPT(V,c,B)$\ldots better drop the arguments here and introduce them wherever necessary.}
 
-As in \cite{singer-mechanisms,chen}, we focus in a \emph{strategic scenario}:
-experiment $i$ corresponds to a \emph{strategic agent}, whose cost $c_i$ is
-private.  For example, each $i$ may correspond to a human participant; the
-feature vector $x_i$ may correspond to a normalized vector of her age, weight,
-gender, income, \emph{etc.}, and the measurement $y_i$ may capture some
-biometric information (\emph{e.g.}, her red cell blood count, a genetic marker,
-etc.). The cost $c_i$ is the amount the participant deems sufficient to
-incentivize her participation in the study.
-
-A mechanism $\mathcal{M} = (f,p)$ comprises (a) an \emph{allocation function}
+In the \emph{strategic case}, each items in $\mathcal{N}$ is held by a different strategic agent, whose cost is a-priori private.
+A \emph{mechanism} $\mathcal{M} = (f,p)$ comprises (a) an \emph{allocation function}
 $f:\reals_+^n \to 2^\mathcal{N}$ and (b) a \emph{payment function}
 $p:\reals_+^n\to \reals_+^n$. Given the
-vector or costs $[c_i]_{i\in\mathcal{N}}$, the allocation function determines the set
-$S\subseteq \mathcal{N}$ of experiments to be conducted, while the payment function
-returns a vector of payments $[p_i]_{i\in \mathcal{N}}$. As in
+vector or costs $c=[c_i]_{i\in\mathcal{N}}$, the allocation function $f$ determines the set in
+$ \mathcal{N}$ of items to be purchased, while the payment function
+returns a vector of payments $[p_i]_{i\in \mathcal{N}}$. Let $s_i(c) = \id_{i\in f_i(c)}$ be the binary indicator of  $i\in f(c)$. As in
 \cite{singer-mechanisms, chen}, we study mechanisms that are normalized
-($i\notin S$ implies $p_i=0$), individually rational ($p_i\geq c_i$) and have
-no positive transfers $p_i\geq 0$. 
+($i\notin f(c)$ implies $p_i(c)=0$), individually rational ($p_i(c)\geq c_i\cdot s_i(c)$) and have
+no positive transfers ($p_i(c)\geq 0$). 
 
-Moreover, we want to design a mechanism which has the following properties:
+In addition to the above, mechanism design in budget feasible reverse auctions seeks mechanisms that have the following properties \cite{singer-mechanisms,chen}:
 \begin{itemize}
-    \item \emph{Truthful.} An agent has no incentive to report a cost $c_i'$
-    different from his true cost $c_i$. Formally, let us write $(c_i', c_{-i})$
-    the vector of costs when agent $i$ reports cost $c_i'$ (all the other costs
-    $c_{-i}$ being the same). Let $[p_i']_{i\in \mathcal{N}} = p(c_i',
-    c_{-i})$, then the mechanism is truthful iff (a) $i\notin
-    f(c_i,c_{-i})\implies i\notin f(c_i',c_{-i})$ and (b) $p_i - c_i \geq p_i'
-    - c_i$.
-    \item \emph{Budget Feasible.} The sum of the payments should not exceed the
+    \item \emph{Truthfulness.} An agent has no incentive to missreport her true cost. Formally, let $c_{-i}$
+ be a vector of costs of all agents except $i$. %    $c_{-i}$ being the same). Let $[p_i']_{i\in \mathcal{N}} = p(c_i',
+%    c_{-i})$, then the 
+A mechanism is truthful iff  every $i \in \mathcal{N}$ and every two cost vectors $(c_i,c_{-i})$ and $(c_i',c_{-i})$
+ $p_i(c_i,c_{-i}) - s_i(c_i,c_{-i})\cdot c_i \geq p_i(c_i',c_{-i}) - s(c_i',c_{-i})\cdot c_i$.
+    \item \emph{Budget Feasibility.} The sum of the payments should not exceed the
     budget constraint:
     \begin{displaymath}
         \sum_{i\in\mathcal{N}} p_i \leq B.
     \end{displaymath}
     \item \emph{Approximation ratio.} The value of the allocated set should not
-    be too far from the optimum value of the non-strategic case
+    be too far from the optimum value of the full information case
     \eqref{eq:non-strategic}. Formally, there must exist some $\alpha\geq 1$
     such that:
     \begin{displaymath}
         OPT(V,\mathcal{N}, B) \leq \alpha V(S).
     \end{displaymath}
-    The approximation ratio captures the \emph{price of truthfulness}: this is
-    what you loose by moving from the non-strategic case to the strategic case
-    with a truthfulness constraint.
+    The approximation ratio captures the \emph{price of truthfulness}, \emph{i.e.},  the relative value loss incurred by adding the truthfulness constraint. % to the value function maximization.
     \item \emph{Computationally efficient.} The allocation and payment function
-    should be computable in polynomial time in the number $|\mathcal{N}|$ of
-    agents. \thibaut{Should we say something about the black-box model for $V$?
-    Needed to say something in general, but not in our case where the value
-    function can be computed in polynomial time}.
+    should be computable in polynomial time in the number of
+    agents $n$. %\thibaut{Should we say something about the black-box model for $V$?  Needed to say something in general, but not in our case where the value function can be computed in polynomial time}.
 \end{itemize}
 
-Note that this is a \emph{single parameter} auction: each agent has only one
-private value. In this case, Myerson's theorem \cite{myerson} gives
+As noted in \cite{singer-mechanisms, chen}, budget feasible reverse auctions are  \emph{single parameter} auctions: each agent has only one
+private value. In this case, Myerson's Theorem \cite{myerson} gives
 a characterization of truthful mechanisms.
 
-\begin{theorem}[Myerson \cite{myerson}]\label{thm:myerson}
-A normalized mechanism $\mathcal{M} = (f,p)$ for a single parameter auction is
+\begin{lemma}[Myerson \cite{myerson}]\label{thm:myerson}
+\sloppy A normalized mechanism $\mathcal{M} = (f,p)$ for a single parameter auction is
 truthful iff:
-\begin{enumerate}
-\item $f$ is \emph{monotone}: for any agent $i$ and $c_i' \leq c_i$, for any
+%\begin{enumerate}
+%\item 
+(a) $f$ is \emph{monotone}, \emph{i.e.}, for any agent $i$ and $c_i' \leq c_i$, for any
 fixed costs $c_{-i}$ of agents in $\mathcal{N}\setminus\{i\}$, $i\in f(c_i,
-c_{-i})\implies i\in f(c_i', c_{-i})$.
-\item agents are paid their \emph{threshold payments}: agent $i$ receives
-$\inf\{c_i: i\in f(c_i, c_{-i})\}$.
-\end{enumerate}
-\end{theorem}
+c_{-i})$ implies $i\in f(c_i', c_{-i})$, and (b)
+%\item
+ agents are paid \emph{threshold payments}, \emph{i.e.}, for all $i\in f(c)$, $p_i(c)=\inf\{c_i': i\in f(c_i', c_{-i})\}$.
+%\end{enumerate}
+\end{lemma}
+\fussy
+Myerson's Theorem is particularly useful when designing a budget feasible truthful
+mechanism, as it allows us to focus on designing a monotone allocation function. Then, the
+mechanism will be truthful as long as we give each agent her threshold payment---the caveat being that the latter need to sum to a value below $B$.
+
+\subsection{Budget Feasible Experimental Design}
+In this paper, we approach the problem of optimal experimental design from the
+perspective of a budget feasible reverse auction, as defined above.
+ In particular, we assume the experimenter has a budget
+$B\in\reals_+$ and plays the role of the buyer. 
+Each experiment $i\in
+\mathcal{N}$ corresponds to a strategic agent, whose cost $c_i$ is
+private. In order to obtain the measurement $y_i$, the experimenter  needs to
+pay agent $i$ a price that exceeds her cost. 
+
+For example, each $i$ may correspond to a human participant; the
+feature vector $x_i$ may correspond to a normalized vector of her age, weight,
+gender, income, \emph{etc.}, and the measurement $y_i$ may capture some
+biometric information (\emph{e.g.}, her red cell blood count, a genetic marker,
+etc.). The cost $c_i$ is the amount the participant deems sufficient to
+incentivize her participation in the study. Note that, in this setup, the feature vectors $x_i$ are public information that the experimenter can consult prior the experiment design. Moreover, though a participant may lie about her true cost $c_i$, she cannot lie about $x_i$ (\emph{i.e.}, all features are verifiable upon collection) or $y_i$ (\emph{i.e.}, she cannot falsify her measurement).
+
+%\subsection{D-Optimality Criterion}
+Ideally, motivated by the $D$-optimality criterion, we would like to design a mechanism that maximizes \eqref{dcrit} within a good approximation ratio. As \eqref{dcrit} may take arbitrarily small negative values, to define a meaningful approximation one would consider  the (equivalent) maximization of $V(S) = \det\T{X_S}X_S$. %, for some strictly increasing, on-to function $f:\reals_+\to\reals_+$. 
+However, the following lower bound implies that such an optimization goal cannot be attained under the costraints of  truthfulness, budget feasibility, and individional rationallity.
+
+\begin{lemma}
+For any $M>1$, there is no $M$-approximate, truthful, budget feasible, individionally rational mechanism for a budget feasible reverse auction with value fuction $V(S) = \det{\T{X_S}X_S}$.
+\end{lemma}
+\begin{proof}
+\input{proof_of_lower_bound1}
+\end{proof}
+
+This negative result motivates us to study a problem with a modified objective:
+\begin{center}
+\textsc{ExperimentalDesign} (EDP)
+\begin{subequations}
+\begin{align}
+\text{Maximize}\quad V(S) &= \log\det(I_d+\T{X_S}X_S) \label{modified} \\
+\text{subject to}\quad \sum_{i\in S} c_i&\leq B
+\end{align}\label{edp}
+\end{subequations}
+\end{center} where $I_d\in \reals^{d\times d}$ is the identity matrix.
+One possible interpretation of \eqref{modified} is that, prior to the auction, the experimenter has free access to $d$ experiments whose features form an orthonormal basis in $\reals^d$. However, \eqref{modified} can also be motivated in the context of \emph{Bayesian experimental design} \cite{chaloner1995bayesian}. In short, the objective \eqref{modified} arises naturally when the experimenter retrieves the model $\beta$ through \emph{ridge regression}, rather than the linear regression \eqref{leastsquares} over the observed data; we explore this connection in Section~\ref{sec:bed}. 
+
+Note that maximizing \eqref{modified} is equivalent to maximizing \eqref{dcrit} in the full-information case. In particular, $\det(\T{X_S}X_S)> \det(\T{X_{S'}}X_{S'})$ iff $\det(I_d+\T{X_S}X_S)>\det(I_d+\T{X_{S'}}X_{S'})$. In addition, \eqref{edp} (and the equivalent problem with objective \eqref{dcrit}) are NP-hard; to see this, note that \textsc{Knapsack} reduces to EDP with dimension $d=1$ by mapping the weight of each item $w_i$ to an experiment with $x_i=w_i$. Nevertheless, \eqref{modified} is submodular, monotone and satifies $V(\emptyset) = 0$, allowing us to use the extensive machinery for the optimization of submodular functions, as well as recent results in the context of budget feasible auctions \cite{chen,singer-mechanisms}. 
+
+\stratis{A potential problem is that all of the above properties hold also for, \emph{e.g.}, $V(S)=\log(1+\det(\T{X_S}X_S))$\ldots}
 
-This theorem is particularly useful when designing a budget feasible truthful
-mechanism: we can focus on designing a monotone allocation function. Then, the
-mechanism will be truthful as long as we give each agent her threshold payment.
-Finally, it suffices to prove that the sum of threshold payments does not
-exceed the budget to ensure budget feasibility.
 
 \begin{comment}
 \subsection{Experimental Design}