diff options
Diffstat (limited to 'intro.tex')
| -rw-r--r-- | intro.tex | 6 |
1 files changed, 5 insertions, 1 deletions
@@ -1,4 +1,8 @@ -In the classic setting of experimental design \cite{pukelsheim2006optimal,atkinson2007optimum}, +The statistical analysis of user data is a widely spread practice among Internet companies, which routinely use machine learning techniques over vast records of user data to perform inference and classification tasks integral to their daily operations. Statistical analysis of personal data collected through surveys or experimentation is also the cornerstone of marketing research, as well as research in a variety of experimental sciences such as medicine and sociology. + +This state of affairs has motivated several recent studies of \emph{data markets}, in which an analyst wishes to purchase data from a set of users \cite{...}. Eah user discloses her data to the analyst only if she receives an appropriate compensation. Assuming that the analyst has a limited budget, a natural question to ask is how she should allocate her budget across different users. + + In the classic setting of experimental design \cite{pukelsheim2006optimal,atkinson2007optimum}, an {\em experimenter} \E\ has access to a population of $n$ potential experiment subjects. Each subject $i\in \{1,\ldots,n\}$ is associated with a set of parameters (or features) $x_i\in \reals^d$, known to the experimenter. |
