summaryrefslogtreecommitdiffstats
diff options
context:
space:
mode:
-rw-r--r--intro.tex6
-rw-r--r--stoc_paper.pdfbin328249 -> 307619 bytes
2 files changed, 5 insertions, 1 deletions
diff --git a/intro.tex b/intro.tex
index 68eb17f..d43d62b 100644
--- a/intro.tex
+++ b/intro.tex
@@ -1,4 +1,8 @@
-In the classic setting of experimental design \cite{pukelsheim2006optimal,atkinson2007optimum},
+The statistical analysis of user data is a widely spread practice among Internet companies, which routinely use machine learning techniques over vast records of user data to perform inference and classification tasks integral to their daily operations. Statistical analysis of personal data collected through surveys or experimentation is also the cornerstone of marketing research, as well as research in a variety of experimental sciences such as medicine and sociology.
+
+This state of affairs has motivated several recent studies of \emph{data markets}, in which an analyst wishes to purchase data from a set of users \cite{...}. Eah user discloses her data to the analyst only if she receives an appropriate compensation. Assuming that the analyst has a limited budget, a natural question to ask is how she should allocate her budget across different users.
+
+ In the classic setting of experimental design \cite{pukelsheim2006optimal,atkinson2007optimum},
an {\em experimenter} \E\ has access to a population of $n$ potential experiment subjects.
Each subject $i\in \{1,\ldots,n\}$ is associated with a set of parameters (or features) $x_i\in \reals^d$,
known to the experimenter.
diff --git a/stoc_paper.pdf b/stoc_paper.pdf
index 9ae80aa..7b322ef 100644
--- a/stoc_paper.pdf
+++ b/stoc_paper.pdf
Binary files differ