summaryrefslogtreecommitdiffstats
path: root/experimental.tex
diff options
context:
space:
mode:
Diffstat (limited to 'experimental.tex')
-rw-r--r--experimental.tex51
1 files changed, 45 insertions, 6 deletions
diff --git a/experimental.tex b/experimental.tex
index 2f4d6d4..127cd78 100644
--- a/experimental.tex
+++ b/experimental.tex
@@ -36,14 +36,13 @@ a skeleton was detected by the Microsoft SDK. For each frames the
following data is available:
\begin{itemize}
\item the 3D coordinates of 20 body joints
-\item the z-value: this is the distance from the detected skeleton to
- the camera
+\item a picture recored by the video camera
\end{itemize}
-For some of frames, one or several joints were occluded by another
-part of the body. In those cases, the coordinates of these joints are
+For some of frames, one or several joints are occluded by another part
+of the body. In those cases, the coordinates of these joints are
either absent from the frame or present but tagged as \emph{Inferred}
-by the Microsoft SDK. It means that even though the joint was not
-present on the frame, the skeleton-fitting algorithm was able to guess
+by the Microsoft SDK. It means that even though the joint is not
+present on the frame, the skeleton-fitting algorithm is able to guess
its location.
Each frame also has a skeleton ID number. If this numbers stays the
@@ -52,6 +51,46 @@ algorithm was able to detect the skeleton in a contiguous way. This
allows us to define the concept of a \emph{run}: a sequence of frames
with the same skeleton ID.
+Ground truth person recognition is obtained by manually labelling each
+run based on the images captured by the video camera of the
+Kinect. For ease of labelling, only the runs with people walking
+toward the camera are kept. These are the runs where the average
+distance from the skeleton joints to the camera is increasing.
+
+Several reductions are then applied to the data set to extract
+\emph{features} from the raw data:
+\begin{itemize}
+\item from the joints coordinates, the lengths of 15 body parts are
+ computed. These are distances between two contiguous joints in the
+ human body. If one of the two joints of a body part is not present
+ or inferred in a frame, the corresponding body part is reported as
+ absent for that frame.
+\item the number of features is then reduced to 9 by using the
+ vertical symmetry of the human body: if two body parts are symmetric
+ about the vertical axis, we bundle them into one feature by
+ averaging their lengths. If only one of them is present, we take the
+ value of its counterpart. If none of them are present, the feature
+ is reported as missing for this frame. The resulting nine features
+ are: Head-ShoulderCenter, ShoulderCenter-Shoulder, Shoulder-Elbow,
+ Elbow-Wrist, ShoulderCenter-Spine, Spine-HipCenter,
+ HipCenter-HipSide, HipSide-Knee, Knee-Ankle.
+\item finally, all the frames where one of the 9 features is missing
+ are filtered out.
+\end{itemize}
+
+Table \ref{tab:dataset} summarizes some statistics about the resulting
+dataset.
+
+\begin{table}
+\begin{tabular}{cc}
+
+\end{tabular}
+\caption{}
+\label{tab:dataset}
+\end{table}
+
+
+
%%% Local Variables: