diff options
| author | Guillaume Horel <guillaume.horel@serenitascapital.com> | 2013-08-21 15:40:59 -0400 |
|---|---|---|
| committer | Guillaume Horel <guillaume.horel@serenitascapital.com> | 2013-08-21 15:40:59 -0400 |
| commit | f7168bf05ec4b976dd74b357bff7ff54d0693f13 (patch) | |
| tree | 07d1ad7828ba30265a1291c29fbf4cffc9f59905 /parsepdftext.py | |
| parent | 17c23c5d2b6680f90117a7804e65dd7fe541848f (diff) | |
| download | ocr-layer-curation-f7168bf05ec4b976dd74b357bff7ff54d0693f13.tar.gz | |
Simplify datastructure
An alignment is now a list of list. Empty list means word maps to
nothing, and len(list) greater than one means a word maps to multiple words.
This removes the artificial distinction between index and tuple.
Diffstat (limited to 'parsepdftext.py')
0 files changed, 0 insertions, 0 deletions
