Before you can make the fullest use of text you have captured or created, you may need to correct errors, enhance quality, add markup to identify structural elements, and add description or commentary by adding metadata or annotations. These processes are addressed below under the headings of of cleanup, editing, and annotation.
The data you retrieve from digital collections at Columbia or elsewhere on the web or produce by photography, recording, scanning, or OCR will often not be of the quality you need for study or presentation, particularly if you are interested in using tools for textual or other content analysis. There may be many errors in the text and layout, images or media files may not be in the optimal form for presentation or need to be combined with one another, or the material may not be divided into the units you need for your work. A variety of tools are available for cleaning up text, numeric, image, and media files.