Text Curation and Formatting

  • XML schema design and encoding
  • Quality Control: large-scale automatic XML validation and repair
  • Metadata production e.g. indexes, terminology
  • Identification of spelling variation and typos (see some example results below from the LACR project)