Corpus Document Project Results

| Group | Machine Readable? | Valid Header? | Valid Body? | Valid Tagset? | Potentially Representative | Potentially Segmentable | Parse Errors | Comment | Total Score |
|---|---|---|---|---|---|---|---|---|---|
| 399111, 387116, 349541, 349481 | 1 | 1 | 0 | 1 | 1 | 0.5 | The element type "chapTitle" must be terminated by the matching end-tag "</chapTitle>". Start location: 84:0 | This is a list rather than a corpus: one file contains no text beyond the verbs, and the other seems to be a converted Excel file. How is one to perform stylometrics or a KWIC analysis on a list with no context? | 4.5 |
| 350921 | 1 | 1 | 0 | 1 | 1 | 0.5 | None (fully machine-readable/parsable) | The body of the document is missing. The header is well tagged. | 4.5 |
| 323531, 349501, 389692 | 0 | 1 | 1 | 1 | 1 | 1 | Numerous and code-breaking | This is, overall, a very good document in terms of design. Unfortunately, there are a lot of performance errors in the tagging (missing tags, incomplete tags, etc.), which hamper machine readability (i.e. it cannot be parsed in its current state). | 5 |
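
The grading rubric is not spelled out in these results, but each reported total is consistent with a plain sum of the six criterion scores (Machine Readable, Valid Header, Valid Body, Valid Tagset, Potentially Representative, Potentially Segmentable). The sketch below checks that reading against the three groups. The summing rule is an assumption inferred from the numbers, and the names `results` and `total_score` are hypothetical.

```python
# Criterion scores per group, copied from the results above.
# Order: machine readable, valid header, valid body, valid tagset,
# potentially representative, potentially segmentable.
# Each entry pairs the six scores with the total reported in the results.
results = {
    "399111, 387116, 349541, 349481": ([1, 1, 0, 1, 1, 0.5], 4.5),
    "350921": ([1, 1, 0, 1, 1, 0.5], 4.5),
    "323531, 349501, 389692": ([0, 1, 1, 1, 1, 1], 5),
}


def total_score(scores):
    """Assumed rule: the total is the unweighted sum of the criterion scores."""
    return sum(scores)


for group, (scores, reported) in results.items():
    computed = total_score(scores)
    status = "matches" if computed == reported else "differs from"
    print(f"{group}: computed {computed} {status} reported {reported}")
```

Under this reading the third group scores highest despite being the only one that fails to parse, because machine readability carries the same weight as every other criterion.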