Project Gutenberg produces electronic texts that it hopes a very large portion of its audience will want and use frequently. For the same reason, Project Gutenberg has resisted requests, demands, and pressures to create authoritative editions. Its goal is to release electronic texts that are 99.9% accurate in the eyes of the general reader, rather than of the scholar in particular.
Encoding format: SGML (OTA DTD)
Paragraph, page divisions, and punctuation of original
Direct speech represented by quotation entity references
Chapters (div) bear IDs in the form C1
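
The chapter ID convention noted above can be illustrated with a short Python sketch that lists chapter IDs found in the SGML source. This is only a rough illustration under stated assumptions: the attribute name `id`, the sample markup, and the function name `chapter_ids` are hypothetical and not taken from the OTA DTD itself. A regular expression is used in place of a real SGML parser.

```python
import re

# Assumption: chapter divisions look like <div ... id="C1"> or <DIV ID=C2>.
# SGML allows unquoted attribute values and case-insensitive names,
# so the pattern tolerates both.
CHAPTER_DIV = re.compile(
    r'<div\d?\b[^>]*\bid\s*=\s*["\']?(C\d+)\b',
    re.IGNORECASE,
)

def chapter_ids(sgml_text):
    """Return chapter IDs (C1, C2, ...) in document order."""
    return [m.group(1).upper() for m in CHAPTER_DIV.finditer(sgml_text)]

# Hypothetical sample markup, not copied from any real text.
sample = (
    '<div type="chapter" id="C1"><p>...</p></div>\n'
    '<DIV ID=C2><P>...</P></DIV>'
)
print(chapter_ids(sample))
```

A pattern match like this is enough for a quick inventory of chapters. Anything that depends on nesting or on resolving the quotation entity references would need a proper SGML parser together with the DTD.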