![]() | ![]() |
| SEARCH | ABOUT | INDEX | NEWS | CORE STANDARDS | TECHNOLOGY REPORTS | EVENTS | LIBRARY | |
|
SEARCH Advanced Search ABOUT Site Map CP RSS Channel Contact Us Sponsoring CP About Our Sponsors NEWS Cover Stories Articles & Papers Press Releases CORE STANDARDS XML SGML Schemas XSL/XSLT/XPath XLink XML Query CSS SVG TECHNOLOGY REPORTS XML Applications General Apps Government Apps Academic Apps EVENTS LIBRARY Introductions FAQs Bibliography Technology and Society Semantics Tech Topics Software Related Standards Historic |
[October 21, 2000] ETCSL project description: "The aim of this project is to produce a 'collected works' of over 400 poetic compositions of the classical [Sumerian] literature, equipped with translations. This standardised, electronically searchable SGML corpus, which is based to a large degree on published materials, comprises some 400 literary compositions of the Isin/Larsa/Old Babylonian Period, amounting to approximately 40,000 lines of verse (excluding Emesal cult songs, literary letters, and magical incantations). The full catalogue can be found elsewhere at this site. The compositions are presented in single-line composite text format (in a standardised transliteration) with newly-prepared English prose translations, and a full bibliographical database, thereby making available for the first time a collected works of Sumerian literature. The corpus is freely available to anyone who wishes to use it via this World Wide Web site... The literature written in Sumerian is the oldest human poetry that can be read, dating from approximately 2100 to about 1650 BC. The main 'classical' corpus can be very roughly estimated at 50,000 lines of verse, including narrative poetry, praise poetry, hymns, laments, prayers, songs, fables, didactic poems, debate poems and proverbs. The majority of this has been reconstructed during the past fifty years from thousands of often fragmentary clay tablets inscribed in cuneiform writing. Relatively few compositions are yet published in satisfactory modern editions. Much is scattered throughout a large number of journals and other publications. Several important poems must still be consulted in twenty-year-old unpublished doctoral dissertations, some with translations which have now become unusable because of progress in our knowledge of the language. Major compositions have not yet been edited at all. The slow progress of research, with little organised collaboration until recently, means that Sumerian literature has [hitherto] remained inaccessible to the majority of those who might wish to read or study it, and virtually unknown to a wider public." "The principal task is the preparation of a complete electronic corpus of reconstructed texts. These are being encoded in Standard Generalized Markup Language (SGML), which will ensure the widest accessibility of the material in the future. The corpus comprises: (1) an information database; (2) transliterations of 13 ancient literary catalogues; (3) composite texts of 409 literary compositions; (4) new translations of all the composite texts. The emphasis is on coherent, readable English prose. These will enlarge immensely the accessibility and usability of the corpus by scholars not conversant with Sumerian, including both archaeological and art-historical specialists within the field and comparative and cultural historians from outside it." Principal researchers on this project Dr Jeremy Black, Dr Graham Cunningham, Dr Gábor Zólyomi, and Dr Eleanor Robson. "SGML, the Standard Generalised Markup Language, is an international standard (ISO 8879: 1986) for writing tagging languages which describe the structure, rather than the visual appearance, of texts. SGML works by means of Document Type Definitions (DTDs) which prescribe the order, hierarchy and frequency of the elements of a text, and the writing system used. It is particularly useful for ensuring structural consistency throughout a large body of material, and for systematically tagging noteworthy or interesting features of those texts. Because it is an international standard and not a proprietary format, SGML is independent of platform, application and character-set and therefore extremely portable and durable. In short, it is ideal for encoding large language corpora which need to be searched, analysed and shared between projects over a long period of time. Hyper-Text Markup Language, in which Web documents are written, is probably the best-known SGML application; its current DTD is HTML 4.0. There are many other internationally or professionally standard DTDs but, not surprisingly, nothing quite suitable for marking up a corpus of Sumerian literature. The corpus project constructed a set of suitable DTDs; these were tested on a sample text, The lament for Nibru, which has a composite text of 323 lines, and 33 cuneiform sources... HTML is derived from SGML, and current trends are moving towards a Web based on XML -- Extensible Markup Language, a simplified form of SGML -- with XML-aware browsers already commercially available. In the next few months a parallel XML- and Unicode-based site will be developed, enabling much more sophisticated display and searching facilities. However, until their use becomes more widespread, the simple HTML- and ASCII-based pages which you are now reading will continue to be available... By the end of the project we also hope to offer a choice between HTML- and ASCII-viewing and XML- and Unicode-viewing." [from 'About ETCSL', dated May 19, 2000] References:
|
![]() ![]() ![]()
|
||||
![]() |
Document URI: http://xml.coverpages.org/etcsl.html — Legal stuff
Robin Cover, Editor: [email protected]





