|
| |
| | Text Encoding Initiative - Wikipedia, the free encyclopedia |
 | | The Text Encoding Initiative (TEI) is a consortium of institutions and research projects which collectively maintains and develops a standard for the representation of texts in digital form. |  | | Its major deliverable is a set of Guidelines, which specify encoding methods for machine-readable texts, chiefly in the humanities, social sciences and linguistics. |  | | the Electronic Text Center and the Institute for Advanced Technology in the Humanities at the University of Virginia. |
|
http://en.wikipedia.org/wiki/Text_Encoding_Initiative
(396 words)
|
|
| |
| | Cover Pages: Text Encoding Initiative (TEI) |
 | | The Text Encoding Initiative uses XML in the markup of literary and linguistic texts. |  | | Text Encoding Initiative Consortium Releases P4 Draft Guidelines in XML and SGML. |  | | TEI aims to [enable encoding of] all the semantically significant aspects of literary texts, both old ones that predate XML technology, or indeed, computers in general, and newly created ones. |
|
http://xml.coverpages.org/tei.html
(7415 words)
|
|
| |
| | NINCH Guide to Good Practice |
 | | The EAD header was modeled on that of the Text Encoding Initiative (TEI). |  | | Text encoding thus makes it possible to bridge the gap between local research and insight and the discourse of the larger community, and to articulate interpretative statements in a way that is broadly intelligible. |  | | Among the projects surveyed, the use of TEI DTDs in encoding texts is one of the clearest cases of the adoption of standards for a particular type of material. |
|
http://www.nyu.edu/its/humanities/ninchguide/V
(7242 words)
|
|
| |
| | Cover Pages: Academic Applications |
 | | The Text Encoding Initiative is an international research project sponsored by the Association for Computing in the Humanities (ACH), the Association for Literary and Linguistic Computing (ALLC), and the Association for Computational Linguistics (ACL). |  | | The TEI (Text Encoding Initiative) has developed an SGML encoding for a wide range of document types in the domain of humanities computing. |  | | LDC SGML encoding: Through a now refined process the Language Analysis Center is able to produce a final digitized text of approximately 8,000 entries complete with SGML tags, in the span of one month. |
|
http://xml.coverpages.org/acadapps.html
(12232 words)
|
|
| |
| | Susan Hockey |
 | | She is Chair of the Association for Literary and Linguistic Computing and is a member (past Chair) of the Steering Committee for the Text Encoding Initiative. |  | | It is less suitable for encoding text which is to be analysed in some way, for example by a retrieval program, but it can be used to display the results of an analysis of a more richly marked up text such as one encoded in the TEI scheme. |  | | We are now working on the design of an enhanced header which will provide a direct mapping to all the MARC fields which we use for cataloguing electronic texts as well as encoding, profile and revision descriptions which can be used by computer software as well as human users. |
|
http://www.loc.gov/catdir/semdigdocs/hockey.html
(3211 words)
|
|
| |
| | WWP Bibliography |
 | | Text Encoding Initiative Guidelines to Electronic Text Encoding and Interchange, ed. |  | | "Encoding Verse Texts", in Computers and the Humanities 29, 99-111. |  | | "Hierarchical Encoding of Text: Technical Problems and SGML Solutions", in Computers and the Humanities 29, 211-231. |
|
http://www.wwp.brown.edu/encoding/bibliography.html
(1167 words)
|
|
| |
| | TEI (Text Encoding Initiative): Metadata Reference Guide: MIT Libraries |
 | | Text Encoding Initiative: defines a general-purpose scheme that makes it possible to encode different textual views. |  | | • encoding description (level of detail of the analysis-the aim or purpose for which an electronic file was encoded; editorial principles and practices used during the encoding of the text), |  | | Encodings for different views of text; alternative encodings for the same text features; mechanisms for user-defined extensions to the scheme. |
|
http://libraries.mit.edu/guides/subjects/metadata/standards/tei.html
(1047 words)
|
|
| |
| | Amazon.ca: Books: Text Encoding Initiative: Background and Context |
 | | The Text Encoding Initiative (TEI) Guidelines for Electronic Text Encoding and Interchange are the result of over six years' work by dozens of scholars from all over the world. |  | | The work of participants in the TEI not only involved consideration of problems of text encoding that are likely to be with us for decades to come, but also required the development of a methodology - from scratch - for approaching these problems. |  | | They will certainly serve as the primary basis for encoding texts in electronic form for the foreseeable future. |
|
http://www.amazon.ca/exec/obidos/ASIN/0792336895
(264 words)
|
|
| |
| | The Text Encoding Initiative and the GENIA Corpus |
 | | The talk first introduces the Text Encoding Initiative, an international effort established in 1987 under the joint sponsorship of the Association for Computers and the Humanities, the Association for Computational Linguistics, and the Association for Literary and Linguistic Computing. |  | | Describes an encoded work so that the text itself, its source, its encoding, and its revisions are all thoroughly documented. |  | | TEI is the only systematised attempt to develop a fully general text encoding model and set of encoding conventions based upon it. |
|
http://nl.ijs.si/et/Talks/nii02/nii-talk.xml
(2116 words)
|
|
| |
| | WWP Newsletter, Fall 1997: Grappling with Text Encoding on Your Own |
 | | These issues include electronic scholarly editing, philosophical discussions of the nature of text, basic concepts of SGML, and an understanding of the role of text encoding standards, including familiarity with the Text Encoding Initiative and its Guidelines for Text Encoding and Interchange. |  | | With the increasing sophistication of humanities scholars about electronic texts (and of the electronic text community about humanities issues), the work published in this area is becoming more and more intellectually compelling. |  | | The past few years, however, have seen not only an increase in the quantity of material published on electronic text issues, but--more importantly--a shift in focus away from the simple fact of electronic publication and its novelty, and towards a careful consideration of the problems and theoretical issues involved. |
|
http://www.wwp.brown.edu/project/newsletter/vol03num02/grappling.html
(548 words)
|
|
| |
| | Corpus Encoding Standard |
 | | The CES is an application of SGML (ISO 8879:1986, Information Processing--Text and Office Systems--Standard Generalized Markup Language) compliant with the specifications of the TEI Guidelines for Electronic Text Encoding and Interchange of the Text Encoding Initiative. |  | | The CES specifies a minimal encoding level that corpora must achieve to be considered standardized in terms of descriptive representation (marking of structural and typographic information) as well as general architecture (so as to be maximally suited for use in a text database). |  | | It also provides encoding specifications for linguistic annotation, together with a data architecture for linguistic corpora. |
|
http://www.cs.vassar.edu/CES
(283 words)
|
|
| |
| | A Basic Guide to Text Encoding - Encoding using TEI |
 | | The UNL Libraries Electronic Text Center uses Text Encoding Initiative (TEI) tag sets and rules, a sub-set of Extensible Markup Language (XML), to encode texts. |  | | SGML texts are not, of course, designed to be read "in the raw". |  | | Manually typing the text and OCR scanning of the text are most common methods of transfer. |
|
http://libr.unl.edu:2000/guide_site/teien.html
(475 words)
|
|
| |
| | Encoding standards for large text resources: The Text Encoding Initiative (ResearchIndex) |
 | | The Text Encoding Initiative (TEI) is an international project established in 1988 to develop guidelines for the preparation and interchange of electronic texts for research, and to satisfy a broad range of uses by the language industries more generally. |  | | Encoding standards for large text resources: The Text Encoding Initiative |  | | The need for standardized encoding practices has become inxreasingly critical as the need to use and, most importantly, reuse vast amounts of electronic text has dramatically increased for both research and industry, in particular for natural... |
|
http://citeseer.ist.psu.edu/178560.html
(177 words)
|
|
| |
| | Guidelines for Text Encoding and Interchange |
 | | The TEI Consortium, now in its second year, is an international non-profit corporation set up to maintain and develop the Text Encoding Initiative system, which has become the de facto standard for scholarly work with digital text since its first publication in 1994. |  | | Fully XML-compliant, the new P4 edition of the Guidelines for Electronic Text Encoding and Interchange is intended to provide a standard format for data interchange in humanities research and to suggest principles for the encoding of texts in the same format. |  | | The Guidelines defines a language for describing how texts are constructed and proposes names for their components. |
|
http://www.upress.virginia.edu/books/tei.html
(168 words)
|
|
| |
| | TEI: Yesterday's information tomorrow |
 | | The Text Encoding Initiative (TEI) Guidelines are an international and interdisciplinary standard that facilitates libraries, museums, publishers, and individual scholars represent a variety of literary and linguistic texts for online research, teaching, and preservation. |  | | The fifth Annual Members Meeting will be held 28-29 October 2005, at the Bulgarian Academy of Sciences, Sofia, Bulgaria and is open to members and non-members alike. |
|
http://www.tei-c.org
(235 words)
|
|
| |
| | Ebenezer's software for TEI |
 | | Many who are commited to the abstract ideals of the Text Encoding Initiative never create or use TEI-conformant texts because they can afford neither the money to buy the commercial software nor the time to learn the arcana of the free software. |  | | It's not the fault of the Text Encoding Initiative, or of their sponsors, collaborators, in-laws, remote cousins, distant acquaintances, or household pets. |  | | Scholars who want to use software tools to create and handle TEI-conformant text files have usually had to choose between expensive commercial products or free programs which required a lot of effort and computer sophistication to use. |
|
http://www.umanitoba.ca/faculties/arts/linguistics/russell/ebenezer.htm
(1329 words)
|
|
| |
| | Corpus Encoding Standard |
 | | The CES is an application of SGML (ISO 8879:1986, Information Processing--Text and Office Systems--Standard Generalized Markup Language) compliant with the specifications of the TEI Guidelines for Electronic Text Encoding and Interchange of the Text Encoding Initiative. |  | | The CES specifies a minimal encoding level that corpora must achieve to be considered standardized in terms of descriptive representation (marking of structural and typographic information) as well as general architecture (so as to be maximally suited for use in a text database). |  | | It also provides encoding specifications for linguistic annotation, together with a data architecture for linguistic corpora. |
|
http://www.cs.vassar.edu/CES/CES1.html
(1329 words)
|
|
| |
| | TEI Text Encoding in Libraries |
 | | Level 1 texts are not intended to be adequate for textual analysis; they are more likely to be suited to the goals of a preservation unit or mass digitization initiative. |  | | Electronic text at all levels of encoding should begin with the transcription of the first word on the first leaf of the original work. |  | | The primary advantage in using the TEILite DTD at this level is that a TEI Header is attached to the text file. |
|
http://www.indiana.edu/~letrs/tei
(2948 words)
|
|
| |
| | Cover Pages: Academic Applications |
 | | The Text Encoding Initiative is an international research project sponsored by the Association for Computing in the Humanities (ACH), the Association for Literary and Linguistic Computing (ALLC), and the Association for Computational Linguistics (ACL). |  | | The TEI (Text Encoding Initiative) has developed an SGML encoding for a wide range of document types in the domain of humanities computing. |  | | The text itself should be essentially self-describing, which means that the computer file which embodies it should contain a header with essential meta-data. |
|
http://xml.coverpages.org/acadapps.html
(12232 words)
|
|
| |
| | HTI American Verse Project |
 | | The American Verse Project is a collaborative project between the University of Michigan Humanities Text Initiative (HTI) and the University of Michigan Press. |  | | In recognition of the effort involved in selecting, editing, encoding, and maintaining online the works included in the archive, we expect all users to abide by the conditions of use. |  | | The full text of each volume of poetry is being converted into digital form and coded in Standard Generalized Mark-up Language (SGML) using the TEI Guidelines, with various forms of access provided through the WWW. |
|
http://www.hti.umich.edu/english/amverse
(179 words)
|
|
| |
| | Internationalization of XML – Past, Present, Future |
 | | A working group of the Text Encoding Initiative (TEI) is currently looking at this and related problems. |  | | Text normalization, very different from character encoding, is a rather advanced topic where we have made progress in the past few years, but where a lot of work still needs to be done. |  | | Unfortunately, there was a countereffect to this: Because most texts are already normalized in most cases, problems are rarely noticeable, and therefore the motivation for addressing text normalization is often too low. |
|
http://www.idealliance.org/papers/dx_xml03/papers/03-06-02/03-06-02.html
(179 words)
|
|
| |
| | Tekstlaboratoriets FAQ liste |
 | | It is not possible to access in a friendly way from a Macintosh some of the texts which we have in CD-ROM and for which we have permission to make available through the network: examples are the (non-English) texts from the European Corpus Initiative (ECI-MC1). |  | | On the other hand, we plan to use the net more and more in the future, which means that for some services run at the Text laboratory it is enough that you have a browser you are content with. |  | | If there is need, we may make some of the files that came on CD-ROM available from the Textlab disks, but note that speech files take a LOT of space. |
|
http://www.hf.uio.no/tekstlab/faq.html
(179 words)
|
|
| |
| | Timeline |
 | | Funding begins for the Text Encoding Initiative, an effort to develop standard guidelines for formating electronic texts, and American National Biography. |  | | In December, the Endowment is again restructured: the remaining five program divisions are consolidated into three and thirty-one programs into nine; an Office of Enterprise is created; and staffing is reduced by 38 percent. |  | | In January, NEH programs are restructured: parts of the Fellowships and Seminars divisions are merged with the Research and Education divisions, Challenge Grants are again administered by a separate office; and the Federal/State Partnership is created. |
|
http://www.neh.gov/whoweare/timeline.html
(179 words)
|
|
| |
| | initiative |
 | | Text Encoding Initiative Consortium Text Encoding Initiative Consortium The TEI Consortium was formed to continue the work of the Text Encoding Initiative and is hosted by four universities: University of Bergen, Brown University, University... |  | | The ecole initiative [The early church on-line encyclopedia (Ecole)] The ecole initiative [The early church on-line encyclopedia (Ecole)] The Ecole Initiative (Early Church On-Line Encyclopedia) is an ever-growing reference resource assembling... |  | | Budapest open access initiative Budapest open access initiative Arising from a meeting held at the Open Society Institute in December 2001, the Budapest Open Access Initiative represents an attempt by academics across diverse fields to... |
|
http://www.jointctr.org/?Category=initiative
(179 words)
|
|
| |
| | Cover Pages: Academic Applications |
 | | The Text Encoding Initiative is an international research project sponsored by the Association for Computing in the Humanities (ACH), the Association for Literary and Linguistic Computing (ALLC), and the Association for Computational Linguistics (ACL). |  | | For example, with respect to the Association for Computational Linguistics Data Collection Initiative (ACL/DCI), 620 MB: "The many formats in which the originals of these texts came have all, to one extent or another, been mapped into a markup language consistent with the SGML standard (ISO 8879). |  | | LDC SGML encoding: Through a now refined process the Language Analysis Center is able to produce a final digitized text of approximately 8,000 entries complete with SGML tags, in the span of one month. |
|
http://www.oasis-open.org/cover/acadapps.html
(179 words)
|
|
| |
| | Corpus Encoding Standard |
 | | The CES is an application of SGML (ISO 8879:1986, Information Processing--Text and Office Systems--Standard Generalized Markup Language) compliant with the specifications of the TEI Guidelines for Electronic Text Encoding and Interchange of the Text Encoding Initiative. |  | | The CES specifies a minimal encoding level that corpora must achieve to be considered standardized in terms of descriptive representation (marking of structural and typographic information) as well as general architecture (so as to be maximally suited for use in a text database). |  | | The CES has been designed to be optimally suited for use in language engineering research and applications, in order to serve as a widely accepted set of encoding standards for corpus-based work in natural language processing applications. |
|
http://www.lpl.univ-aix.fr/projects/multext/CES/CES1.html
(179 words)
|
|
| |
| | Publications--Nancy Ide |
 | | Computers and the Humanities 33:1-2, Special Issue on the Tenth Anniversary of the Text Encoding Initiative, E. Mylonas and A. Renear, eds. |  | | Ide, N. Encoding standards for large text resources: The Text Encoding Initiative. |  | | Ide, N. and V éronis, J. Word Sense Disambiguation: The State of the Art. |
|
http://www.cs.vassar.edu/faculty/ide/pubs.html
(179 words)
|
|
| |
| | Zdravo Matija |
 | | Key words: textual criticism, critical editions, XML, Text Encoding Initiative, Anton Martin Slomšek, 19th century Slovenian literature |  | | The focus of the work was first in developing the methodology and annotation scheme to produce standardised digital encodings of text-critical editions of Slovenian literature, but of course also in coming up with an interesting and usable result. |  | | Yet, for this purpose, an elaborated methodology is needed, which will, on the one hand, encompass the specific editorial problems of Slovenian literature and, on the other, be based on open international standards and guidelines for text encoding and interchange. |
|
http://www.komunikacija.org.yu/komunikacija/casopisi/ncd/5/d005/document
(179 words)
|
|
| |
| | Representation of Linguistic Corpora |
 | | The CES is an application of SGML (ISO 8879:1986, Information Processing--Text and Office Systems--Standard Generalized Markup Language) compliant with the specifications of the TEI Guidelines for Electronic Text Encoding and Interchange of the Text Encoding Initiative. |  | | The CES specifies a minimal encoding level that corpora must achieve to be considered standardized in terms of descriptive representation (marking of structural and typographic information) as well as general architecture (so as to be maximally suited for use in a text database). |  | | This project is intended to provide a theoretical background and develop coherent methodologies for the representation, access, and manipulation of corpora intended for use in corpus-based natural language processing (NLP) research. |
|
http://www.cs.vassar.edu/~ide/research
(179 words)
|
|
| |
| | Refining Our Notion of What Text Really Is. |
 | | The Guidelines of the Text Encoding Initiative exhibit a characteristically ambiguous stance: although they seem to privilege this view and benefit from its influence, they do not specifically invoke, explain, or defend it. |  | | In fact, we think that the continuing perplexity that surrounds the treatment of overlapping hierarchies is not due to the technical issues of encoding at all, but rather to a more fundamental deficiency in our understanding of just what we are doing when we prepare an encoded text. |  | | The motivation for the view that texts are hierarchies of content objects lay initially in reflecting on these practical benefits of treating texts as if they were ordered hierarchies of content objects. |
|
http://www.stg.brown.edu/resources/stg/monographs/ohco.html
(179 words)
|
|
|