Refine
Document Type
- Conference Proceeding (12) (remove)
Language
- English (12) (remove)
Keywords
- Korpus <Linguistik> (5)
- Neuägyptisch (2)
- Software (2)
- AEMASE (1)
- Annotation (1)
- Big Data (1)
- CVMA (1)
- Computer-mediated communication (1)
- Computerlinguistik (1)
- Corpus linguistics (1)
Has Fulltext
- yes (12)
Institute
- Akademienvorhaben Strukturen und Transformationen des Wortschatzes der ägyptischen Sprache. Text- und Wissenskultur im alten Ägypten (5)
- ALLEA (2)
- TELOTA - IT/Digital Humanities (2)
- Veröffentlichungen von Akademiemitgliedern (2)
- Akademienunion (1)
- Akademienvorhaben Digitales Wörterbuch der Deutschen Sprache (1)
Virtually all conventional text-based natural language processing techniques - from traditional information retrieval systems to full-fledged parsers - require reference to a fixed lexicon accessed by surface form, typically trained from or constructed for synchronic input text adhering strictly to contemporary orthographic conventions. Unconventional input such as historical text which violates these conventions therefore presents difficulties for any such system due to lexical variants present in the input but missing from the application lexicon. To facilitate the extension of synchronically-oriented natural language processing techniques to historical text while minimizing the need for specialized lexical resources, one may first attempt an automatic canonicalization of the input text. This paper provides an informal overview of the various canonicalization techniques currently employed by the Deutsches Textarchiv project at the Berlin-Brandenburg Academy of Sciences and Humanities to prepare a corpus of historical German text for part-of-speech tagging, lemmatization, and integration into a robust online information retrieval system.
The article summarizes the contents and the structurtal premises of the “Thesaurus Indogermanischer Text- und Sprachmaterialien” (TITUS), focussing on search functions and facilities and questions of the encoding of ancient languages written in various scripts. Examples are taken from Tocharian, Greek, Vedic Sanskrit, and other ancient Indo-European languages covered by TITUS.
This paper is an updated presentation of the Ramses project being currently developed at the University of Liège. The first section stresses the main objectives and gives a technical description of the general architecture of Ramses software. The second part describes the encoding procedures and reviews the current state of the annotation. In the third section, some changes brought about by the use of large-scale corpora are discussed from an epistemological viewpoint. The paper ends with the presentation of some new avenues for research that will ensue from the use of a complex multilevel corpus.
This is the invited evening lecture of the biannual workshop on hydroid development of 1999. Its topic is the role of hydra as a rather puristic model for the de-novo generation of spatial patterns in development, and our work in this field. Emphasis is placed not only on experimental studies, but also on theoretical analysis, because the understanding of spatial order requires a systems approach involving the combination of knowledge on molecules, cells and tissues with mathematical analysis, laws and facts.
Physical principles underlying biological pattern formation are discussed. In particular, the combination of local self-enhancement and long-range (“lateral”) inhibition (Gierer and Meinhardt, 1972) accounts for de-novo pattern formation, and for striking features of developmental regulation such as induction, spacing and proportion regulation of centers of activation in tissues and cells. Part I explains physical principles of spatial organisation in biological development. Part II demonstrates in mathematical terms that and how short-range activation and long-range inhibition are conditions for the generation of spatial concentration patterns. The conditions can be expressed in terms of ranges, rates and orders of reactions. These conditions, in turn, can also be derived by analysis of dynamic instabilities by means of Fourier waves, showing the neither obvious nor trivial relation between the latter approach and the theory based primarily on autocatalysis and lateral inhibition.