Anatolian Databases provides a corpus of transcriptions of texts in Lydian, Lycian and Cuneiform Luvian, some of the extinct Indo-European languages of ancient Anatolia (modern Turkey) spoken and written in the 2nd and 1st millennia B.C, although the origin of the texts provided is not clearly stated on the site, which may make the resource less useful. The site forms part of the home page of historical linguist Dr H. Craig Melchert of the University of North Carolina at Chapel Hill and includes an online version of his Cuneiform Luvian Lexicon (1993).The material is presented in an Adobe Acrobat Reader format with thumbnails for easy navigation through the extensive text which can be extracted, copied or printed. The absence of translations, critical apparatus and minimal commentaries and further references, however, demonstrate the specialist nature of this resource. The non-specialist reader will have to make use of other resources for a wider appreciation of the subject. In addition, Melchert's home page provides free online access to several of his papers on the languages featured in the Anatolian Databases as well as an outline of the linguistic courses he teaches at UNC. This is a specialist resource which will interest advanced undergraduate students and researchers in ancient history and linguistics of Turkey and the Aegean basin.
Cretan Hieroglyphic Texts by John G. Younger is a simplified edition of the Corpus Hieroglyphicarum Inscriptionum Cretae (CHIC). It is an important reference work that can disseminate the study of Cretan Hieroglyphic inscriptions (mostly from Knossos and Malia) and eventually distribute updates fast. Cretan Hieroglyphic was used in Minoan Crete and there is evidence of similarities between Linear A and Cretan Hieroglyphic; both scripts are undeciphered.
This technical website may be valuable to researchers in archaeology, linguistics and classics specialising in Linear A and Minoan scripts.
This online essay (first published in Scientific American in 1990) introduces the early history of the Indo-European language group and argues that the protolanguage originated more than 6,000 years ago in Eastern Anatolia. The essay describes how linguists use evidence of phonological change to trace the history of languages. The authors defend their Indo-European glottalic system of sound change, challenging the classical view that Sanskrit most faithfully conserved the original sound system. Examples are taken from agricultural terminology, and anthropological conclusions concerning migration patterns are drawn from an analysis of words descriptive of landscape and vegetation. Particular attention is paid to the Armenian language and its place in the Indo-European language family tree. There is a short list of further reading appended to the essay.
Enchiridion is online teaching guide by Elaine Woodward and Marianne Pagos of the Boston Latin School for those who want to learn ancient Greek. This e-book is available to download here in PDF format. Individual chapters cover the following topics relating to Greek grammar: the Greek alphabet and pronunciation; cases (nominative, accusative etc), number, and gender; nouns and their forms; verb forms and tenses; participles; pronouns. Each chapter provides both explanatory text and grammatical/translation exercises. Other pages of the website offer further exploration of the Greek language, and include: readings from Homer's Iliad for translation; a section listing prepositions; a list of irregular verbs; a 'glossary' (which is in fact a dictionary of transliterated Greek words); and an appendix of the various charts which appear in the main text. The site is easy to navigate and the text is clear.
The academic journal for 'North African and Andalusian Dialectological Studies' (EDNA) is a publication from the Spanish 'Institute of Islamic and Near Eastern Studies'. The journal is devoted to the study of the Arabic language in the Western regions of Maghreb and Alandalús, thus combining studies on contemporary and historical issues of Arabic dialects in these regions. Available on the site is the full-text content for all issues published between 1996 and 2004. Although Spanish is the main language of the publication, there are also articles in English, French and German. As part of the Institute of Islamic and Near Eastern Studies, the user may also browse the website and access relevant information about the institute: publications; cultural activities; and the library.
The Germanic Lexicon Project (formerly the Indo-European Language resources page) is a website that collects and provides access to out-of-copyright grammars and dictionaries of Germanic languages. Most of the texts digitised here date from the last decades of the nineteenth century or the first couple of decades of the twentieth. Languages covered include: Proto-Germanic; Old English; Gothic; Old High German; Middle High German; Old Saxon; Old Frisian; and Old Norse. There is also a nineteenth-century guide to the Somersetshire dialect, and a small section on non-Germanic languages. Background information for each publication is available. The featured texts were in various states of digitisation when reviewed, with several having been fully converted into HTML or XML, others existing as scanned pages. The site's editor, Sean Crist, is seeking volunteers to assist with digitisation. The site features a keyword search engine. There is also an e-mail discussion board, although this does not seem to have many regular users. This is a valuable online resource, recommended to those working on the linguistics of early Germanic languages.
This website is intended to introduce undergraduate students to the Great Vowel Shift (GVS) that affected the pronunciation of the English language between the fifteenth and eighteenth centuries. It summarises the historical significance of the changes that took place over this period, and explains how the shift has had an impact on the interpretation of literature. Literary examples of language change are taken from Chaucer, Shakespeare, Dryden, and Pope. The site features an interactive Java applet and audio recordings illustrating how each vowel has changed, and an exemplary dialogue between a conservative and advanced English speaker. Links to other Internet resources, and a short reading list, are also provided. This site provides a clear and succinct guide to the Great Vowel Shift and some of its implications.
The Historical Sociolinguistics Network provides a forum for researchers in language change and language use in the context of the social histories of languages. Its website provides information on: relevant conferences and summer schools; members, with lists of publications; and projects. There are also a noticeboard with items of interest and a short section with links to other online resources. Areas of interest include: written versus oral histories; the history of writing and schooling; historical language contact; gender in historical language communities; and standard languages and codification. The home page opens up a PHP index page from which the other sections can be accessed, which makes the site a little awkward to navigate. The network has recieved an AHRC Scientific Network Grant towards its summer schools.
This websites is produced by Professor Danko Sipka of Arizona State University. Designed for his students of comparative Slavonic philology, the website History of Slavic languages : Polish, Russian and Serbo-Croatian in Comparison is an excellent resource for both teachers and students of Slavonic languages and those with a knowledge of either Russian, Polish, Croatian, Serbian, or Bosnian. The website is divided into the following sections: morphosyntax; methodology; lexicon; East, West and South Slavic; survival kits; phonology; links; and separate pages for Russian, Polish and BCS (Bosnian, Croatian, Serbian). Many files can be downloaded in PDF format and some areas are restricted to students of ASU, however this is a good resource in the field of Slavonic languages and linguistics.
This website details the ongoing Historical Thesaurus of English (HTE) project. It describes the project itself, how the finished work will be organised, and lists publications that have benefited from the work on the thesaurus so far. The site also provides some sample entries, such as 'beer' and 'gin'.The HTE contains English words (including Old English) from their earliest written occurrence, giving information on when they fell out of use (where appropriate and known). It is based on the New Oxford English Dictionary. The HTE is organised into three sections: the External World, the Mind, and Society. Within each section, words are ordered chronologically and semantically (not alphabetically). The HTE allows the building of models of vocabularies available at any one time, and it should be a valuable research tool for studying literary and linguistic history. The project received funding from the Arts and Humanities Research Board (AHRB) within the Research Grants scheme.
This set of Web pages provides an introduction to the study of ancient Iberian languages, based on inscriptions found in modern Spain and Portugal. The various sections discuss Levantine Iberian writing, Greek-Iberian, Celtiberian, and the Sudlusitanian-Tartessian language. The site provides a guide to the phonetics of each alphabet, and offers theories as to where each alphabet is derived from. The arguments surrounding the origins of each language are explored, as are the issues surrounding the relationships between the several languages and scripts, such as the Basque-Iberian hypothesis. Some example inscriptions are provided on the site, as is a bibliography.
The Indo-European Etymological Dictionary (IED) is the website for the IED research project which started in 1991 at the Department of Comparative Indo-European Linguistics at Leiden University. Its aim is to compile etymological databases of various Indo-European languages and to create a comprehensive online Indo-European etymological database containing vocabulary that can be traced back to a proto-language. Currently sixteen etymological databases based on authoritative etymological dictionaries are available online, including: Indo-European; Greek; Lithuanian; Russian; Albanian; Baltic; Old Frisian; Indo-Arian; Iranian; Slavic; Tocharian B; Old Norse; Rgvedic; Vedic; Cuneiform Luvian; and Lycian. The site also contains instructions for using the database. To access some of the databases users need to download StarLing, a software package designed to assist with text and database processing, such as the handling of linguistic fonts. This is a valuable resource for comparative linguists and language historian.
This website is an introduction to traditional grammar for students of medieval literature. Written and edited by Dr Bella Millet of Southampton University, the site seeks to present the varieties of medieval grammar in an historically contextualised manner. The site is divided into two main sections: basic grammar (that is, modern grammar) - for example, syntax and parts of speech; Old English - basic grammar equivalents in Old English.There is also a very useful index of grammatical terms, complete with links to explanations and examples.
The Lampeter Corpus of Early Modern English Tracts is a collection of non-literary prose texts covering the period between 1640 and 1740. The period is enclosed between the outbreak of the Civil War in 1642 and the beginnings of the Industrial Revolution in the 18th century and is marked by the standardisation of British English. The corpus consists of 120 texts (tracts, pamphlets). The texts are subdivided into ten decades and six domains: religion; politics; economy; science; law; and miscellaneous. Each domain is represented by two texts in each decade. The total comes up to 1.1 million words. The texts are encoded according to the guidelines of the Text Encoding Initiative (TEI) and use of the Standard Generalised Markup Language (SGML). They are available free of charge for scholarly research and are aimed at linguists and historians.
Language Variation and Change is a journal dedicated exclusively to the study of linguistic variation. It focuses on variation in either oral or written data, from a synchronic or diachronic perspective. The journal is a valuable resource for sociologists, linguists, sociolinguists, psychologists, anthropologists, phonologists and dialectologists. The site gives an overview of the journal and guidelines for contributors. The site allows free access to one issue of the journal and abstracts to back issues from volume 11, issue 1 from 1999. The full articles can be bought online or accessed through subscription.
The website Languages of India, which is part of the larger Indian saga site, contains information about the many languages of the country. Across the top of the page is a list of all eighteen official languages currently spoken in India. These link to pages which lay out the history of all these languages and of literature in them. Other links give a more linear explanation of the linguistic history of India, including languages no longer spoken, such as Sanskrit, as well as the great works of literature and epic religious texts. This narrative ends with a look at current trends. This is a site which is easy to use and well laid out, comprehensive and authoritative.
The Early Modern English Dictionaries Database is an online searchable database of entries from sixteen early dictionaries, dating from between 1530 and 1657. The sources include bilingual lexicons as well as specialist and hard-word dictionaries. In addition to the database, there is a helpful introduction, and a bibliography of works that may be of interest to those studying the lexicography of the period. The search engine is simple and quick to use. The site is intended to benefit lexicographers, researchers studying the work of the authors of the dictionaries included, and those simply looking up words in the course of routine scholarly work. There is a public version accessible to anyone and a licensed more extensive version accessible with a subscription fee.
This website, published by John Younger of the Department of Classics of the University of Kansas, provides a brief introduction to the script, transliterations of all the major Linear A texts from Crete and the Aegean, a comprehensive bibliography of related publications from 1980 and a series of free downloadable fonts for Macintosh and Windows users of all the ancient Aegean scripts (Hieroglyphic, Linear A, Linear B and the Phaistos disc). Linear A, the main writing system of the Minoan civilisation of Bronze Age Crete in the second millennium BC, is one of the last undeciphered scripts of the ancient Mediterranean. The website also supplies a rudimentary grammar and vocabulary of Linear A and speculates on the language underlying the script, which Younger believes to be related to one of the Indo-European Hittite languages of Anatolia. Linear A is first attested in Middle Minoan (MM) I B palatial contexts circa 2000 BC but its use and occurrence expanded dramatically throughout Crete and the Aegean the during the MM II-III periods and finally disappeared in the course of the 16th century BC (Late Minoan IB), probably as a result of major cultural or political influence from the Greek Mainland. While there are no photographs or drawings of the actual Linear A documents themselves, the texts are presented in a way which allows the reader to reconstruct the original layout of the tablets. In addition, all the various epigraphic conventions and abbreviations are also provided. The editor has also usefully grouped together all the texts believed to be of religious significance. While this is a specialist resource for professional archaeologists, ancient philologists and epigraphists, it will also interest undergraduates and the interested amateur.
The digital resource contains almost 10,000 images of a wide variety of legal documents, varying from police reports to notarial deeds. All (current) Flemish provinces are represented in this resource. The project aims to gather a large number of 19th century legal documents, originating from the last period in which the Lowlands were united (1815-1830). The purpose is to study to what extent, and in which cases, Dutch was spoken and written in a legal context during that period (as opposed to French). In a later phase of the project, transcriptions of this resource should be added to the digital resource. In order to access this resource, it is necessary to complete an order form.
A Linguistic Atlas of Early Middle English, 1150 - 1325 (LAEME) is an interactive online atlas, designed to enable regional and chronological linguistic study of English during this period. The Atlas complements the printed 'Linguistic Atlas of Late Mediaeval English' (LALME), which covers the period immediately following LAEME. Resources provided as part of the atlas include: a comprehensive introduction to the atlas, its contents and uses; a corpus of lexico-gramatically tagged texts (in a searchable database); a database of information regarding LAEME corpus sources; information on the software used by LAEME with instructions on concordancing, dictionary-making and map-making; and a corpus of etymologies and changes. Searches are performed mainly via 'task' buttons, which bring up search fields relevant to particular interests, namely: mapping; concordancing; timetables; tagged texts; and dictionaries. The maps illustrating regional word usages are particularly useful for those researching the origins of a particular work or manuscript. LAEME is designed specifically as a non-commercial teaching and research resource, to be cited as per a printed text. This resource would be of use both to linguists and also to medievalists studying manuscripts of the period.
Linguistic Atlas of Older Scots (LAOS) is a project at the Institute for Historical Dialectology at the University of Edinburgh and partly funded by the Arts and Humanities Research Council (AHRC). It is a corpus of historical texts from different regions of Scotland covering the time from 1380 to 1700. Phase 1 is presented at the website covering 1380 to 1500. The corpus can be searched and browsed in a variety of ways. The distribution of different forms may be shown on maps of Scotland as a compliment to reading the texts and searching for items of interest. The website is rather hard to navigate and a read through of the manual is recommended, still this is a valuable resource for anyone researching Early Scots.
This is the website of the Linguistic Research Center of the University of Texas at Austin, which features a variety of useful resources for students and researchers in modern, comparative and historical linguistics. The Centre has a specific interest in the use of computers to analyse language, and supports a number of projects to this end. Details of all projects undertaken by the centre, together with their digital output, are available on this site. The bulk of the site is taken up by the online Indo-European Documentation Centre, offering introductions to and histories of the Indo-European language families (Germanic; Anatolian; Tocharian; Celtic; Italic; Hellenic; Armenian; Indo-Iranian; Balto-Slavic; and Albanian); sample Indo-European texts; and brief introductions to the lexicon, linguistics, phonology and grammar of Indo-European languages. Other projects included are the Modern Hebrew Project (which creates and disseminates online tools for learning Hebrew) and the Numerals Project (developing resources for the study of early counting systems in the Ancient Near East). Links to related resources, including academic articles are provided. The site also offers details about the history of machine translation at the centre, and the now concluded Meso-American Languages Project, together with a bibliography of materials produces by the Centre's research staff.
The Middle English Grammar Project (MEG) is funded by the Norwegian Research Council and based at the University of Stavanger, Norway and the University of Glasgow. The eventual aim of the project is to produce a reference grammar of Middle English, based on a corpus of electronic texts. The project's website provides: an introduction to the project and its methods; a description of work currently being done by project members; a list of related sites; a list of related publications by project staff; news and contact information. The site also gives access to HTML and PDF versions of the MEG corpus of electronic texts, which can be browsed by dialect region. This site would be of interest to those studying linguistics or Middle English.
This website, for the Oral Tradition Journal, contains the twice-yearly published full-text inter-disciplinary journal relating to oral tradition and history, literary criticism and history, folklore, anthropology, linguistics and history. One of the journal's main aims is to promote the academic study of these related fields and, to that end, all of the publications from 1986 are available freely on the Web page (all are available to be downloaded in PDF format). It is, moreover, possible to search all the publications by keyword search or by author search or to browse through issue-by-issue. The journal covers a great many areas and aspects of research and study, from Basque Oral Poetry Championship to Australian Aboriginal Oral Traditions, and many more. New users of the site are recommended to read volume 18, issues 1 and 2 before moving onto the rest of the site as these 'provide a broad, state-of-the-art perspective on the multidisciplinary field of studies in oral tradition'.
The Parsed Corpus of Early English Correspondence (PCEEC) consists of 4970 letters from 84 different collections, written between 1410 and 1695 and contains some 2.2 million word tokens in total. The corpus was compiled by the Sociolinguistics and Language History project team at the Department of English, University of Helsinki. The corpus is part of speech and syntactically annotated and the website gives information about the different tagging schemes used. The corpus itself is distributed by the Oxford Text Archive (OTA) and it may be used subject to copyright restrictions. The corpus is designed to be compatible with CorpusSearch, which is a suite of search tools designed by Beth Randall at the University of Pennsylvania. This is a valuable resource for anyone researching or studying the development of the English language. The Corpus can also be ordered via the Oxford Text Archive (OTA) website, (formerly part of the Arts and Humanities Data Service (AHDS)), on completion of a request access form.
The Penn-Helsinki Parsed Corpus of Historical English is a corpus of prose text samples of Middle English, Early Modern English and Modern British English. The corpus has been annotated for syntactic structure, allowing the user to search for syntax features as well as text strings. The corpus is available to institutions on a subscription basis. This website describes the corpus and provides instructions as to its use.The corpus contains a total of 1.3 million words, from over 50 text samples, each of which is given in three forms: a text file, a part-of-speech tagged file and a parsed file. There is also an additional file with philological and bibliographical information about each text. The website provides: a condensed version of the annotations manual; provenances for each of the texts used in the corpus; instructions for the search engine; and, in PDF format, the complete manual for the CorpusSearch programme.
Sankt-Peterburgskii korpus agiograficheskikh tekstov [Saint Petersburg corpus of hagiographical texts] is a searchable database of medieval Slavic saints lives, constructed by the department of mathematical linguistics at the University of St Petersburg. It contains more than 50 manuscripts, with around 500,000 catalogued word usages. The word index may be searched in Old Slavic using an online keyboard, and results show the desired word (and lexical derivatives) in context. Unfortunately it is difficult to indentify the manuscript from which the results have been selected, and it is not possible to search only within selected manuscripts. The site is easy to navigate but requires a font download and works in Internet Explorer but not Firefox. A particularly pleasing feature is that the transcribed saints lives (dating from the fifteenth to the seventeenth century) can be downloaded in PDF or XML format. There are also several pages explaining: manuscript transcription, construction of the word index (accounting for multiple spelling variants; omission of superscript marks etc.); morphosyntactic mark-up; XML mark-up. This is a wonderful resource for researchers in the field of Slavic linguistics, palaeoslavistics and medieval Slavic culture.
Slovo: Towards a Digital Library of South Slavic Manuscripts is the website of an international project which aims to: increase cooperation between academic institutions studying medieval Slavic monastic culture; develop a website on Balkan literary heritage; create internationally agreed standards for the electronic publishing, description and encoding of medieval Slavic manuscripts. Pages on individual monasteries offer all or some of the following: an overview of monastery history; a description of manuscript collections and art treasures; a description of digitization efforts; links to manuscript descriptions; related links; bibliography; links to online articles or PDF files. Within the guidelines section is: an article on storing, publishing and researching Slavic manuscripts with computer technology, based on the work of the Repertorium Intitiative and the Slovo project; a ‘how to’ encode Slavic manuscripts within Text Encoding Initiative guidelines; and further documents on character set standardization, XML and advanced encoding resources which will be of interest specifically for those involved in the electronic publishing of medieval manuscripts. The links to current manuscript projects under ‘initiatives’ are of particular interest. This site will be of great use for researchers in the field of palaeoslavistics, and of significant interest to those researching medieval Slavic monastic culture.
Coptic is the name given to the latest stage of the ancient Egyptian language from the first century BC and written in an alphabet deriving from Greek and Demotic. The term is applied more generally to the distinct culture of Egyptian Christianity and its diaspora which still uses the Coptic language in its religious rituals. This website, produced by the St Shenouda the Archimandrite Coptic Society of Los Angeles, is part of an on-going project to preserve and promote Coptic culture by providing digital resources for Coptic language, literary, archaeological and artistic study. Projects include the Coptic Microfilm Library (CML) which aims to put all relevant Coptic and Arabic texts online and the Mapping of Coptic Monuments project, which will record all Egyptian Christian architectural and archaeological sites. The Manual of Coptic Studies (at the time of review almost completely empty and not updated since 1996) includes: the liturgy and texts of Coptic Christianity; a history of the language; a guide to Coptic writing; a directory of Coptic scholars. Other features include a useful slide show of frescoes from Coptic churches and monasteries. There is also a run of newsletters from the mid-1990s and downloadable software. The links page provides further information on websites of Coptic interest.
The TITUS (Thesaurus Indogermanischer Text- und Sprachmaterialien) project Web page is a multilingual online text retrieval system for Indo-European languages. The project started in 1987 with the creation of a digital collection in ancient Indo-European languages. The site contains texts in the following language families: Vedic; Sanskrit; Middle and Modern Indic; Old, middle, and modern Iranian; Anatolian; Tocharian; Armenian; Baltic; Slavic; Germanic; Greek; Italic; Celtic; Caucasian; Uralic; Proto-Cretan; Semitic; and Dravidic. Some material needs special software which is freely available from the site. The site also makes available: teaching material, such as detailed language maps and audio materials; news related to the area of study; the FAQ section; information about jobs in this area of research; an events diary; links to external related projects and institutions; Indo-European courses, mainly in Germany and in Austria; and a bibliography. Technical information, such as Unicode documentation and relevant software, is also available from the site. A number of the texts may be of interest to scholars of religion, including a selection of Buddhist and Hindu works, Avestan (Zoroastrian) texts, and multiple Bible versions, including the Septuagint (Greek translation of the Old Testament.) The user should note that the site uses split frames, which can sometimes complicate its navigation.
This website offers an interface to the text of the Time magazine from 1923 to the present day, over 100 million words in all. Users can search for a word or phrase and retrieve all instances of the string in context. Searches can be restricted to a particular period in time, and they may include information about part-of-speech (word class). The results can be displayed in different ways, allowing the user to see, for example, how the frequency of a word has changed over time. The interface also allows for retrieval of collocates (surrounding words). The resource offers a powerful way to explore the English language as published in the Time magazine over the years. The interface is easy to use. The accompanying help texts provide ample information about how to use the interface and also offers suggestions of the kind of questions that can be answered using the tool. This resource would be of use to anyone interested in the English language, language change, American English, and corpus linguistics. It also offers a valuable tool for looking at cultural and historical events as reported in the Time magazine.
The Tower of Babel is the website of the international project devoted to the research of evolution of the human language. The aim of the project is to build an online database of roots, or etyma reconstructed for the world's languages. A wealth of etymological databases covering different linguistic families such as: Chinese; North Caucasian; Sino-Tibetan; Yenisseian; Altaic; Chukchee-Kamchatkan; Dravidian; Semitic; Bahnar; and Koisan, can be found on the site. Software and fonts for handling linguistic and database data are available for download, as are some interactive dictionaries and databases. The site contains links to other useful resources on the languages involved. Some original publications are available online. The major participants at this stage are: the Russian State University of Humanities (Centre of Comparative Linguistics); Moscow Jewish University; the Russian Academy of Sciences (department of history and philology); Santa Fe institute (New Mexico, USA); City University of Hong Kong; and Leiden University. Individuals and organisations interested in this project are encouraged to join. The site is particularly useful for researchers in comparative linguistics and language history.
'A Treatise on the Astrolabe by Geoffrey Chaucer' is an online database of verbs from the text of Chaucer's 14th-century treatise. The resource is the work of an undergraduate student at Adam Mickiewicz University in Poland, in collaboration with their supervisor. The database is searched by using a form that enables users to choose: conjugation; classification; and form of the verbs, as well as whether etymology and meanings are displayed. The resource also provides: a description of the database; a brief history of Middle English; a section on verb morphology; and a brief biography of Chaucer. The site also gives an introduction to the text of Chaucer's treatise (but not the text itself) and a short history of the astrolabe as an object. This site would be of interest to students studying Middle English language and literature.
This is the website for the Tycho Brahe project, based at the University of Săo Paulo. The project aims to research the relationship between prosody and syntax in the process of language change that led from Classical Portuguese to Modern European Portuguese. As well as linguistic and mathematical research, the project is also producing the Tycho-Brahe Parsed Corpus of Historical Portuguese, and a Comparative Tagged Corpus of Spoken Modern European Portuguese and Brazilian Portuguese. The former comprises texts written by Portuguese authors between 1550 and 1850, made available electronically for educational and research purposes. The user must complete an access-request form to download the texts and a link to the Tycho-Brahe Corpus is available through this site. The latter Corpus consists of categorized recorded registers of speakers of both dialects. The main website features all the papers, downloadable in PDF or Word format, written as part of the project between 1998 and 2003. Abstracts of the papers are also available. The user may also access details of the sub-projects in HTML, and information about the project's meeting and seminars. Although this site seems no longer to be updated it will be of interest to anyone working within the field of Portuguese linguistics.
The York-Toronto-Helsinki-Toronto Parsed Corpus of Old English Prose contains 1.5 million words of Old English prose taken from the Toronto Dictionary of Old English Corpus, with special formatting which makes it possible to search conveniently for syntactic structure using a computer search engine. The corpus is in HTML format, and can be downloaded from the Oxford Text Archive (formerly part of the Arts and Humanities Data Service (AHDS)). However, use of the resource is restricted, and consequently users are requested to fill in a short form on the site to gain access to the data.