skip to content
 

Our work on the Greek Lexicon 'slips' database, and on composing in XML, has involved us in more fundamental research into text manipulation and storage, which we undertook in 2002-2005, together with seven partners in an international research group:

CHLT

This international research project, funded by the European Commission Information Society Technologies Program and the US National Science Foundation Digital Libraries Initiative, was devoted to international digital library technology, and specifically the development of:

    • an infrastructure for digital libraries;
    • IT tools for end-users that are designed to be adaptable to different uses;
    • a framework for sharing metadata, data, and tools across multiple digital libraries;
    • a distributed archive allowing for long-term preservation of, and easy access to, digital data.

The participants (four European and four U.S.) are listed here, with their specialist areas and researchers:

Each of the participants brought to this project a digital collection that when linked together created a mini international digital library which acted as a test-bed for the creation of structural models and computing software. The work undertaken by each of the partners has helped to develop an 'infrastructure model' for digital libraries. And now that the model is operational at each of the partner sites, the individual 'work packages' are still proceeding in tandem, within a unified working environment.

All these work packages are organised around a series of specialised digital library applications that have been integrated into a single system —three of them involving the use of corpora as test beds for new applications. The methodology relies upon the development of an 'indexing architecture' that can be applied to a range of languages. This allows us to apply the same tools to every text in the system.

Although we brought existing corpora to the project, the infrastructure that we have developed will also allow us to integrate other texts at a very low cost. And so the corpora that we created and integrated into our systems can make a substantial contribution to future research on our shared linguistic heritage.

 

Next Page: Lexicographic Resources

Latest news

Kennedy Professorship of Latin

19 January 2026

The Faculty is delighted to announce that Professor Christopher Whitton has accepted election to the Kennedy Professorship of Latin from 1 October 2026.

Professor Nicholas Zair awarded Leverhulme Research Fellowship

8 January 2026

The Faculty is pleased to announce that Professor Nicholas Zair has been awarded a 3 year Leverhulme Major Research Fellowship from 2026-2029 for his project Understanding Oscan. The Fellowship will allow Nick to spend the next three years working on Oscan, which was spoken widely across Southern Italy between the fifth...

Dr Ben Gray, Assistant Professor in Classics (Ancient History)

20 October 2025

The Faculty is delighted to announce the appointment of Dr Ben Gray ( Birkbeck, University of London) as Assistant Professor in Classics (Ancient History) from 1st January 2026.

“Decoding the Desert” and “Middleton’s Architectural Odysseys” now on CUDL

29 September 2025

Two collections from the Faculty Archives, the photographs of archaeologists Richard Norton and Richard Goodchild in Libya, and notebooks of Victorian architect J. H. Middleton, have been digitised and are available to view on the Cambridge University Digital Library. A gift from the family of Professor Joyce Reynolds -...