DELIVERABLE SUBMISSION SHEET D 7.2  (Month 15)

 

To:

Jean Goederich

(Project Officer)

                                   EUROPEAN COMMISSION
                                   DG INFSO E5
                                   EUFO 3277
                                   rue Alcide de Gasperi
                                   L-2920 Luxembourg

 

From:    Project name:

Cultural Heritage Language Technologies

 

Project acronym:

CHLT

Project number:

IST-2001-32745

 

Person:

 Dolores Iorizzo

 

 

 

Organisation

ICSTM

 

Date

15 September, 2003

The following deliverable:

Deliverable name:

 Periodic Progress Report: Month 15

Deliverable number:

D 7.2

is now complete.

*  It is available for your inspection

  A copy can be sent to you on request.

  Relevant descriptive documents are attached.

  2 bound, 1 unbound copies herewith (public deliverables).

  2 copies herewith (other deliverables).

 

Tick all that apply

 

The deliverable is:

X on paper  on WWW

 

 

an event  software X other

Report attached below.

  (tick one)

 

For all paper deliverables, and other deliverables as appropriate:

Date:

15 September, 2003

Version:

1

Author:

Iorizzo and Rydberg-Cox

No. of pages:

This cover plus 12 pages

Status:

Public    Restricted    *Internal    (tick one)

Commission use only

Keywords:

 

 

Description:

 

Comments:

.

 

 

PROJECT PROGRESS REPORT

 

 

Project:                              CHLT-IST-2001-32745

Progress Report Number: 7

Period:                               1 June -  31 August 2003

Author:                              Dolores Iorizzo

Organisation                     Imperial College London

Address:                            Centre for the History of Science, Technology and Medicine

                                           Sherfield Building - Room 445,

                                           Exhibition Road, London SW7 2AZ

Email:                                d.iorizzo@ic.ac.uk          Phone:   + 44 207-594-9355

 

 

1.     Work planned during this period........................................................................................ 3

2.     Achievements of WP 1 :Advanced Digital Library Applications....................................... 4

3.     Achievements of WP 2 : Computational Linguistics.......................................................... 4

4.     Achievements of WP 3 : Collaborative Infrastructure........................................................ 5

5.     Achievements of WP 4 : Old Norse Morphological Analyzer........................................... 5

6.     Achievements of WP 5 : Neo-Latin Morphological Analyzer........................................... 5

7.     Achievements of WP 6 : Early Modern Latin Corpus....................................................... 5

8.     Achievements of WP 7 : Management of Consortium....................................................... 6

9.     Achievements of WP 8: Integration.................................................................................... 6

10.       Achievements of Workpackage 9: Dissemination and exploitation................................ 7

11.       Technical options adopted.............................................................................................. 7

12.       Meetings held.................................................................................................................. 7

13.       Assessment of interim results......................................................................................... 7

13.1.     Objectives set versus objective attained, deviations................................................... 7

13.2.     Issues.......................................................................................................................... 8

14.       Check list of deliverables completed............................................................................... 9

15.       Resources use for the first period................................................................................. 11

16.       Work planned next reporting period............................................................................. 12

17.       Others........................................................................................................................... 12

18.       Names or address change, responsibility reassignment or other................................... 13

 


 

Work planned during this period

<work packages/tasks due this reporting period>

CHLT has been active for just over a year and this report outlines the work of months 13 15 in the second year of the project.  At the first annual review meeting CHLT was given permission to move to a quarterly reporting system (instead of a bi-monthly one) in light of the striking imbalance of reporting requirement between the US and EC partners. Work in this period has concentrated on three phases: Phase 2: Tools Development was initiated in month 6 and has continued on schedule, as has Phase 3: Evaluation which began in month 10.  Phase 4: Integration has been active from the start of the project. Progress has been made in the following areas:

 

 

    Document Cluster Visualization Tool

    Word Profile Tool

    Facilities to Accept and Preserve Feedback

    Integration of Text Processing System

    Old Norse Morphological Analyzer

    Latin Morphological Analyzer

    Early Modern Latin Texts

    CHLT Website Development

    Project and Programming Integration

    Integration Meetings

    Dissemination

 

 

The following Workpackages were active during the period :

 

WP1 Advanced Digital Library Applications

WP2 Computational Linguistics

WP3 Collaborative Infrastructure

WP4 Old Norse Morphological Analyzer

WP5 Neo-Latin Morphological Analyzer

WP6 Test-bed Development

WP7 Management of Consortium

WP8 Integration

WP9 Dissemination and Exploitation

Achievements of WP 1 : Advanced Digital Library Applications

WP Leader : Stefan Reuger

Scheduled dates : 1 June 31 August, 2003

Actual dates: September 2003

Deliverables: No deliverables due this period.

 

Concentration of effort on developing the design of interface for the Cluster and Visualization Tool.  On schedule for delivery at the end of January 2004.

 

 

Estimated achievement: 42 % as planned

 

down to task level, following the same structure:

Achievements of WP 2 : Computational Linguistics

 

WP Leader : Jeff Rydberg-Cox     

Scheduled dates : 1 June 31 August, 2003

Actual dates: September 2003

Deliverables:             No deliverables due this period.

 

 

Having completed D 2.1 and 2.2 on schedule we have turned to the development of the multi-lingual information retrieval tool. On schedule for completion of extraction tool (D 2.3) in November 2003.

 

Estimated achievement: 42 % as planned

Achievements of WP 3 : Collaborative Infrastructure

WP Leader : Greg Crane                   

Scheduled dates : 1 June 31 August, 2003

Actual dates: September 2003

Deliverables:  No deliverables due this period.

                       

 

   Concentration of effort on developing naming conventions for DL objects and formulating a system for maintenance procedures for the CHLT partners and the Perseus Digital Library System.. 

 

Estimated achievement: 42 % as planned

 

 

 

Achievements of WP 4 : Old Norse Morphological Analyzer 

 

WP Leader : Tim Tangherlini     

Scheduled dates : 1 June 31 August, 2003

Actual dates: September 2003

Deliverables: No deliverables due this period

           

 

Concentration of effort on the expansion of the Old Norse database to include exceptional-irregular words forms.

 

Estimated achievement: 42 % as planned

 

 

 

Achievements of WP 5 :

 

WP Leader : Andrea Bozzi    

Scheduled dates : 1 June 31 August, 2003

Actual dates: September 2003

Deliverables: No delieverables due this period

 

   Continuation of effort in adding gender codes to account for ambiguous morphological categories, testing the results and modifying the source code. Work on schedule to complete Word Segmentation System (D 5.2) by the end of January 2004.

 

Estimated achievement: 42 % as planned

 

 

 

Achievements of WP 6 :

 

WP Leader : Ross Scaife 

Scheduled dates : 1 June 31 August, 2003        

Actual dates: September 2003

Deliverables: No deliverables due this period

 

Continuation of effort on the XML markup of Renaissance Latin texts using XML- Text Encoding Initiative (TEI) compliant standards.  These texts will be used as a test bed for IT tools and applications developed for the CHLT Digital Library System.  Workpackage on schedule for D 6.1 in month 35.

 

Estimated achievement: 42 % as planned

 

 

Achievements of WP 7 : Management of Consortium

 

WP Leaders : Dolores Iorizzo and Jeff Rydberg-Cox

Scheduled dates : 1 June 31 August, 2003        

Actual dates: Septmeber 2003

Deliverables:             D 7.2 (month 15)

 

Concentration of effort on making sure CHLT partners respect the importance of timely reporting to the EC, and the collaboration of its members.

 

 

Estimated achievement: 42% as planned

 

 

Achievements of WP 8: Integration

 

WP Leaders : Greg Crane, Jeff Rydberg-Cox, Stefan Rueger, Dolores Iorizzo    

Scheduled dates : 1 June 31 August, 2003

Actual dates: September 2003

Deliverables:  No deliverables due this period

 

 

Integration has concentrated on implementation of code standards for plug-in modules for the core digital library system shared by all CHLT partners.

 

Estimated achievement: 42 % as planned

 

 

 

 

 

 

Achievements of Workpackage 9: Dissemination and exploitation

 

WP Leaders : Greg Crane, Jeff Rydberg-Cox, Dolores Iorizzo       

Scheduled dates : 1 June 31 August, 2003

Actual dates: September 2003

Deliverables: No deliverables due this period.

 

Concentration of effort on identifying and linking to other digital library projects that share the aims of CHLT and therefore may benefit from the dissemination of our web-based technology.

 

Estimated achievement: 42 %, as planned

<short (1/2 pages) description of work done, difficulties encountered, etc.; if appropriate this description should be broken

Technical options adopted

 

WP1:   Considering use of GUI code that may work for loading the applet

with variable parameters.

 

WP2:   Further development of DTDs for disambiguation of ambiguous

Greek and Latin forms.

 

WP3:   Further development of abstract interfaces to morphological databases

using SOAP.

 

WP4:   Testing out different web interfaces for the Old Norse Morphological

Analyser.

 

WP5:   Modification of Gender Codes for Word Segmentation System.

 

WP6:   Further DTDs implemented for text elements and attributes.

 

 

 

Meetings held

 

No CHLT meetings held in this period.

 

 

Assessment of interim results

 

Objectives set versus objective attained, deviations

 

WP1:   Begin development of interface for the Document Cluster and Visualisation Tool.  Work progress on schedule.

No deviations.

 

WP2:   Development of multi-lingual information retrieval facilities for the

extraction tool. Work progress on schedule.

No deviations.

 

WP3:   Develop naming conventions for DL objects and create maintenance

procedure for quality control. purposes. Work progress on schedule.

No deviations.

 

WP4:   Expand Old Norse database to include irregular word forms. Work

progress on  No deviations.

 

WP5:   Add gender codes to ambiguous morphological categories,

test results and modifying source code accordingly.  Work progress

on schedule. No deviations.

 

WP6:   Mark-up Latin texts in TEI-conformant XML and deposit them in CVS

repository. Work progresses on schedule. No deviations.

 

WP7:   Continue to co-ordinate the efforts of US and EC partners to ensure

            full collaboration and completion of the technical work the

workpackages. No deviations

 

WP8:   Integrate code standards among all CHLT partners. Work progress

on schedule. No deviations.

 

WP9:   Development of links to other DL projects through web-based dissemination. Objectives are on schedule. No deviations.

 

Issues

 

 

<problems  that have arisen in the period covered by this report, decisions and measures taken, etc.>

< serious isses only.>

Issues description

Here describe issues or problems that might affect achievements, delay activities, deliverables or milestones

Action items

Corrective action envisaged by the project to overcome the issue. This include the expected impact in terms of delays, quality and quantity of work.

None

None

 

Check list of deliverables completed

<list of deliverables completed this period, and their status (Public,Restricted,Confidential) taken from TA - the whole table can be copied and progress recorded in the status column>

Del. no.

  Deliverable name

WP no.

Lead participant

Del. Type

Security*

Delivery (proj. month)

Status

 

D 1.1

 

Prototype Document Cluster Visualization Tool

 

1

 

ICSTM

 

IT

 

PUB

 

10

 

Delivered

 

 

 

D 2.1

 

 

 

Word Profile Tool

 

2

 

UMKC/CAM

 

IT

 

PUB

 

12

 

Delivered

D 2.2

Facilities to Accept and Preserve Feedback

2

UMKC/CAM

IT

PUB

12

Delivered

 

 

 

 

D 3.1

 

 

Text Processing System

 

3

PERSEUS

 

IT

 

PUB

 

1

 

Delivered

D 3.2

General Data Provider Routine for Metadata Sharing

3

PERSEUS

 

IT

 

PUB

 

10

 

Delivered

 

D 3.3

 

 

 

Metadata Harvester

 

3

 

PERSEUS

 

IT

 

PUB

 

12

 

Delivered

 

D 4.1

 

 

 

 

Report on Old Norse Morphological Analyzer

 

4

 

UCLA/

KU

 

IT/

Report

 

PUB

 

12

 

Delivered

 

D 4.2

 

 

 

 

Report on Electronic Editions of Old Norse Texts

 

4

 

UCLA/

KU

 

IT/

Report

 

PUB

 

12

 

 

Delivered

 

D 5.1

 

Report on Neo-Latin Morphological Analyzer

 

 

 

5

 

ILC- PISA

 

IT/

Report

 

PUB

 

10

 

Delivered

 

D 7.1

 

 

Report on Kick-off Meeting

And Consortium Agreement

 

 

7

 

ICSTM/UMKC

 

 

 

Report

 

PUB

 

4

 

Delivered

 

D 7.2

 

 

 

Bi-Monthly Reports

(Revised to Quarterly

Reports, June 2003)

 

7

ICSTM/UMKC

 

Report

 

PUB

 

2,4,8,10,

15

 

Delivered

 

D 7.3

 

 

Bi-Yearly Periodic Progress Reports

 

 

7

ICSTM/

UMKC

 

Report

 

PUB

 

6, 12

 

Delivered

 

D 7.4

 

 

Bi-Yearly Financial Cost Statement

 

 

7

 

ICSTM

 

Report

 

PUB

 

6, 12

 

Delivered

 

D 7.5

 

 

Yearly Consortium Report

 

7

ICSTM/

UMKC

 

Report

 

PUB

 

12

 

Delivered

 

 

 

D 8.1

 

 

Bi- Yearly Integration Meeting Report

 

 

8

ICSTM/

UMKC

PERS

 

Report

 

PUB

 

6, 12

 

Delivered

 

 

D 8.2

 

Version Control System for Core Digital Library

 

8

 

ICSTM/ UMKC

PERS

 

Report

 

PUB

 

6, 12

 

Delivered

 

 

 

D 8.3

 

 

Indexing and Input/Output Formats for Integration and Interoperablity

 

8

ICSTM/

UMKC

PERS

 

Report

 

PUB

 

6, 12

 

Delivered

 

D 9.1

 

 

CHLT Website

 

9

ICSTM/

UMKC

 

 

WEB

 

PUB

 

3, 6, 12

 

Delivered

 

D 9.2

 

Dissemination and Use

Strategy

 

9

ICSTM/

UMKC

 

Report

 

PUB

 

6

 

Delivered

 

 

 

D 9.3

 

Dissemination and Exploitation Reports

 

 

9

 

ICSTM/ UMKC

 

 

Report

 

PUB

 

12

 

Delivered

 

*Int.     Internal circulation within project (and Commission Project Officer if requested)

  Rest.  Restricted circulation list (BBC as External User) and Commission PO only

  IST    Circulation within IST Programme participants

  FP5   Circulation within Framework Programme participants

  Pub.   Public document

 

 

Resource use for 1 June 31 August 2003

 

Resources used for three months

Report period 1June - 31 August 2003

 

 

 

Partner

 

Man-month allocation by workpackage

 

 

 

 

 

Name

Code

WP01

WP02

WP03

WP04

WP05

WP06

WP07

WP08

and 09

Total

ICSTM

P01

1.5

 

.05

 

 

.2

.6

.3 + .25

2.9

UCAM-CLAS

P02

 

1.5

 

 

 

 

 

 

1.5

ILC

P03

 

 

 

 

3

 

 

 

3

KU

P04

 

 

 

1.5

 

 

 

 

1.5

Totals

 

 

 

 

 

 

 

 

 

8.9

 

           

Compared with the original resources allocation  in TA:

 

Planned resource allocation for three months

 1 September 30 November 2003

 

 

 

Partner

 

Man-month allocation by workpackage

 

 

 

 

 

Name

Code

WP01

WP02

WP03

WP04

WP05

WP06

WP07

WP08

Total

ICSTM

P01

1.5

 

.05

 

 

.2

.6

.3 + .2.5

2.9

UCAM-CLAS

P02

 

1.5

 

 

 

 

 

 

1.5

ILC

P03

 

 

 

 

3

 

 

 

3

KU

P04

 

 

 

1.5

 

 

 

 

1.5

Totals

 

 

 

 

 

 

 

 

 

8.9

 

 

 

Work planned next reporting period

 

 

WP1:   Continue design of interface for Cluster and Visualisation

Tool.

 

WP2:   Continue to work on the multi-lingual information retrieval tool.

 

WP3:   Continue work on identifying linkable reference materials for the DLS.

 

WP4:   Continue to expand database of exceptional-irregular forms in Old Norse.

           

WP5:   Continue to add gender codes to ambiguous morphological

categories, testing them and making changes to source code.

 

WP6:   XML mark-up of 100-150 pages of new text per month.

 

WP7:   Maintain timely reporting schedule and ensure workpackage

developments and targets proceed according to schedule.

 

WP8:   Maintain integration of code standards and data sharing routines.

 

WP9:   Continue to implement web-based dissemination of CHLT.

<in terms of tasks, by partner 5/6 lines/WP, more if necessary>

 

 

 

 

Others

 

CHLT has been offered technical support by respected experts in digital library technology, and we wish to acknowledge their continued interest in and practical support of the aims of CHLT.

 

Dr. Carl Lagoze, Cornell University

 

Dr. Brian Fuchs, Senior Programmer, Archimedes Project, Max Planck, Berlin

 

Julia Flanders, Director, Scholarly Technology Group, Brown University

 

Professor Susan Hockey, University College London.

 

Dr Peter Walters, UKISHELP

 

Dr Hamish Cunnigham, University of Sheffield

 

Mr Michael Hawkins, Imperial College London

 

Martin Doerr, Heraklion, Forth, Crete

 

 

 

 

 

Names or address change, responsibility reassignment or other.

 

No changes.

 

 

 

 


DELIVERABLE SUBMISSION SHEET D 7.3 (Month 18)

 

To:

Jean Goederich

(Project Officer)

                                   EUROPEAN COMMISSION
                                   DG INFSO E5
                                   EUFO 3277
                                   rue Alcide de Gasperi
                                   L-2920 Luxembourg

 

From:    Project name:

Cultural Heritage Language Technologies

 

Project acronym:

CHLT

Project number:

IST-2001-32745

 

Person:

 Dolores Iorizzo

 

 

 

Organisation

ICSTM

 

Date

28 January, 2004

The following deliverable:

Deliverable name:

Periodic Progress Report: Month 18

Deliverable number:

D 7.3

is now complete.

*  It is available for your inspection

  A copy can be sent to you on request.

  Relevant descriptive documents are attached.

  2 bound, 1 unbound copies herewith (public deliverables).

  2 copies herewith (other deliverables).

 

Tick all that apply

 

The deliverable is:

on paper  on WWW

www. CHLT.org

 

an event  software X other

Report attached below.

  (tick one)

 

For all paper deliverables, and other deliverables as appropriate:

Date:

28 January, 2004

Version:

1

Author:

Iorizzo and Rydberg-Cox

No. of pages:

This cover plus 13 pages

Status:

Public    Restricted     *Internal    (tick one)

Commission use only

Keywords:

 

 

Description:

 

Comments:

.

 

 

PROJECT PROGRESS REPORT

 

 

Project:                              CHLT-IST-2001-32745

Progress Report Number: 8

Period:                               1 June -  November 30 2003

Author:                              Dolores Iorizzo

Organisation                     Imperial College London

Address:                            Centre for the History of Science, Technology and Medicine

                                           Sherfield Building - Room 445,

                                           Exhibition Road, London SW7 2AZ

Email:                                d.iorizzo@ic.ac.uk          Phone:   + 44 207-594-9355

 

 

1.     Work planned during this period........................................................................................ 3

2.     Achievements of WP 1 :Advanced Digital Library Applications....................................... 4

3.     Achievements of WP 2 : Computational Linguistics.......................................................... 4

4.     Achievements of WP 3 : Collaborative Infrastructure........................................................ 5

5.     Achievements of WP 4 : Old Norse Morphological Analyzer........................................... 5

6.     Achievements of WP 5 : Neo-Latin Morphological Analyzer........................................... 5

7.     Achievements of WP 6 : Early Modern Latin Corpus....................................................... 5

8.     Achievements of WP 7 : Management of Consortium....................................................... 6

9.     Achievements of WP 8: Integration.................................................................................... 6

10.       Achievements of Workpackage 9: Dissemination and exploitation................................ 7

11.       Technical options adopted.............................................................................................. 7

12.       Meetings held.................................................................................................................. 7

13.       Assessment of interim results......................................................................................... 7

13.1.     Objectives set versus objective attained, deviations................................................... 7

13.2.     Issues.......................................................................................................................... 8

14.       Check list of deliverables completed............................................................................... 9

15.       Resources use for the first period................................................................................. 11

16.       Work planned next reporting period............................................................................. 12

17.       Others........................................................................................................................... 12

18.       Names or address change, responsibility reassignment or other................................... 13

 


 

Work planned during this period

<work packages/tasks due this reporting period>

CHLT has now been active for 18 months and this report outlines the work of the third six month period of the project.  There have been three cycles of work active in this period as outlined in the original description of work (p.17).  Phase 1: Fundamentals was completed on schedule in the first year and has provided a sound foundation for the later phases. Phase 2: Tools Development was initiated in month 6 and has continued on schedule, as has Phase 3: Evaluation which began in month 10.  Phase 4: Integration has been active from the start of the project. Progress has been made in the following areas:

 

 

    Document Cluster Visualization Tool

    Word Profile Tool

    Facilities to Accept and Preserve Feedback

    Integration of Text Processing System

    Old Norse Morphological Analyzer

    Latin Morphological Analyzer

    Early Modern Latin Texts

    CHLT Website Development

    Project and Programming Integration

    Integration Meetings

    Dissemination

 

 

The following Workpackages were active during the period :

 

WP1 Advanced Digital Library Applications

WP2 Computational Linguistics

WP3 Collaborative Infrastructure

WP4 Old Norse Morphological Analyzer

WP5 Neo-Latin Morphological Analyzer

WP6 Test-bed Development

WP7 Management of Consortium

WP8 Integration

WP9 Dissemination and Exploitation

Achievements of WP 1 : Advanced Digital Library Applications

WP Leader : Stefan Reuger

Scheduled dates : 1 June - November 2003

Actual dates: December 2003

Deliverables: No deliverables due this period

 

The main work of this period has been focused on the design of the interface for the Cluster and Visualization Tool, and its testing for use in three areas (i) indexing and keyword extraction, (ii) document clustering and (iii) the visualization of search results. On schedule for delivery of D 1.2 at the end of January 2004.

 

We have also made notable progress ahead of schedule in several areas: (iv) integrating visualization with the Perseus Digital Library System, (v)  indexing Ancient Greek Documents, (vi) rendering documents in JAVA, (vii) loading visualization applet to allow for query, language and collection information available on start-up, (viii) linking our visualization with the rich analysis tools developed by Perseus (WP3) and (ix) writing an interface for the WP4 the Old Norse Morphological Analyser.

 

Estimated achievement: 50 % as planned

 

down to task level, following the same structure:

Achievements of WP 2 : Computational Linguistics

 

WP Leader : Jeff Rydberg-Cox     

Scheduled dates : 1 June - November 2003

Actual dates: December 2003

Deliverables:             D 2.3: Tool to Extract Corpus Based Thesauri from Corpus

 

 

The main work of this period has been the completion on schedule of (i) D 2.1 (Word Profile Tool) and (ii) D 2.2 (Facilities to Accept, Preserve and Integrate User Feedback) in month 12, which has meant that we have been able to develop our extraction tool ahead of schedule with good results. Work has also begun on the development of the theoretical models on which the multi-lingual retrieval facilities will be based, as well as addressing problems relating to sectional preferences and sub-categorization frames.

 

 

Estimated achievement: 50 % as planned

Achievements of WP 3 : Collaborative Infrastructure

WP Leader : Greg Crane                   

Scheduled dates : 1 June - November 2003

Actual dates: December 2003

Deliverables:  D 3.4: Report on Naming Conventions

                        D 3.5: Maintenance Procedures for Naming Conventions

                       

 

   Work in this period has been diverse and productive.  We have developed and delivered (i) naming conventions (D 3.4) and (ii) maintenance procedures for implementing quality control among CHLT partners and the Perseus Digital Library System (D 3.5).

 

We are also ahead of schedule in (iii) building a clear and well-documented API to accelerate the development of advanced digital library applications, (iv) developing abstract interfaces to our morphological databases, and (v) experimenting with new ways to use OAI protocols to share large volumes of data received from reference works.

 

Estimated achievement: 50 % as planned

 

 

 

Achievements of WP 4 : Old Norse Morphological Analyzer 

 

WP Leader : Tim Tangherlini     

Scheduled dates : 1 June - November 2003

Actual dates: December 2003

Deliverables: No deliverables due this period

           

 

The main work of this period has focused on (i) the expansion of the Old Norse database to include exceptional-irregular words forms, and (ii) the development of an integrated viewing environment for normalized and diplomatic texts.

 

Estimated achievement: 50 % as planned

 

 

 

Achievements of WP 5 :

 

WP Leader : Andrea Bozzi    

Scheduled dates : 1 June - November 2003

Actual dates: December 2003

Deliverables: No delieverables due this period

 

   The main work of this period has focused on (i) the development of a word segmentation system, (ii) adding gender codes to the LES ambiguous morphological categories, (iii) implementing new LE rules, (iv) coding exceptional and invariable forms, and (v) testing the lemmatization results and modifying the source code. We also (vi) continue to document implementation functions, data structures and algorithms. Our work is on schedule to complete Word Segmentation System (D 5.2) by the end of January 2004.

 

Estimated achievement: 50 % as planned

 

 

 

Achievements of WP 6 :

 

WP Leader : Ross Scaife 

Scheduled dates : 1 June - November 2003          

Actual dates: December 2003

Deliverables: No deliverables due this period

 

Progress continues according to schedule on the XML markup of Renaissance Latin texts using XML- Text Encoding Initiative (TEI) compliant standards.  These texts will be used as a test bed for IT tools and applications developed for the CHLT Digital Library System.  Workpackage on schedule for D 6.1 in month 35.

 

Estimated achievement: 50 % as planned

 

 

Achievements of WP 7 : Management of Consortium

 

WP Leaders : Dolores Iorizzo and Jeff Rydberg-Cox

Scheduled dates : 1 June - November 2003          

Actual dates: December 2003

Deliverables:             D 7.2, D 7.3, D 7.4

 

    7.2 Month 15 Periodic Progress Report

    7.3 Month 18 Periodic Progress Report

    7.4 Month 18 Cost Statement

 

 

We have made progress in getting the message across that a close collaboration of workpackages and timely reporting is an integral part of the work of CHLT.  As a result there has been an increased effort in getting reports and deliverables in on schedule for all CHLT partners.

 

 

 

Estimated achievement: 50% as planned

 

 

Achievements of WP 8: Integration

 

WP Leaders : Greg Crane, Jeff Rydberg-Cox, Stefan Rueger, Dolores Iorizzo    

Scheduled dates : 1 June - November 2003

Actual dates: December 2003

Deliverables:  D 8.1 Integration Report

                        D 8.2 Version Control Maintenance

                        D 8.3 Code Standards for Compatibility of Plug-in Modules

 

 

Integration in the third six month period has concentrated on programming integration, testing out data sharing routines, maintenance of the version control system, and implementation of code standards for plug-in modules for the core digital library system which is shared by all CHLT partners.

 

Estimated achievement: 50 % as planned

 

 

 

Achievements of Workpackage 9: Dissemination and exploitation

 

WP Leaders : Greg Crane, Jeff Rydberg-Cox, Dolores Iorizzo       

Scheduled dates : 1 June - November 2003

Actual dates: December 2003

Deliverables: D 9.3: Annual Dissemination and Exploitation Report

 

Update of CHLT.org website shows results of workpackages to ensure maximum exposure in the digital library community of the technical tools, applications and infrastructure systems that are being developed by CHLT; added links to other digital library projects have implemented web-based dissemination.

 

Estimated achievement: 50 %, as planned

<short (1/2 pages) description of work done, difficulties encountered, etc.; if appropriate this description should be broken

Technical options adopted

 

WP1:   Re-writing of GUI code necessary for loading the applet with variable

parameters and to address the problem of initialising the panels with

search results.

 

WP2:   Further development of DTDs for disambiguation of ambiguous

Greek and Latin forms.

 

WP3:   Further development of abstract interfaces to morphological databases

using SOAP and XML-RPC.

 

 

WP4:   Further development of web interface for the Old Norse Morphological

Analyser to integrate work with WP1 Cluster and Visualisation Tool.

 

WP5:   Modification of Gender Codes for Word Segmentation System.

 

WP6:   Further DTDs implemented for text elements and attributes.

 

 

 

Meetings held

 

 

CHLT Collaborators Meeting, November 2003, Kansas City, Missouri

 

CHLT Integration Meeting, November 2003, Kansas City, Missouri

 

List attended meetings, by whom, where and for how long (internal project  meetings as well as external ones).

 

Assessment of interim results

 

Objectives set versus objective attained, deviations

 

WP1:   Set objective was to begin development and design of interface for the

Document Cluster and Visualisation Tool which acts as a text search engine.  Work progresses on or ahead of schedule (see Section 1).

No deviations.

 

WP2:   Set objective was to complete work on Word Profile Tool (D 2.1) and

Develop Facilities to Accept, Preserve and Integrate User Feedback (D 2.2), and proceed with work laying groundwork for multi-lingual information retrieval facilities, sectional preferences, and sub-categorisation frames.  Work progresses on schedule.

No deviations.

 

WP3:   Set objectives were to report on naming conventions of digital library objects; yet we also managed to maintain OAI compliant data provider and metadata harvester; distribute procedures for naming conventions to all CHLT partners and follow up with quality control maintenance of the DL system.  Work progresses on or ahead of schedule.

No deviations.

 

WP4:   Set objectives were to refine Old Norse Morphological Analyser;

continue with the mark-up of Old Norse texts in diplomatic and normalised versions; and to test the morphological analyser with the marked-up text.  We also were able to start work on integrating the Old Norse Morphological Analyzer with the Cluster and Visualisation Tool ( WP1) ahead of schedule. Work progresses on or ahead of schedule.  No deviations.

 

WP5:   Set objective was to begin development of a Word Segmentation

System for the Latin morphological analyser. Work progresses on schedule.

No deviations.

 

WP6:   Set objective was to mark-up Latin texts in TEI-conformant XML and

commit them to CVS repository. Work progresses on schedule.

No deviations.

 

WP7:   Set objectives were to co-ordinate the efforts of US and EC partners to

ensure full collaboration and completion of the technical work the workpackages.

No deviations

 

WP8:   Set objectives were to integrate tools and applications developed by the

workpackages; maintain the version control system for the core digital

library, and to maintain code standards among all CHLT partners.

Work progresses on schedule. No deviations.

 

WP9:   Set objectives were to further develop CHLT website as a vehicle of dissemination; publish papers on the results of CHLT workpackages; and give papers at international conferences on the work of CHLT.  Objectives are on schedule.

No deviations.

 

Issues

 

 

<problems  that have arisen in the period covered by this report, decisions and measures taken, etc.>

< serious isses only.>

Issues description

Here describe issues or problems that might affect achievements, delay activities, deliverables or milestones

Action items

Corrective action envisaged by the project to overcome the issue. This include the expected impact in terms of delays, quality and quantity of work.

None

None

 

Check list of deliverables completed

<list of deliverables completed this period, and their status (Public,Restricted,Confidential) taken from TA - the whole table can be copied and progress recorded in the status column>

Del. no.

  Deliverable name

WP no.

Lead participant

Del. Type

Security*

Delivery (proj. month)

Status

 

D 1.1

 

Prototype Document Cluster Visualization Tool

 

1

 

ICSTM

 

IT

 

PUB

 

10

 

Delivered

 

 

 

D 2.1

 

 

 

Word Profile Tool

 

2

 

UMKC/CAM

 

IT

 

PUB

 

12

 

Delivered

D 2.2

Facilities to Accept and Preserve Feedback

2

UMKC/CAM

IT

PUB

12

Delivered

 

 

D 3.1

 

 

Text Processing System

 

3

PERSEUS

 

IT

 

PUB

 

1

 

Delivered

D 3.2

General Data Provider Routine for Metadata Sharing

3

PERSEUS

 

IT

 

PUB

 

10

 

Delivered

 

D 3.3

 

 

 

Metadata Harvester

 

3

 

PERSEUS

 

IT

 

PUB

 

12

 

Delivered

 

D 4.1

 

 

 

 

Report on Old Norse Morphological Analyzer

 

4

 

UCLA/

KU

 

IT/

Report

 

PUB

 

12

 

Delivered

 

D 4.2

 

 

 

 

Report on Electronic Editions of Old Norse Texts

 

4

 

UCLA/

KU

 

IT/

Report

 

PUB

 

12

 

 

Delivered

 

D 5.1

 

Report on Neo-Latin Morphological Analyzer

 

 

 

5

 

ILC- PISA

 

IT/

Report

 

PUB

 

10

 

Delivered

 

D 7.1

 

 

Report on Kick-off Meeting

And Consortium Agreement

 

 

7

 

ICSTM/UMKC

 

 

 

Report

 

PUB

 

4

 

Delievered

 

D 7.2

 

 

 

Bi-Monthly Reports

(Revised to Quarterly

Reports, June 2003)

 

7

ICSTM/UMKC

 

Report

 

PUB

 

2,4,8,10,

15

 

Delivered

 

D 7.3

 

 

Bi-Yearly Periodic Progress Reports

 

 

7

ICSTM/

UMKC

 

Report

 

PUB

 

6, 12, 18

 

Delivered

 

D 7.4

 

 

Bi-Yearly Financial Cost Statement

 

 

7

 

ICSTM

 

Report

 

PUB

 

6, 12, 18

 

Delivered

 

D 7.5

 

 

Yearly Consortium Report

 

7

ICSTM/

UMKC

 

Report

 

PUB

 

12

 

Delivered

 

 

 

D 8.1

 

 

Bi- Yearly Integration Meeting Report

 

 

8

ICSTM/

UMKC

PERS

 

Report

 

PUB

 

6, 12, 18

 

Delivered

 

 

D 8.2

 

Version Control System for Core Digital Library

 

8

 

ICSTM/ UMKC

PERS

 

Report

 

PUB

 

6, 12, 18

 

Delivered

 

 

 

D 8.3

 

 

Indexing and Input/Output Formats for Integration and Interoperablity

 

8

ICSTM/

UMKC

PERS

 

Report

 

PUB

 

6, 12, 18

 

Delivered

 

D 9.1

 

 

CHLT Website

 

9

ICSTM/

UMKC

 

 

WEB

 

PUB

 

3, 6, 12, 18

 

Delivered

 

D 9.2

 

Dissemination and Use

Strategy

 

9

ICSTM/

UMKC

 

Report

 

PUB

 

6

 

Delivered

 

 

 

D 9.3

 

Dissemination and Exploitation Reports

 

 

9

 

ICSTM/ UMKC

 

 

Report

 

PUB

 

12

 

Delivered

 

*Int.     Internal circulation within project (and Commission Project Officer if requested)

  Rest.  Restricted circulation list (BBC as External User) and Commission PO only

  IST    Circulation within IST Programme participants

  FP5   Circulation within Framework Programme participants

  Pub.   Public document

 

 

 

 

 

 

 

 

 

 

 

 

Resource use for the second 6th month period

 

Resources used for third 6 months

Report period June 2003- Nov 2003

 

 

 

Partner

 

Man-month allocation by workpackage

 

 

 

 

 

Name

Code

WP01

WP02

WP03

WP04

WP05

WP06

WP07

WP08

and 09

Total

ICSTM

P01

3

 

.1

 

 

.4

1.2

.6 + .5

5.8

UCAM-CLAS

P02

 

3

 

 

 

 

 

 

3

ILC

P03

 

 

 

 

6

 

 

 

6

KU

P04

 

 

 

3

 

 

 

 

3

Totals

 

 

 

 

 

 

 

 

 

17.8

 

           

Compared with the original resources allocation  in TA:

 

Planned resource allocation for forth six-months

 Dec 2003 / May 2004

 

 

 

Partner

 

Man-month allocation by workpackage

 

 

 

 

 

Name

Code

WP01

WP02

WP03

WP04

WP05

WP06

WP07

WP08

Total

ICSTM

P01

3

 

.1

 

 

.4

1.2

.6 + .5

5.8

UCAM-CLAS

P02

 

3

 

 

 

 

 

 

3

ILC

P03

 

 

 

 

6

 

 

 

6

KU

P04

 

 

 

3

 

 

 

 

3

Totals

 

 

 

 

 

 

 

 

 

17.8

 

 

 

Work planned next reporting period

 

 

WP1:   Finalise work on the design of interface for Cluster and Visualisation

Tool.  A major concern will be to endow the applet with greater

flexibility to deal gracefully with different languages and sets of document collections.  We will also be concerned with the integration of the Greek rendering module in the visualisation and display programme.

 

WP2:   Continue to work on the multi-lingual information retrieval tool and

Make progress on document architecture issues.  We also hope to

Finalise the citation scheme map and begin integration into the word

study tool.

 

WP3:   Continue to work on identifying and exposing linkable reference

materials for the DLS.

 

WP4:   Continue to expand database of exceptional-irregular forms in Old Norse; continue to work with the WP1 and WP3 teams to refine and integrate the Cluster and Visualisation Tool for Old Norse into the CHLT Digital Library System.

           

WP5:   Continue to  (i) add gender codes to the LES ambiguous morphological

categories (ii) implement new LE rules, (iii) code FE (exceptional forms) and I (invariable forms), (iv) test lemmatisation results and feed them back into the system to refine programme, (v) continue to modify source code as a result of the refinements, and (vi) document implementation functions, data structures and algorithms.

 

WP6:   XML mark-up of 100-150 pages of new text per month.

 

WP7:   Maintain timely reporting schedule and ensure workpackage

developments and targets proceed according to schedule.

 

WP8:   Maintain integration of (i) code standards, (ii) data sharing routines

and (iii) metadata harvester for all CHLT workpackages, and integrate results into the Digital Library System.

 

WP9:   Continue to implement web-based dissemination of CHLT; publish

papers on the results of CHLT workpackages; attend conferences relevant to the work of CHLT; and organise conferences, meetings, seminars for dissemination of results of CHLT.

<in terms of tasks, by partner 5/6 lines/WP, more if necessary>

 

 

 

 

Others

 

CHLT has been offered technical support by respected experts in digital library technology, and we wish to acknowledge their continued interest in and practical support of the aims of CHLT.

 

Dr. Carl Lagoze, Cornell University

 

Dr. Brian Fuchs, Senior Programmer, Archimedes Project, Max Planck, Berlin

 

Julia Flanders, Director, Scholarly Technology Group, Brown University

 

Professor Susan Hockey, University College London.

 

Dr Peter Walters, UKISHELP

 

Dr Hamish Cunnigham, University of Sheffield

 

Mr Michael Hawkins, Imperial College London

 

 

 

Names or address change, responsibility reassignment or other.

 

No changes.

 

 

 


DELIVERABLE SUBMISSION SHEET D 7.2  Month 21

 

To:

Jean Goederich

(Project Officer)

                                   EUROPEAN COMMISSION
                                   DG INFSO E5
                                   EUFO 3277
                                   rue Alcide de Gasperi
                                   L-2920 Luxembourg

 

From:    Project name:

Cultural Heritage Language Technologies

 

Project acronym:

CHLT

Project number:

IST-2001-32745

 

Person:

 Dolores Iorizzo

 

 

 

Organisation

ICSTM

 

Date

30 March, 2004

The following deliverable:

Deliverable name:

Periodic Progress Report: Month 21

Deliverable number:

D 7.2

is now complete.

*  It is available for your inspection

  A copy can be sent to you on request.

  Relevant descriptive documents are attached.

  2 bound, 1 unbound copies herewith (public deliverables).

  2 copies herewith (other deliverables).

 

Tick all that apply

 

The deliverable is:

on paper  on WWW

www. CHLT.org

 

an event  software X other

Report attached below.

  (tick one)

 

For all paper deliverables, and other deliverables as appropriate:

Date:

30 March, 2004

Version:

1

Author:

Iorizzo and Rydberg-Cox

No. of pages:

This cover plus  14 pages

Status:

Public    Restricted    *Internal    (tick one)

Commission use only

Keywords:

 

 

Description:

 

Comments:

.

 

 

PROJECT PROGRESS REPORT

 

 

Project:                              CHLT-IST-2001-32745

Progress Report Number: 9

Period:                               1 December 2003 -  29 February 2004

Author:                              Dolores Iorizzo

Organisation                     Imperial College London

Address:                            Centre for the History of Science, Technology and Medicine

                                           Sherfield Building - Room 445,

                                           Exhibition Road, London SW7 2AZ

Email:                                d.iorizzo@ic.ac.uk          Phone:   + 44 207-594-9355

 

 

1.     Work planned during this period........................................................................................ 3

2.     Achievements of WP 1 :Advanced Digital Library Applications....................................... 4

3.     Achievements of WP 2 : Computational Linguistics.......................................................... 4

4.     Achievements of WP 3 : Collaborative Infrastructure........................................................ 5

5.     Achievements of WP 4 : Old Norse Morphological Analyzer........................................... 5

6.     Achievements of WP 5 : Neo-Latin Morphological Analyzer........................................... 5

7.     Achievements of WP 6 : Early Modern Latin Corpus....................................................... 5

8.     Achievements of WP 7 : Management of Consortium....................................................... 6

9.     Achievements of WP 8: Integration.................................................................................... 6

10.   Achievements of Workpackage 9: Dissemination and exploitation.................................... 7

11.   Technical options adopted.................................................................................................. 7

12.   Meetings held...................................................................................................................... 7

13.   Assessment of interim results............................................................................................. 7

13.1.     Objectives set versus objective attained, deviations................................................... 7

13.2.     Issues.......................................................................................................................... 8

14.   Check list of deliverables completed................................................................................... 9

15.   Work planned next reporting period................................................................................. 12

16.   Others............................................................................................................................... 12

17.   Names or address change, responsibility reassignment or other....................................... 13

 


 

Work Planned for this Period

<work packages/tasks due this reporting period>

CHLT has now been active for 21 months and this report outlines the work of the third quarter of the second year of the project.  There have been three cycles of work active in this period. Phase 2: Tools Development was initiated in month 6 and has continued on schedule. Phase 3: Evaluation began in month 10 and is also on schedule.  Phase 4: Integration has been active from the start of the project.

 

 Progress has been made in the following areas:

 

    Document Cluster Visualization Tool

    Word Profile Tool

    Facilities to Accept and Preserve Feedback

    Integration of Text Processing System

    Old Norse Morphological Analyzer

    Latin Morphological Analyzer

    Early Modern Latin Texts

    CHLT Website Development

    Project and Programming Integration

    Integration Meetings

    Dissemination

 

 

The following Workpackages were active during the period :

 

WP1 Advanced Digital Library Applications

WP2 Computational Linguistics

WP3 Collaborative Infrastructure

WP4 Old Norse Morphological Analyzer

WP5 Neo-Latin Morphological Analyzer

WP6 Test-bed Development

WP7 Management of Consortium

WP8 Integration

WP9 Dissemination and Exploitation


Achievements of WP 1 : Advanced Digital Library Applications

 

WP Leader: Stefan Reuger

Scheduled dates: 1 December 2003 29 February 2004

Actual dates: 30 March 2004

Deliverables: D 1.2

 

The main work of this period has been to finalize work on the design of the interface for the Document Cluster and Visualization Tool; it has been tested and completed on schedule. Additional progress on this tool has been made ahead of schedule in two main areas:

 

(i)             Modifying the Document Cluster and Visualization Tool to handle a multiplicity of distinct document collections; this is an essential requirement for making it widely applicable to texts that exist in more than one collection, such as Greek texts that have been translated into Latin.

 

(ii)           The building of new collections.  What used to be two tightly linked, although functionally quite distinct indexing steps are now neatly separated, each with its own Java wrapper which provides a standard API to other programs.  We also designed a graphical user interface that accesses these wrappers to index new collections.

 

 

Estimated achievement: 67 % ahead of schedule

down to task level, following the same structure:

Achievements of WP 2: Computational Linguistics

 

WP Leader: Jeff Rydberg-Cox     

Scheduled dates: 1 December 2003 29 February 2004

Actual dates: 30 March 2004

Deliverables:             No deliverables due this period.

 

 

The main work of this period has been on the development of the multi-lingual information retrieval tool, with special focus on:

 

(i)        Methods for query expansion;

 

(ii)       Methods for the extraction of translation equivalents from parallel

and comparable corpora;

 

(iii)      Refining our user interface to integrate our results with those of WP1;

 

(iv)      Dissemination of results to JCDL, ECDL and the New England Journal

of Classics.

 

Estimated achievement: 58 % as planned

 

Achievements of WP 3: Collaborative Infrastructure

 

WP Leader: Greg Crane    

Scheduled dates: 1 December 2003 29 February 2004

Actual dates: 30 March 2004

Deliverables:             No deliverables due this period.

 

              The main work of this period has been to create a robust Digital Library

Infrastucture for data-sharing based on a system of unique name identifiers

for organisations, collections and individual digital objects.  The focus has

been on integrating OAI-derived metadata into every aspect of the digital

library system, from catalogue browsing to full text searching to implicit

linking between documents.

 

Estimated achievement: 58 % as planned

 

 

 

Achievements of WP 4: Old Norse Morphological Analyzer

 

WP Leader: Tim Tangerlini    

Scheduled dates: 1 December 2003 29 February 2004

Actual dates: 30 March 2004

Deliverables:             No deliverables due this period.

 

The main work of this period has been the continuing expansion of the Old

Norse database to include a greater number of exceptional-irregular words

forms.  We have also made considerable progress in developing an integrated

viewing environment for normalized and diplomatic texts to integrate our

work with the results of WP1's Visualization Tool.

 

Estimated achievement: 58 % as planned

 

 

 

Achievements of WP 5: Neo-Latin Morphological Analyzer

 

WP Leader: Andrea Bozzi

Scheduled dates: 1 December 2003 29 February 2004

Actual dates: 30 March 2004

Deliverables: D 5.2 Word Segmentation System for Latin Analyzer

 

   The main work of this period has focused on (i) implementing and testing

new LE rules, (ii) completing on schedule the Word Segmentation System and  (iii) continuing to document implementation functions, data structures and algorithms.

 

Estimated achievement: 58 % as planned

Achievements of WP 6: Test Bed Development

 

WP Leader: Ross Scaife

Scheduled dates: 1 December 2003 29 February 2004

Actual dates: 30 March 2004

Deliverables: No deliverables due this period

 

Progress continues according to schedule on the XML markup of Renaissance Latin texts using XML- Text Encoding Initiative (TEI) compliant standards.  These texts will be used as a test bed for IT tools and applications developed for the CHLT Digital Library System.  Workpackage on schedule for D 6.1 in month 35.

 

Estimated achievement: 58 % as planned

 

 

Achievements of WP 7 : Management of Consortium

 

WP Leader: Dolores Iorizzo and Jeff Rydberg-Cox

Scheduled dates: 1 December 2003 29 February 2004

Actual dates: 30 March 2004

Deliverables: D 7.2:  Month 21 Periodic Progress Report

 

We continue to make progress in getting the message across that a close collaboration of partners and timely reporting is an integral part of the work of CHLT.  As a result there has been greater integration of results and a general boost in morale.

 

Estimated achievement: 58% as planned

 

 

 

 

 

 

 

 

 

 

Achievements of WP 8: Integration

 

WP Leaders : Greg Crane, Jeff Rydberg-Cox, Stefan Rueger, Dolores Iorizzo 

Scheduled dates: 1 December 2003 29 February 2004

Actual dates: 30 March 2004

Deliverables: No deliverables due this period

 

Integration continues to focus on programming integration, testing out data sharing routines, maintenance of the version control system, and implementation of code standards for plug-in modules for the core digital library system shared by CHLT partners.  The most active integration work this period has been between WP1 - WP2 (getting the Greek Word Tool to run with the Visualization Tool) and  WP1 - WP4 (getting the Old Norse to work with the Visualization Tool), and getting the efforts of WP1-WP2-WP3-WP4 to work within the PERSEUS/CHLT Digital Library System.

 

Estimated achievement: 58 % as planned

 

 

Achievements of WP 9: Dissemination and exploitation

 

WP Leaders : Greg Crane, Jeff Rydberg-Cox, Dolores Iorizzo

Scheduled dates: 1 December 2003 29 February 2004

Actual dates: 30 March 2004

Deliverables: No deliverable due this period

 

 

Now that we have results from the project we have been able to take to the road and give demonstrations to other research groups.  We have made an effort to disseminate our work not only via the www.CHLT.org website, but to seek out opportunities to give presentations internationally at the Mellon Foundation in New York, the NSF and Library of Congress in Washington DC, and at Cornell University, Ithaca New York. We also hosted a conference at Imperial College London on 'Knowledge Sharing and the Semantic Web for Cultural Heritage Projects' which attracted a good number of people across the Library, Archive, Museum and Computer Science communities; this has opened up a number of new directions for the future of CHLT.

 

Estimated achievement: 58 %, as planned

 

 

<short (1/2 pages) description of work done, difficulties encountered, etc.; if appropriate this description should be broken

Technical options adopted

 

WP1: Use of MG wrapper using JAVA API to run the mgbuild shell

Script; and use of CK wrapper with JAVA and C for extraction and

indexing of candidate key words.

 

WP2:   Refinement of JAVA API and indexing format for generic text display.

 

WP3:   Continued use of SOAP, XML-RPC and OAI-derived metadata.

 

WP4:   Refinement of JAVA API web interface for the Old Norse

Morphological Analyser for integration with WP1 Visualisation Tool.

 

WP5:   Refinement of LE Codes for Word Segmentation System.

 

WP6:   Further DTDs implemented for text elements and attributes.

 

 

 

Meetings Held

 

 

Presentation of CHLT Work in Progress to the Mellon Foundation, NYC.

            10 11 December, 2003.

 

Presentation of CHLT Work in Progress at Cornell University, Ithaca, NY

            15-16 December, 2003.

 

Conference on 'Knowledge Sharing and the Semantic Web for Cultural

Heritage Projects',  Imperial College London, 11 February, 2004.

 

List attended meetings, by whom, where and for how long (internal project  meetings as well as external ones).

 

Assessment of interim results

 

Objectives set versus objective attained, deviations

 

WP1:   Set objective was to finalise development and design of interface for the Document Cluster and Visualisation Tool: D 2.1 delivered on schedule.  Work progresses ahead of schedule. No deviations.

 

WP2:   Set objective was to proceed with development of multi-lingual

information tool.  Work progresses on schedule. No deviations.

 

 

WP3:   Set objectives were to focus on building robust architecture for

Digital Library Sysem.  Work progresses on or ahead of schedule.

No deviations.

 

WP4:   Set objectives were to continue to refine Old Norse Morphological

Analyser and mark-up Old Norse texts in diplomatic and normalised versions; as well as to regularly test the morphological analyser with the marked-up text.  We also were scheduled to continue work on integrating the Old Norse Morphological Analyzer with the WP1 Visualisation Tool. Work progresses on schedule.  No deviations.

 

WP5:   Set objective was to deliver Word Segmentation System for the Latin

morphological analyser. Work progresses on schedule. No deviations.

 

WP6:   Set objective was to mark-up Latin texts in TEI-conformant XML and

commit them to CVS repository. Work progresses on schedule.

No deviations.

 

WP7:   Set objectives were to co-ordinate the efforts of US and EC partners to

ensure full collaboration and completion of the technical work the workpackages. No deviations

 

WP8:   Set objectives were to integrate tools and applications developed by the

workpackages; maintain the version control system for the core digital

library, and to maintain code standards among all CHLT partners.

Work progresses on schedule. No deviations.

 

WP9:   Set objectives were to further develop CHLT website as a vehicle of dissemination; publish papers on the results of CHLT workpackages; and give papers at international conferences on the work of CHLT.  Objectives are on schedule. No deviations.

 

Issues

 

Issues description

Here describe issues or problems that might affect achievements, delay activities, deliverables or milestones

Action items

Corrective action envisaged by the project to overcome the issue. This include the expected impact in terms of delays, quality and quantity of work.

None

None

 

Check List of Deliverables

<list of deliverables completed this period, and their status (Public,Restricted,Confidential) taken from TA - the whole table can be copied and progress recorded in the status column>

Del. no.

  Deliverable name

WP no.

Lead participant

Del. Type

Security*

Delivery (proj. month)

Status

 

D 1.1

 

Prototype Document Cluster Visualization Tool

 

1

 

ICSTM

 

IT

 

PUB

 

10

 

Delivered

 

 

 

 

D

1.2

 

 

 

Document Cluster and Visualization Tool

 

 

1

 

ICSTM

 

IT

 

PUB

 

20

 

Delivered

D 1.3

Report on Cluster Visualization Tool

1

ICSTM

IT

PUB

24

 

 

D 2.1

 

 

 

Word Profile Tool

 

2

 

UMKC/CAM

 

IT

 

PUB

 

12

 

Delivered

D 2.2

Facilities to Accept and Preserve Feedback

2

UMKC/CAM

IT

PUB

12

Delivered

 

D 2.3

 

 

Tool to Extract Corpus Based  Thesauri from Corpus

2

UMKC/CAM

IT

PUB

16

Delivered

D 2.4

Multi-lingual Information Retrieval Tool

2

UMKC/CAM

IT

PUB

24

 

D 2.5

Syntactic Parsing Toolbox

2

UMKC/CAM

IT

PUB

35

 

 

D 3.1

 

 

Text Processing System

 

3

PERSEUS

 

IT

 

PUB

 

1

 

Delivered

D 3.2

General Data Provider Routine for Metadata Sharing

3

PERSEUS

 

IT

 

PUB

 

10

 

Delivered

 

D 3.3

 

 

 

Metadata Harvester

 

3

 

PERSEUS

 

IT

 

PUB

 

12

 

Delivered

D 3.4

Report on Naming Conventions

3

PERSEUS

IT

PUB

18

Delivered

D 3.5

Maintenance Procedures for Naming Conventions

3

PERSEUS

IT

PUB

18

Delivered

D 3.6

Prototype Metadata Sharing System Between Two Libraries

 

 

3

PERSEUS

 

 

Proto

 

 

PUB

 

 

24

 

D 3.7

Use of Metadata Sharing System by CHLT Partners

 

3

PERSEUS

 

IT

 

PUB

 

35

 

 

D 4.1

 

 

 

 

Report on Old Norse Morphological Analyzer

 

4

 

UCLA/

KU

 

IT/

Report

 

PUB

 

12

 

Delivered

 

D 4.2

 

 

 

 

Report on Electronic Editions of Old Norse Texts

 

4

 

UCLA/

KU

 

IT/

Report

 

PUB

 

12

 

 

Delivered

D 4.3

Report on Digitization of Old Norse MSS

4

KU

Report

PUB

24

 

D 4.4

Images of Old Norse MS Linked to Tagged Text

 

4

UCLA/KU

 

IT

 

PUB

 

30

 

D 4.5

Prototype Reading Environment for Old Norse Texts

 

4

UCLA/KU

 

Proto

 

PUB

 

30

 

D 4.6

Complete Digitization of Old Norse MSS linked to Tagged Text

 

4

UCLA/KU

 

IT

 

PUB

 

35

 

D 4.7

Integration of Old Norse Texts with Morphological Analyzer in Integrated Reading Environment

 

 

4

 

UCLA/KU

 

 

IT

 

 

PUB

 

 

 

D 5.1

 

Report on Neo-Latin Morphological Analyzer

 

 

 

5

 

ILC- PISA

 

IT/

Report

 

PUB

 

10

 

Delivered

D 5.2

Word Segmentation System for Latin

Analyzer

 

5

 

ILC-PISA

 

IT

 

PUB

 

20

 

Delivered

D 5.3

Lemmatization Module for Early Modern Latin

 

5

 

ILC-PISA

 

IT

 

PUB

 

30

 

D 6.1

Tagged Early Modern Texts Integrated with Latin Morphological Analyzer in Integrated Reading Environment

 

 

6

PERSEUS/UK/ICSTM

 

 

IT

 

 

PUB

 

 

35

 

D

6.2

Integration of Early Modern Texts from Partners in Reading Environment

 

6

PERSEUS/UK/ICSTM

 

 

IT

 

 

PUB

 

 

35

 

 

D 7.1

 

 

Report on Kick-off Meeting

And Consortium Agreement

 

 

7

 

ICSTM/UMKC

 

 

 

Report

 

PUB

 

4

 

Delivered

 

D 7.2

 

 

 

Bi-Monthly Reports

(Revised to Quarterly

Reports, June 2003)

 

7

ICSTM/UMKC

 

Report

 

PUB

 

2,4,8,10,

15, 21

 

Delivered

 

D 7.3

 

 

Bi-Yearly Periodic Progress Reports

 

 

7

ICSTM/

UMKC

 

Report

 

PUB

 

6, 12, 18

 

Delivered

 

D 7.4

 

 

Bi-Yearly Financial Cost Statement

 

 

7

 

ICSTM

 

Report

 

PUB

 

6, 12, 18

 

Delivered

 

D 7.5

 

 

Yearly Consortium Report

 

7

ICSTM/

UMKC

 

Report

 

PUB

 

12

 

Delivered

 

 

 

D 8.1

 

 

Bi- Yearly Integration Meeting Report

 

 

8

ICSTM/

UMKC

PERS

 

Report

 

PUB

 

6, 12, 18

 

Delivered

 

 

D 8.2

 

Version Control System for Core Digital Library

 

8

 

ICSTM/ UMKC

PERS

 

Report

 

PUB

 

6, 12, 18

 

Delivered

 

 

 

D 8.3

 

 

Indexing and Input/Output Formats for Integration and Interoperablity

 

8

ICSTM/

UMKC

PERS

 

Report

 

PUB

 

6, 12, 18

 

Delivered

 

D 9.1

 

 

CHLT Website

 

9

ICSTM/

UMKC

 

 

WEB

 

PUB

 

3, 6, 12, 18

 

Delivered

 

D 9.2

 

Dissemination and Use

Strategy

 

9

ICSTM/

UMKC

 

Report

 

PUB

 

6

 

Delivered

 

 

 

D 9.3

 

Dissemination and Exploitation Reports

 

 

9

 

ICSTM/ UMKC

 

 

Report

 

PUB

 

12

 

Delivered

 

*Int.     Internal circulation within project (and Commission Project Officer if requested)

  Rest.  Restricted circulation list (BBC as External User) and Commission PO only

  IST    Circulation within IST Programme participants

  FP5   Circulation within Framework Programme participants

  Pub.   Public document

 

 

Work Planned Next Reporting Period

 

 

WP1: The main concern will be seamless integration into the existing visualisation of the old Norse and Ancient Greek letter rendering modules.

 

WP2:   Continue work to refine the extraction of translation equivalents based

            on Chi2 scores; integration of this tool with the visualisation tool

developed by WP1, and preparation of the Word Tool for final release.

 

WP3:   Continue to develop Digital Library System to provide metadata

sharing and integration between two digital libraries.

 

WP4:   Continue to expand Old Norse database; continue with refinement and integration of WP1 Visualisation Tool for Old Norse into the PERSEUS/CHLT Digital Library System.

           

WP5:   Implementation of LE management algorithms; software testing and

validation; software document.

 

WP6:   XML mark-up of 100-150 pages of new text per month.

 

WP7:   Maintain timely reporting schedule and ensure workpackage

developments and targets proceed according to schedule.

 

WP8:   Maintain integration of (i) code standards, (ii) data sharing

routines and (iii) metadata harvester for all CHLT workpackages, and integrate results into the Digital Library System.

 

WP9:   Continue to implement web-based dissemination of CHLT; publish

papers on the results of CHLT workpackages; attend conferences relevant to the work of CHLT; and organise conferences, meetings,

seminars for dissemination of results of CHLT.

 

<in terms of tasks, by partner 5/6 lines/WP, more if necessary>

Others

 

CHLT has been offered technical support by experts in digital library technology, and we wish to acknowledge their continued interest in and practical support of the aims of CHLT.

 

Dr. Carl Lagoze, Cornell University

 

Dr. Brian Fuchs, Senior Programmer, Archimedes Project, Max Planck, Berlin

 

Julia Flanders, Director, Scholarly Technology Group, Brown University

 

Professor Susan Hockey, University College London.

 

Dr Peter Walters, UKISHELP

 

Dr Hamish Cunnigham, University of Sheffield

 

Mr Michael Hawkins, Imperial College London

 

Martin Doerr, Heraklion, Forth, Crete

 

Names or address change, responsibility reassignment or other.

 

No changes.