DELIVERABLE SUBMISSION SHEET - D 7.2 (Month 15)
To: Jean Goederich (Project Officer)
EUROPEAN COMMISSION
DG INFSO E5
EUFO 3277
rue Alcide de Gasperi
L-2920 Luxembourg
From:
Project name: Cultural Heritage Language Technologies
Project acronym: CHLT
Project number: IST-2001-32745
Person: Dolores Iorizzo
Organisation: ICSTM
Date: 15 September, 2003
The following deliverable:
Deliverable name: Periodic Progress Report: Month 15
Deliverable number: D 7.2
is now complete.
Tick all that apply:
* It is available for your inspection.
A copy can be sent to you on request.
Relevant descriptive documents are attached.
2 bound, 1 unbound copies herewith (public deliverables).
2 copies herewith (other deliverables).
The deliverable is (tick one): X on paper / on WWW / an event / software / X other (Report attached below.)
For all paper deliverables, and other deliverables as appropriate:
Date: 15 September, 2003
Version: 1
Author: Iorizzo and Rydberg-Cox
No. of pages: This cover plus 12 pages
Status (tick one): Public / Restricted / *Internal
Commission use only:
Keywords:
Description:
Comments:
PROJECT PROGRESS REPORT
Project: CHLT-IST-2001-32745
Progress Report Number: 7
Period: 1 June - 31 August 2003
Author: Dolores Iorizzo
Organisation: Imperial College London
Address: Centre for the History of Science, Technology and Medicine, Sherfield Building - Room 445, Exhibition Road, London SW7 2AZ
Email: d.iorizzo@ic.ac.uk
Phone: +44 207-594-9355
1. Work planned during this period
2. Achievements of WP 1: Advanced Digital Library Applications
3. Achievements of WP 2: Computational Linguistics
4. Achievements of WP 3: Collaborative Infrastructure
5. Achievements of WP 4: Old Norse Morphological Analyzer
6. Achievements of WP 5: Neo-Latin Morphological Analyzer
7. Achievements of WP 6: Early Modern Latin Corpus
8. Achievements of WP 7: Management of Consortium
9. Achievements of WP 8: Integration
10. Achievements of Workpackage 9: Dissemination and exploitation
11. Technical options adopted
12. Meetings held
13. Assessment of interim results
13.1. Objectives set versus objectives attained, deviations
13.2. Issues
14. Check list of deliverables completed
15. Resources used for the first period
16. Work planned next reporting period
17. Others
18. Names or address change, responsibility reassignment or other
1. Work planned during this period
CHLT has been active for just over a year, and this report outlines the work of months 13-15, in the second year of the project. At the first annual review meeting CHLT was given permission to move from bi-monthly to quarterly reporting, in light of the striking imbalance in reporting requirements between the US and EC partners. Work in this period has concentrated on three phases. Phase 2: Tools Development was initiated in month 6 and has continued on schedule, as has Phase 3: Evaluation, which began in month 10. Phase 4: Integration has been active from the start of the project. Progress has been made in the following areas:
• Document Cluster Visualization Tool
• Word Profile Tool
• Facilities to Accept and Preserve Feedback
• Integration of Text Processing System
• Old Norse Morphological Analyzer
• Latin Morphological Analyzer
• Early Modern Latin Texts
• CHLT Website Development
• Project and Programming Integration
• Integration Meetings
• Dissemination
The following Workpackages were active during the period:
WP1 - Advanced Digital Library Applications
WP2 - Computational Linguistics
WP3 - Collaborative Infrastructure
WP4 - Old Norse Morphological Analyzer
WP5 - Neo-Latin Morphological Analyzer
WP6 - Test-bed Development
WP7 - Management of Consortium
WP8 - Integration
WP9 - Dissemination and Exploitation
2. Achievements of WP 1: Advanced Digital Library Applications
WP Leader: Stefan Rueger
Scheduled dates: 1 June - 31 August 2003
Actual dates: September 2003
Deliverables: No deliverables due this period.
• Effort concentrated on designing the interface for the Cluster and Visualization Tool. On schedule for delivery at the end of January 2004.
Estimated achievement: 42% as planned
3. Achievements of WP 2: Computational Linguistics
WP Leader: Jeff Rydberg-Cox
Scheduled dates: 1 June - 31 August 2003
Actual dates: September 2003
Deliverables: No deliverables due this period.
• Having completed D 2.1 and D 2.2 on schedule, we have turned to the development of the multi-lingual information retrieval tool. On schedule for completion of the extraction tool (D 2.3) in November 2003.
Estimated achievement: 42% as planned
4. Achievements of WP 3: Collaborative Infrastructure
WP Leader: Greg Crane
Scheduled dates: 1 June - 31 August 2003
Actual dates: September 2003
Deliverables: No deliverables due this period.
• Effort concentrated on developing naming conventions for DL objects and on formulating maintenance procedures for the CHLT partners and the Perseus Digital Library System.
Estimated achievement: 42% as planned
5. Achievements of WP 4: Old Norse Morphological Analyzer
WP Leader: Tim Tangherlini
Scheduled dates: 1 June - 31 August 2003
Actual dates: September 2003
Deliverables: No deliverables due this period.
• Effort concentrated on expanding the Old Norse database to include exceptional-irregular word forms.
Estimated achievement: 42% as planned
6. Achievements of WP 5: Neo-Latin Morphological Analyzer
WP Leader: Andrea Bozzi
Scheduled dates: 1 June - 31 August 2003
Actual dates: September 2003
Deliverables: No deliverables due this period.
• Work continued on adding gender codes to account for ambiguous morphological categories, testing the results and modifying the source code. On schedule to complete the Word Segmentation System (D 5.2) by the end of January 2004.
Estimated achievement: 42% as planned
7. Achievements of WP 6: Early Modern Latin Corpus
WP Leader: Ross Scaife
Scheduled dates: 1 June - 31 August 2003
Actual dates: September 2003
Deliverables: No deliverables due this period.
• Work continued on the XML markup of Renaissance Latin texts to Text Encoding Initiative (TEI) compliant standards. These texts will be used as a test bed for IT tools and applications developed for the CHLT Digital Library System. The workpackage is on schedule for D 6.1 in month 35.
Estimated achievement: 42% as planned
8. Achievements of WP 7: Management of Consortium
WP Leaders: Dolores Iorizzo and Jeff Rydberg-Cox
Scheduled dates: 1 June - 31 August 2003
Actual dates: September 2003
Deliverables: D 7.2 (month 15)
• Effort concentrated on ensuring that CHLT partners respect the importance of timely reporting to the EC and of collaboration among consortium members.
Estimated achievement: 42% as planned
9. Achievements of WP 8: Integration
WP Leaders: Greg Crane, Jeff Rydberg-Cox, Stefan Rueger, Dolores Iorizzo
Scheduled dates: 1 June - 31 August 2003
Actual dates: September 2003
Deliverables: No deliverables due this period.
• Integration has concentrated on implementing code standards for plug-in modules for the core digital library system shared by all CHLT partners.
Estimated achievement: 42% as planned
10. Achievements of Workpackage 9: Dissemination and exploitation
WP Leaders: Greg Crane, Jeff Rydberg-Cox, Dolores Iorizzo
Scheduled dates: 1 June - 31 August 2003
Actual dates: September 2003
Deliverables: No deliverables due this period.
• Effort concentrated on identifying and linking to other digital library projects that share the aims of CHLT and may therefore benefit from the dissemination of our web-based technology.
Estimated achievement: 42% as planned
11. Technical options adopted
WP1: Considering use of GUI code that may work for loading the applet with variable parameters.
WP2: Further development of DTDs for disambiguation of ambiguous Greek and Latin forms.
WP3: Further development of abstract interfaces to morphological databases using SOAP.
WP4: Testing out different web interfaces for the Old Norse Morphological Analyser.
WP5: Modification of Gender Codes for Word Segmentation System.
WP6: Further DTDs implemented for text elements and attributes.
12. Meetings held
No CHLT meetings held in this period.
13. Assessment of interim results
13.1. Objectives set versus objectives attained, deviations
WP1: Begin development of the interface for the Document Cluster and Visualisation Tool. Work progresses on schedule. No deviations.
WP2: Develop multi-lingual information retrieval facilities for the extraction tool. Work progresses on schedule. No deviations.
WP3: Develop naming conventions for DL objects and create a maintenance procedure for quality control purposes. Work progresses on schedule. No deviations.
WP4: Expand the Old Norse database to include irregular word forms. Work progresses on schedule. No deviations.
WP5: Add gender codes to ambiguous morphological categories, test the results and modify the source code accordingly. Work progresses on schedule. No deviations.
WP6: Mark up Latin texts in TEI-conformant XML and deposit them in the CVS repository. Work progresses on schedule. No deviations.
WP7: Continue to co-ordinate the efforts of US and EC partners to ensure full collaboration and completion of the technical work of the workpackages. No deviations.
WP8: Integrate code standards among all CHLT partners. Work progresses on schedule. No deviations.
WP9: Develop links to other DL projects through web-based dissemination. Objectives are on schedule. No deviations.
13.2. Issues
Issues description (issues or problems that might affect achievements, delay activities, deliverables or milestones) | Action items (corrective action envisaged by the project to overcome the issue, including the expected impact in terms of delays, quality and quantity of work)
None | None
14. Check list of deliverables completed
Del. no. | Deliverable name | WP no. | Lead participant | Del. type | Security* | Delivery (proj. month) | Status
D 1.1 | Prototype Document Cluster Visualization Tool | 1 | ICSTM | IT | PUB | 10 | Delivered
D 2.1 | Word Profile Tool | 2 | UMKC/CAM | IT | PUB | 12 | Delivered
D 2.2 | Facilities to Accept and Preserve Feedback | 2 | UMKC/CAM | IT | PUB | 12 | Delivered
D 3.1 | Text Processing System | 3 | PERSEUS | IT | PUB | 1 | Delivered
D 3.2 | General Data Provider Routine for Metadata Sharing | 3 | PERSEUS | IT | PUB | 10 | Delivered
D 3.3 | Metadata Harvester | 3 | PERSEUS | IT | PUB | 12 | Delivered
D 4.1 | Report on Old Norse Morphological Analyzer | 4 | UCLA/KU | IT/Report | PUB | 12 | Delivered
D 4.2 | Report on Electronic Editions of Old Norse Texts | 4 | UCLA/KU | IT/Report | PUB | 12 | Delivered
D 5.1 | Report on Neo-Latin Morphological Analyzer | 5 | ILC-PISA | IT/Report | PUB | 10 | Delivered
D 7.1 | Report on Kick-off Meeting and Consortium Agreement | 7 | ICSTM/UMKC | Report | PUB | 4 | Delivered
D 7.2 | Bi-Monthly Reports (Revised to Quarterly Reports, June 2003) | 7 | ICSTM/UMKC | Report | PUB | 2, 4, 8, 10, 15 | Delivered
D 7.3 | Bi-Yearly Periodic Progress Reports | 7 | ICSTM/UMKC | Report | PUB | 6, 12 | Delivered
D 7.4 | Bi-Yearly Financial Cost Statement | 7 | ICSTM | Report | PUB | 6, 12 | Delivered
D 7.5 | Yearly Consortium Report | 7 | ICSTM/UMKC | Report | PUB | 12 | Delivered
D 8.1 | Bi-Yearly Integration Meeting Report | 8 | ICSTM/UMKC/PERS | Report | PUB | 6, 12 | Delivered
D 8.2 | Version Control System for Core Digital Library | 8 | ICSTM/UMKC/PERS | Report | PUB | 6, 12 | Delivered
D 8.3 | Indexing and Input/Output Formats for Integration and Interoperability | 8 | ICSTM/UMKC/PERS | Report | PUB | 6, 12 | Delivered
D 9.1 | CHLT Website | 9 | ICSTM/UMKC | WEB | PUB | 3, 6, 12 | Delivered
D 9.2 | Dissemination and Use Strategy | 9 | ICSTM/UMKC | Report | PUB | 6 | Delivered
D 9.3 | Dissemination and Exploitation Reports | 9 | ICSTM/UMKC | Report | PUB | 12 | Delivered
*Int.: Internal circulation within project (and Commission Project Officer if requested)
Rest.: Restricted circulation list (BBC as External User) and Commission PO only
IST: Circulation within IST Programme participants
FP5: Circulation within Framework Programme participants
Pub.: Public document
15. Resources used for the first period
Resources used for three months (report period 1 June - 31 August 2003)
Man-month allocation by workpackage:
Partner | Code | WP01 | WP02 | WP03 | WP04 | WP05 | WP06 | WP07 | WP08 and 09 | Total
ICSTM | P01 | 1.5 | | 0.05 | | | 0.2 | 0.6 | 0.3 + 0.25 | 2.9
UCAM-CLAS | P02 | | 1.5 | | | | | | | 1.5
ILC | P03 | | | | | 3 | | | | 3
KU | P04 | | | | 1.5 | | | | | 1.5
Totals | | | | | | | | | | 8.9
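The totals in the table above can be cross-checked with a short script. This is only an illustrative sketch: the figures are transcribed from the table, the split of the combined "WP08 and 09" cell into 0.3 (WP08) and 0.25 (WP09) assumes the two values are listed in that order, and the names `allocation` and `partner_totals` are hypothetical.

```python
# Man-months per partner and workpackage, transcribed from the table above
# (report period 1 June - 31 August 2003).
# Assumption: the "0.3 + 0.25" cell is WP08 followed by WP09.
allocation = {
    "ICSTM": {"WP01": 1.5, "WP03": 0.05, "WP06": 0.2, "WP07": 0.6,
              "WP08": 0.3, "WP09": 0.25},
    "UCAM-CLAS": {"WP02": 1.5},
    "ILC": {"WP05": 3.0},
    "KU": {"WP04": 1.5},
}


def partner_totals(table):
    """Sum man-months per partner, rounded to two decimals to hide float noise."""
    return {partner: round(sum(wps.values()), 2) for partner, wps in table.items()}


totals = partner_totals(allocation)
grand_total = round(sum(totals.values()), 2)

for partner, total in totals.items():
    print(f"{partner}: {total}")
print(f"Total: {grand_total}")
```

The per-partner sums (2.9, 1.5, 3 and 1.5) and the overall figure of 8.9 man-months agree with the Total column reported in the table.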
Compared with the original resources allocation in TA:
Planned resource allocation for three months (1 September - 30 November 2003)
Man-month allocation by workpackage:
Partner | Code | WP01 | WP02 | WP03 | WP04 | WP05 | WP06 | WP07 | WP08 and 09 | Total
ICSTM | P01 | 1.5 | | 0.05 | | | 0.2 | 0.6 | 0.3 + 0.25 | 2.9
UCAM-CLAS | P02 | | 1.5 | | | | | | | 1.5
ILC | P03 | | | | | 3 | | | | 3
KU | P04 | | | | 1.5 | | | | | 1.5
Totals | | | | | | | | | | 8.9
16. Work planned next reporting period
WP1: Continue design of the interface for the Cluster and Visualisation Tool.
WP2: Continue to work on the multi-lingual information retrieval tool.
WP3: Continue work on identifying linkable reference materials for the DLS.
WP4: Continue to expand the database of exceptional-irregular forms in Old Norse.
WP5: Continue to add gender codes to ambiguous morphological categories, testing them and making changes to the source code.
WP6: XML mark-up of 100-150 pages of new text per month.
WP7: Maintain a timely reporting schedule and ensure that workpackage developments and targets proceed according to schedule.
WP8: Maintain integration of code standards and data sharing routines.
WP9: Continue to implement web-based dissemination of CHLT.
17. Others
CHLT has been offered technical support by respected experts in digital library technology, and we wish to acknowledge their continued interest in and practical support of the aims of CHLT.
Dr. Carl Lagoze, Cornell University
Dr. Brian Fuchs, Senior Programmer, Archimedes Project, Max Planck, Berlin
Julia Flanders, Director, Scholarly Technology Group, Brown University
Professor Susan Hockey, University College London
Dr. Peter Walters, UKISHELP
Dr. Hamish Cunningham, University of Sheffield
Mr. Michael Hawkins, Imperial College London
Martin Doerr, FORTH, Heraklion, Crete
18. Names or address change, responsibility reassignment or other
No changes.
DELIVERABLE SUBMISSION SHEET - D 7.3 (Month 18)
To: Jean Goederich (Project Officer)
EUROPEAN COMMISSION
DG INFSO E5
EUFO 3277
rue Alcide de Gasperi
L-2920 Luxembourg
From:
Project name: Cultural Heritage Language Technologies
Project acronym: CHLT
Project number: IST-2001-32745
Person: Dolores Iorizzo
Organisation: ICSTM
Date: 28 January, 2004
The following deliverable:
Deliverable name: Periodic Progress Report: Month 18
Deliverable number: D 7.3
is now complete.
Tick all that apply:
* It is available for your inspection.
A copy can be sent to you on request.
Relevant descriptive documents are attached.
2 bound, 1 unbound copies herewith (public deliverables).
2 copies herewith (other deliverables).
The deliverable is (tick one): on paper / on WWW (www.CHLT.org) / an event / software / X other (Report attached below.)
For all paper deliverables, and other deliverables as appropriate:
Date: 28 January, 2004
Version: 1
Author: Iorizzo and Rydberg-Cox
No. of pages: This cover plus 13 pages
Status (tick one): Public / Restricted / *Internal
Commission use only:
Keywords:
Description:
Comments:
PROJECT PROGRESS REPORT
Project: CHLT-IST-2001-32745
Progress Report Number: 8
Period: 1 June - 30 November 2003
Author: Dolores Iorizzo
Organisation: Imperial College London
Address: Centre for the History of Science, Technology and Medicine, Sherfield Building - Room 445, Exhibition Road, London SW7 2AZ
Email: d.iorizzo@ic.ac.uk
Phone: +44 207-594-9355
1. Work planned during this period
2. Achievements of WP 1: Advanced Digital Library Applications
3. Achievements of WP 2: Computational Linguistics
4. Achievements of WP 3: Collaborative Infrastructure
5. Achievements of WP 4: Old Norse Morphological Analyzer
6. Achievements of WP 5: Neo-Latin Morphological Analyzer
7. Achievements of WP 6: Early Modern Latin Corpus
8. Achievements of WP 7: Management of Consortium
9. Achievements of WP 8: Integration
10. Achievements of Workpackage 9: Dissemination and exploitation
11. Technical options adopted
12. Meetings held
13. Assessment of interim results
13.1. Objectives set versus objectives attained, deviations
13.2. Issues
14. Check list of deliverables completed
15. Resources used for the first period
16. Work planned next reporting period
17. Others
18. Names or address change, responsibility reassignment or other
1. Work planned during this period
CHLT has now been active for 18 months, and this report outlines the work of the third six-month period of the project. Three cycles of work have been active in this period, as outlined in the original description of work (p. 17). Phase 1: Fundamentals was completed on schedule in the first year and has provided a sound foundation for the later phases. Phase 2: Tools Development was initiated in month 6 and has continued on schedule, as has Phase 3: Evaluation, which began in month 10. Phase 4: Integration has been active from the start of the project. Progress has been made in the following areas:
• Document Cluster Visualization Tool
• Word Profile Tool
• Facilities to Accept and Preserve Feedback
• Integration of Text Processing System
• Old Norse Morphological Analyzer
• Latin Morphological Analyzer
• Early Modern Latin Texts
• CHLT Website Development
• Project and Programming Integration
• Integration Meetings
• Dissemination
The following Workpackages were active during the period:
WP1 - Advanced Digital Library Applications
WP2 - Computational Linguistics
WP3 - Collaborative Infrastructure
WP4 - Old Norse Morphological Analyzer
WP5 - Neo-Latin Morphological Analyzer
WP6 - Test-bed Development
WP7 - Management of Consortium
WP8 - Integration
WP9 - Dissemination and Exploitation
2. Achievements of WP 1: Advanced Digital Library Applications
WP Leader: Stefan Rueger
Scheduled dates: 1 June - 30 November 2003
Actual dates: December 2003
Deliverables: No deliverables due this period.
• The main work of this period has focused on the design of the interface for the Cluster and Visualization Tool, and on testing it in three areas: (i) indexing and keyword extraction, (ii) document clustering and (iii) the visualization of search results. On schedule for delivery of D 1.2 at the end of January 2004.
We have also made notable progress ahead of schedule in several areas: (iv) integrating visualization with the Perseus Digital Library System, (v) indexing Ancient Greek documents, (vi) rendering documents in Java, (vii) loading the visualization applet so that query, language and collection information is available on start-up, (viii) linking our visualization with the rich analysis tools developed by Perseus (WP3) and (ix) writing an interface for the WP4 Old Norse Morphological Analyser.
Estimated achievement: 50% as planned
3. Achievements of WP 2: Computational Linguistics
WP Leader: Jeff Rydberg-Cox
Scheduled dates: 1 June - 30 November 2003
Actual dates: December 2003
Deliverables: D 2.3: Tool to Extract Corpus Based Thesauri from Corpus
• The on-schedule completion in month 12 of (i) D 2.1 (Word Profile Tool) and (ii) D 2.2 (Facilities to Accept, Preserve and Integrate User Feedback) has allowed us to develop our extraction tool ahead of schedule, with good results. Work has also begun on the theoretical models on which the multi-lingual retrieval facilities will be based, and on problems relating to selectional preferences and sub-categorization frames.
Estimated achievement: 50% as planned
4. Achievements of WP 3: Collaborative Infrastructure
WP Leader: Greg Crane
Scheduled dates: 1 June - 30 November 2003
Actual dates: December 2003
Deliverables:
D 3.4: Report on Naming Conventions
D 3.5: Maintenance Procedures for Naming Conventions
• Work in this period has been diverse and productive. We have developed and delivered (i) naming conventions (D 3.4) and (ii) maintenance procedures for implementing quality control among CHLT partners and the Perseus Digital Library System (D 3.5).
We are also ahead of schedule in (iii) building a clear and well-documented API to accelerate the development of advanced digital library applications, (iv) developing abstract interfaces to our morphological databases, and (v) experimenting with new ways to use OAI protocols to share large volumes of data received from reference works.
Estimated achievement: 50% as planned
5. Achievements of WP 4: Old Norse Morphological Analyzer
WP Leader: Tim Tangherlini
Scheduled dates: 1 June - 30 November 2003
Actual dates: December 2003
Deliverables: No deliverables due this period.
• The main work of this period has focused on (i) the expansion of the Old Norse database to include exceptional-irregular word forms, and (ii) the development of an integrated viewing environment for normalized and diplomatic texts.
Estimated achievement: 50% as planned
6. Achievements of WP 5: Neo-Latin Morphological Analyzer
WP Leader: Andrea Bozzi
Scheduled dates: 1 June - 30 November 2003
Actual dates: December 2003
Deliverables: No deliverables due this period.
• The main work of this period has focused on (i) the development of a word segmentation system, (ii) adding gender codes to the LES ambiguous morphological categories, (iii) implementing new LE rules, (iv) coding exceptional and invariable forms, and (v) testing the lemmatization results and modifying the source code. We also (vi) continue to document implementation functions, data structures and algorithms. Our work is on schedule to complete the Word Segmentation System (D 5.2) by the end of January 2004.
Estimated achievement: 50% as planned
7. Achievements of WP 6: Early Modern Latin Corpus
WP Leader: Ross Scaife
Scheduled dates: 1 June - 30 November 2003
Actual dates: December 2003
Deliverables: No deliverables due this period.
• Progress continues according to schedule on the XML markup of Renaissance Latin texts to Text Encoding Initiative (TEI) compliant standards. These texts will be used as a test bed for IT tools and applications developed for the CHLT Digital Library System. The workpackage is on schedule for D 6.1 in month 35.
Estimated achievement: 50% as planned
8. Achievements of WP 7: Management of Consortium
WP Leaders: Dolores Iorizzo and Jeff Rydberg-Cox
Scheduled dates: 1 June - 30 November 2003
Actual dates: December 2003
Deliverables: D 7.2, D 7.3, D 7.4
• D 7.2: Month 15 Periodic Progress Report
• D 7.3: Month 18 Periodic Progress Report
• D 7.4: Month 18 Cost Statement
• We have made progress in getting the message across that close collaboration between workpackages and timely reporting are an integral part of the work of CHLT. As a result, all CHLT partners have made an increased effort to submit reports and deliverables on schedule.
Estimated achievement: 50% as planned
9. Achievements of WP 8: Integration
WP Leaders: Greg Crane, Jeff Rydberg-Cox, Stefan Rueger, Dolores Iorizzo
Scheduled dates: 1 June - 30 November 2003
Actual dates: December 2003
Deliverables:
D 8.1: Integration Report
D 8.2: Version Control Maintenance
D 8.3: Code Standards for Compatibility of Plug-in Modules
• Integration in the third six-month period has concentrated on programming integration, testing data sharing routines, maintaining the version control system, and implementing code standards for plug-in modules for the core digital library system shared by all CHLT partners.
Estimated achievement: 50% as planned
10. Achievements of Workpackage 9: Dissemination and exploitation
WP Leaders: Greg Crane, Jeff Rydberg-Cox, Dolores Iorizzo
Scheduled dates: 1 June - 30 November 2003
Actual dates: December 2003
Deliverables: D 9.3: Annual Dissemination and Exploitation Report
• The CHLT.org website has been updated to show the results of the workpackages, ensuring maximum exposure in the digital library community for the technical tools, applications and infrastructure systems being developed by CHLT. Links added to other digital library projects extend this web-based dissemination.
Estimated achievement: 50% as planned
11. Technical options adopted
WP1: Re-writing of GUI code necessary for loading the applet with variable parameters and to address the problem of initialising the panels with search results.
WP2: Further development of DTDs for disambiguation of ambiguous Greek and Latin forms.
WP3: Further development of abstract interfaces to morphological databases using SOAP and XML-RPC.
WP4: Further development of the web interface for the Old Norse Morphological Analyser to integrate work with the WP1 Cluster and Visualisation Tool.
WP5: Modification of Gender Codes for Word Segmentation System.
WP6: Further DTDs implemented for text elements and attributes.
12. Meetings held
• CHLT Collaborators Meeting, November 2003, Kansas City, Missouri
• CHLT Integration Meeting, November 2003, Kansas City, Missouri
13. Assessment of interim results
13.1. Objectives set versus objectives attained, deviations
WP1: The set objective was to begin development and design of the interface for the Document Cluster and Visualisation Tool, which acts as a text search engine. Work progresses on or ahead of schedule (see Section 1). No deviations.
WP2: The set objectives were to complete work on the Word Profile Tool (D 2.1), to develop Facilities to Accept, Preserve and Integrate User Feedback (D 2.2), and to lay the groundwork for multi-lingual information retrieval facilities, selectional preferences and sub-categorisation frames. Work progresses on schedule. No deviations.
WP3: The set objective was to report on naming conventions for digital library objects. We also maintained the OAI-compliant data provider and metadata harvester, distributed procedures for naming conventions to all CHLT partners, and followed up with quality control maintenance of the DL system. Work progresses on or ahead of schedule. No deviations.
WP4: The set objectives were to refine the Old Norse Morphological Analyser, to continue the mark-up of Old Norse texts in diplomatic and normalised versions, and to test the morphological analyser with the marked-up text. We were also able to start work, ahead of schedule, on integrating the Old Norse Morphological Analyser with the Cluster and Visualisation Tool (WP1). Work progresses on or ahead of schedule. No deviations.
WP5: The set objective was to begin development of a Word Segmentation System for the Latin morphological analyser. Work progresses on schedule. No deviations.
WP6: The set objective was to mark up Latin texts in TEI-conformant XML and commit them to the CVS repository. Work progresses on schedule. No deviations.
WP7: The set objective was to co-ordinate the efforts of US and EC partners to ensure full collaboration and completion of the technical work of the workpackages. No deviations.
WP8: The set objectives were to integrate tools and applications developed by the workpackages, to maintain the version control system for the core digital library, and to maintain code standards among all CHLT partners. Work progresses on schedule. No deviations.
WP9: The set objectives were to further develop the CHLT website as a vehicle of dissemination, to publish papers on the results of CHLT workpackages, and to give papers at international conferences on the work of CHLT. Objectives are on schedule. No deviations.
13.2. Issues
Issues description (issues or problems that might affect achievements, delay activities, deliverables or milestones) | Action items (corrective action envisaged by the project to overcome the issue, including the expected impact in terms of delays, quality and quantity of work)
None | None
14. Check list of deliverables completed
Del. no. | Deliverable name | WP no. | Lead participant | Del. type | Security* | Delivery (proj. month) | Status
D 1.1 | Prototype Document Cluster Visualization Tool | 1 | ICSTM | IT | PUB | 10 | Delivered
D 2.1 | Word Profile Tool | 2 | UMKC/CAM | IT | PUB | 12 | Delivered
D 2.2 | Facilities to Accept and Preserve Feedback | 2 | UMKC/CAM | IT | PUB | 12 | Delivered
D 3.1 | Text Processing System | 3 | PERSEUS | IT | PUB | 1 | Delivered
D 3.2 | General Data Provider Routine for Metadata Sharing | 3 | PERSEUS | IT | PUB | 10 | Delivered
D 3.3 | Metadata Harvester | 3 | PERSEUS | IT | PUB | 12 | Delivered
D 4.1 | Report on Old Norse Morphological Analyzer | 4 | UCLA/KU | IT/Report | PUB | 12 | Delivered
D 4.2 | Report on Electronic Editions of Old Norse Texts | 4 | UCLA/KU | IT/Report | PUB | 12 | Delivered
D 5.1 | Report on Neo-Latin Morphological Analyzer | 5 | ILC-PISA | IT/Report | PUB | 10 | Delivered
D 7.1 | Report on Kick-off Meeting and Consortium Agreement | 7 | ICSTM/UMKC | Report | PUB | 4 | Delivered
D 7.2 | Bi-Monthly Reports (Revised to Quarterly Reports, June 2003) | 7 | ICSTM/UMKC | Report | PUB | 2, 4, 8, 10, 15 | Delivered
D 7.3 | Bi-Yearly Periodic Progress Reports | 7 | ICSTM/UMKC | Report | PUB | 6, 12, 18 | Delivered
D 7.4 | Bi-Yearly Financial Cost Statement | 7 | ICSTM | Report | PUB | 6, 12, 18 | Delivered
D 7.5 | Yearly Consortium Report | 7 | ICSTM/UMKC | Report | PUB | 12 | Delivered
D 8.1 | Bi-Yearly Integration Meeting Report | 8 | ICSTM/UMKC/PERS | Report | PUB | 6, 12, 18 | Delivered
D 8.2 | Version Control System for Core Digital Library | 8 | ICSTM/UMKC/PERS | Report | PUB | 6, 12, 18 | Delivered
D 8.3 | Indexing and Input/Output Formats for Integration and Interoperability | 8 | ICSTM/UMKC/PERS | Report | PUB | 6, 12, 18 | Delivered
D 9.1 | CHLT Website | 9 | ICSTM/UMKC | WEB | PUB | 3, 6, 12, 18 | Delivered
D 9.2 | Dissemination and Use Strategy | 9 | ICSTM/UMKC | Report | PUB | 6 | Delivered
D 9.3 | Dissemination and Exploitation Reports | 9 | ICSTM/UMKC | Report | PUB | 12 | Delivered
*Int.: Internal circulation within project (and Commission Project Officer if requested)
Rest.: Restricted circulation list (BBC as External User) and Commission PO only
IST: Circulation within IST Programme participants
FP5: Circulation within Framework Programme participants
Pub.: Public document
Resources used for third 6 months | Report period June 2003 - Nov 2003 |
Partner | Man-month allocation by workpackage |
Name | Code | WP01 | WP02 | WP03 | WP04 | WP05 | WP06 | WP07 | WP08 and 09 | Total |
ICSTM | P01 | 3 | | .1 | | | .4 | 1.2 | .6 + .5 | 5.8 |
UCAM-CLAS | P02 | | 3 | | | | | | | 3 |
ILC | P03 | | | | | 6 | | | | 6 |
KU | P04 | | | | 3 | | | | | 3 |
Totals | | | | | | | | | | 17.8 |
Compared with the original resources allocation in TA:
Planned resource allocation for fourth six months | Dec 2003 / May 2004 |
Partner | Man-month allocation by workpackage |
Name | Code | WP01 | WP02 | WP03 | WP04 | WP05 | WP06 | WP07 | WP08 | Total |
ICSTM | P01 | 3 | | .1 | | | .4 | 1.2 | .6 + .5 | 5.8 |
UCAM-CLAS | P02 | | 3 | | | | | | | 3 |
ILC | P03 | | | | | 6 | | | | 6 |
KU | P04 | | | | 3 | | | | | 3 |
Totals | | | | | | | | | | 17.8 |
WP1: Finalise work on the design of the interface for the Cluster and Visualisation Tool. A major concern will be to endow the applet with greater flexibility to deal gracefully with different languages and sets of document collections. We will also be concerned with the integration of the Greek rendering module in the visualisation and display programme.
WP2: Continue to work on the multi-lingual information retrieval tool and make progress on document architecture issues. We also hope to finalise the citation scheme map and begin integration into the word study tool.
WP3: Continue to work on identifying and exposing linkable reference materials for the DLS.
WP4: Continue to expand the database of exceptional-irregular forms in Old Norse; continue to work with the WP1 and WP3 teams to refine and integrate the Cluster and Visualisation Tool for Old Norse into the CHLT Digital Library System.
WP5: Continue to (i) add gender codes to the LES ambiguous morphological categories, (ii) implement new LE rules, (iii) code FE (exceptional forms) and I (invariable forms), (iv) test lemmatisation results and feed them back into the system to refine the programme, (v) modify source code as a result of the refinements, and (vi) document implementation functions, data structures and algorithms.
WP6: XML mark-up of 100-150 pages of new text per month.
WP7: Maintain timely reporting schedule and ensure workpackage developments and targets proceed according to schedule.
WP8: Maintain integration of (i) code standards, (ii) data sharing routines and (iii) metadata harvester for all CHLT workpackages, and integrate results into the Digital Library System.
WP9: Continue to implement web-based dissemination of CHLT; publish papers on the results of CHLT workpackages; attend conferences relevant to the work of CHLT; and organise conferences, meetings and seminars for dissemination of results of CHLT.
CHLT has been offered technical support by respected experts in digital library technology, and we wish to acknowledge their continued interest in and practical support of the aims of CHLT.
Dr. Carl Lagoze, Cornell University
Dr. Brian Fuchs, Senior Programmer, Archimedes Project, Max Planck, Berlin
Julia Flanders, Director, Scholarly Technology Group, Brown University
Professor Susan Hockey, University College London
Dr Peter Walters, UKISHELP
Dr Hamish Cunningham, University of Sheffield
Mr Michael Hawkins, Imperial College London
No changes.
DELIVERABLE SUBMISSION SHEET - D 7.2 (Month 21)
To: Jean Goederich (Project Officer)
EUROPEAN COMMISSION
DG INFSO E5
EUFO 3277
rue Alcide de Gasperi
L-2920 Luxembourg
From:
Project name: Cultural Heritage Language Technologies
Project acronym: CHLT
Project number: IST-2001-32745
Person: Dolores Iorizzo
Organisation: ICSTM
Date: 30 March, 2004
The following deliverable:
Deliverable name: Periodic Progress Report: Month 21
Deliverable number: D 7.2
is now complete.
Tick all that apply:
* It is available for your inspection
A copy can be sent to you on request.
Relevant descriptive documents are attached.
2 bound, 1 unbound copies herewith (public deliverables).
2 copies herewith (other deliverables).
The deliverable is (tick one): on paper / on WWW (www.CHLT.org) / an event / software / X other (Report attached below.)
For all paper deliverables, and other deliverables as appropriate:
Date: 30 March, 2004
Version: 1
Author: Iorizzo and Rydberg-Cox
No. of pages: This cover plus 14 pages
Status: Public / Restricted / *Internal (tick one)
Commission use only
Keywords:
Description:
Comments:
PROJECT PROGRESS REPORT
Project: CHLT-IST-2001-32745
Progress Report Number: 9
Period: 1 December 2003 - 29 February 2004
Author: Dolores Iorizzo
Organisation: Imperial College London
Address: Centre for the History of Science, Technology and Medicine, Sherfield Building - Room 445, Exhibition Road, London SW7 2AZ
Email: d.iorizzo@ic.ac.uk Phone: + 44 207-594-9355
1. Work planned during this period
2. Achievements of WP 1: Advanced Digital Library Applications
3. Achievements of WP 2: Computational Linguistics
4. Achievements of WP 3: Collaborative Infrastructure
5. Achievements of WP 4: Old Norse Morphological Analyzer
6. Achievements of WP 5: Neo-Latin Morphological Analyzer
7. Achievements of WP 6: Early Modern Latin Corpus
8. Achievements of WP 7: Management of Consortium
9. Achievements of WP 8: Integration
10. Achievements of Workpackage 9: Dissemination and exploitation
11. Technical options adopted
12. Meetings held
13. Assessment of interim results
13.1. Objectives set versus objective attained, deviations
13.2. Issues
14. Check list of deliverables completed
15. Work planned next reporting period
16. Others
17. Names or address change, responsibility reassignment or other
CHLT has now been active for 21 months and this report outlines the work of the third quarter of the second year of the project. There have been three cycles of work active in this period. Phase 2: Tools Development was initiated in month 6 and has continued on schedule. Phase 3: Evaluation began in month 10 and is also on schedule. Phase 4: Integration has been active from the start of the project.
Progress has been made in the following areas:
• Document Cluster Visualization Tool
• Word Profile Tool
• Facilities to Accept and Preserve Feedback
• Integration of Text Processing System
• Old Norse Morphological Analyzer
• Latin Morphological Analyzer
• Early Modern Latin Texts
• CHLT Website Development
• Project and Programming Integration
• Integration Meetings
• Dissemination
The following Workpackages were active during the period:
WP1 - Advanced Digital Library Applications
WP2 - Computational Linguistics
WP3 - Collaborative Infrastructure
WP4 - Old Norse Morphological Analyzer
WP5 - Neo-Latin Morphological Analyzer
WP6 - Test-bed Development
WP7 - Management of Consortium
WP8 - Integration
WP9 - Dissemination and Exploitation
Achievements of WP 1: Advanced Digital Library Applications
WP Leader: Stefan Rueger
Scheduled dates: 1 December 2003 - 29 February 2004
Actual dates: 30 March 2004
Deliverables: D 1.2
• The main work of this period was to finalise the design of the interface for the Document Cluster and Visualization Tool; the interface has been tested and completed on schedule. Additional progress on this tool has been made ahead of schedule in two main areas:
(i) Modifying the Document Cluster and Visualization Tool to handle a multiplicity of distinct document collections; this is an essential requirement for making it widely applicable to texts that exist in more than one collection, such as Greek texts that have been translated into Latin.
(ii) The building of new collections. What used to be two tightly linked, although functionally quite distinct, indexing steps are now neatly separated, each with its own Java wrapper that provides a standard API to other programs. We also designed a graphical user interface that accesses these wrappers to index new collections.
Estimated achievement: 67 %, ahead of schedule
Achievements of WP 2: Computational Linguistics
Scheduled dates: 1 December 2003 - 29 February 2004
Actual dates: 30 March 2004
Deliverables: No deliverables due this period.
• The main work of this period has been on the development of the multi-lingual information retrieval tool, with special focus on:
(i) Methods for query expansion;
(ii) Methods for the extraction of translation equivalents from parallel and comparable corpora;
(iii) Refining our user interface to integrate our results with those of WP1;
(iv) Dissemination of results to JCDL, ECDL and the New England Journal of Classics.
Estimated achievement: 58 % as planned
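Item (ii) above can be illustrated with a small worked example. The sketch below scores candidate translation equivalents in a sentence-aligned parallel corpus with a chi-squared statistic over a 2x2 co-occurrence table, the kind of score named in the work plan for the next period. It is a generic illustration, not the WP2 implementation: the toy corpus, the whitespace tokenisation and the function names `chi_squared` and `rank_candidates` are all assumptions made for the example.

```python
def chi_squared(pairs, src_word, tgt_word):
    """Chi-squared association between two words over aligned sentence pairs.

    a: pairs containing both words      b: source word only
    c: target word only                 d: neither word
    """
    a = b = c = d = 0
    for src, tgt in pairs:
        in_src = src_word in src.split()
        in_tgt = tgt_word in tgt.split()
        if in_src and in_tgt:
            a += 1
        elif in_src:
            b += 1
        elif in_tgt:
            c += 1
        else:
            d += 1
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        # One of the words occurs in every pair or in none: no evidence.
        return 0.0
    return n * (a * d - b * c) ** 2 / denom


def rank_candidates(pairs, src_word):
    """Rank every target-side word by its chi-squared score with src_word."""
    targets = {word for _, tgt in pairs for word in tgt.split()}
    scored = [(chi_squared(pairs, src_word, t), t) for t in targets]
    # Highest score first; ties broken alphabetically for stable output.
    return sorted(scored, key=lambda item: (-item[0], item[1]))


# Hypothetical toy corpus: transliterated Greek aligned with Latin.
PAIRS = [
    ("logos kai ergon", "verbum et opus"),
    ("logos theou", "verbum dei"),
    ("ergon theou", "opus dei"),
    ("kai ergon", "et opus"),
]

if __name__ == "__main__":
    for score, word in rank_candidates(PAIRS, "logos"):
        print(f"{word}\t{score:.3f}")
```

On this toy data `verbum` ranks first for `logos`. Note that the statistic is also high for words that systematically avoid each other, so a practical system would typically keep only candidates whose co-occurrence is above chance (a*d > b*c).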
Achievements of WP 3: Collaborative Infrastructure
Scheduled dates: 1 December 2003 - 29 February 2004
Actual dates: 30 March 2004
Deliverables: No deliverables due this period.
• The main work of this period has been to create a robust Digital Library Infrastructure for data-sharing based on a system of unique name identifiers for organisations, collections and individual digital objects. The focus has been on integrating OAI-derived metadata into every aspect of the digital library system, from catalogue browsing to full text searching to implicit linking between documents.
Estimated achievement: 58 % as planned
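The three-level naming described above (organisation, collection, individual object) can be sketched as a small data structure. This is a minimal illustration only: the report does not give the actual CHLT naming scheme, so the `oai:<organisation>:<collection>/<item>` layout, the class name `ObjectName` and the sample values are assumptions, loosely modelled on the general shape of OAI identifiers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ObjectName:
    """Hypothetical unique name for a digital object (not the CHLT scheme)."""
    organisation: str  # e.g. a domain name owned by the holding institution
    collection: str    # collection within that organisation
    item: str          # individual digital object within the collection

    def to_identifier(self) -> str:
        # Assumed layout: oai:<organisation>:<collection>/<item>
        return f"oai:{self.organisation}:{self.collection}/{self.item}"

    @classmethod
    def parse(cls, identifier: str) -> "ObjectName":
        parts = identifier.split(":", 2)
        if len(parts) != 3 or parts[0] != "oai":
            raise ValueError(f"not an oai-style identifier: {identifier!r}")
        collection, sep, item = parts[2].partition("/")
        if not (parts[1] and sep and collection and item):
            raise ValueError(f"missing organisation, collection or item: {identifier!r}")
        return cls(parts[1], collection, item)


def group_by_collection(identifiers):
    """Index identifiers by (organisation, collection), e.g. for catalogue browsing."""
    index = {}
    for identifier in identifiers:
        name = ObjectName.parse(identifier)
        index.setdefault((name.organisation, name.collection), []).append(name.item)
    return index


if __name__ == "__main__":
    sample = ObjectName("example.org", "neo-latin", "text-0001")
    print(sample.to_identifier())
    print(group_by_collection([sample.to_identifier()]))
```

Keeping the organisation and collection inside the identifier means that a harvested record can be placed in a catalogue without a separate lookup, which is one plausible reason to build data-sharing on structured names.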
Achievements of WP 4: Old Norse Morphological Analyzer
WP Leader: Tim Tangherlini
Scheduled dates: 1 December 2003 - 29 February 2004
Actual dates: 30 March 2004
Deliverables: No deliverables due this period.
• The main work of this period has been the continuing expansion of the Old Norse database to include a greater number of exceptional-irregular word forms. We have also made considerable progress in developing an integrated viewing environment for normalized and diplomatic texts to integrate our work with the results of WP1's Visualization Tool.
Estimated achievement: 58 % as planned
Achievements of WP 5: Neo-Latin Morphological Analyzer
WP Leader: Andrea Bozzi
Scheduled dates: 1 December 2003 - 29 February 2004
Actual dates: 30 March 2004
Deliverables: D 5.2 Word Segmentation System for Latin Analyzer
• The main work of this period has focused on (i) implementing and testing new LE rules, (ii) completing the Word Segmentation System on schedule, and (iii) continuing to document implementation functions, data structures and algorithms.
Estimated achievement: 58 % as planned
Achievements of WP 6: Early Modern Latin Corpus
WP Leader: Ross Scaife
Scheduled dates: 1 December 2003 - 29 February 2004
Actual dates: 30 March 2004
Deliverables: No deliverables due this period.
• Progress continues according to schedule on the XML markup of Renaissance Latin texts, following standards compliant with the Text Encoding Initiative (TEI). These texts will be used as a test bed for IT tools and applications developed for the CHLT Digital Library System. The workpackage is on schedule for D 6.1 in month 35.
Estimated achievement: 58 % as planned
Achievements of WP 7: Management of Consortium
WP Leaders: Dolores Iorizzo and Jeff Rydberg-Cox
Scheduled dates: 1 December 2003 - 29 February 2004
Actual dates: 30 March 2004
Deliverables: D 7.2: Month 21 Periodic Progress Report
• We continue to reinforce the message that close collaboration between partners and timely reporting are integral to the work of CHLT. As a result there has been greater integration of results and a general boost in morale.
Estimated achievement: 58 % as planned
Achievements of WP 8: Integration
WP Leaders: Greg Crane, Jeff Rydberg-Cox, Stefan Rueger, Dolores Iorizzo
Scheduled dates: 1 December 2003 - 29 February 2004
Actual dates: 30 March 2004
Deliverables: No deliverables due this period.
• Integration continues to focus on programming integration, testing data sharing routines, maintenance of the version control system, and implementation of code standards for plug-in modules for the core digital library system shared by CHLT partners. The most active integration work this period has been between WP1 and WP2 (getting the Greek Word Tool to run with the Visualization Tool), between WP1 and WP4 (getting Old Norse to work with the Visualization Tool), and in getting the combined efforts of WP1, WP2, WP3 and WP4 to work within the PERSEUS/CHLT Digital Library System.
Estimated achievement: 58 % as planned
Achievements of WP 9: Dissemination and Exploitation
WP Leaders: Greg Crane, Jeff Rydberg-Cox, Dolores Iorizzo
Scheduled dates: 1 December 2003 - 29 February 2004
Actual dates: 30 March 2004
Deliverables: No deliverables due this period.
• Now that we have results from the project, we have been able to give demonstrations to other research groups. We have disseminated our work not only via the www.CHLT.org website but also through international presentations at the Mellon Foundation in New York, the NSF and the Library of Congress in Washington DC, and Cornell University in Ithaca, New York. We also hosted a conference at Imperial College London on 'Knowledge Sharing and the Semantic Web for Cultural Heritage Projects', which attracted a good number of people from across the Library, Archive, Museum and Computer Science communities; this has opened up a number of new directions for the future of CHLT.
Estimated achievement: 58 % as planned
WP1: Use of MG wrapper using Java API to run the mgbuild shell script; and use of CK wrapper with Java and C for extraction and indexing of candidate key words.
WP2: Refinement of Java API and indexing format for generic text display.
WP3: Continued use of SOAP, XML-RPC and OAI-derived metadata.
WP4: Refinement of Java API web interface for the Old Norse Morphological Analyser for integration with WP1 Visualisation Tool.
WP5: Refinement of LE Codes for Word Segmentation System.
WP6: Further DTDs implemented for text elements and attributes.
• Presentation of CHLT 'Work in Progress' to the Mellon Foundation, NYC, 10-11 December, 2003.
• Presentation of CHLT 'Work in Progress' at Cornell University, Ithaca, NY, 15-16 December, 2003.
• Conference on 'Knowledge Sharing and the Semantic Web for Cultural Heritage Projects', Imperial College London, 11 February, 2004.
WP1: Set objective was to finalise development and design of the interface for the Document Cluster and Visualisation Tool: D 1.2 delivered on schedule. Work progresses ahead of schedule. No deviations.
WP2: Set objective was to proceed with development of the multi-lingual information tool. Work progresses on schedule. No deviations.
WP3: Set objectives were to focus on building a robust architecture for the Digital Library System. Work progresses on or ahead of schedule. No deviations.
WP4: Set objectives were to continue to refine the Old Norse Morphological Analyser and mark up Old Norse texts in diplomatic and normalised versions, as well as to test the morphological analyser regularly with the marked-up text. We were also scheduled to continue work on integrating the Old Norse Morphological Analyzer with the WP1 Visualisation Tool. Work progresses on schedule. No deviations.
WP5: Set objective was to deliver the Word Segmentation System for the Latin morphological analyser. Work progresses on schedule. No deviations.
WP6: Set objective was to mark up Latin texts in TEI-conformant XML and commit them to the CVS repository. Work progresses on schedule. No deviations.
WP7: Set objectives were to co-ordinate the efforts of US and EC partners to ensure full collaboration and completion of the technical work of the workpackages. No deviations.
WP8: Set objectives were to integrate tools and applications developed by the workpackages, maintain the version control system for the core digital library, and maintain code standards among all CHLT partners. Work progresses on schedule. No deviations.
WP9: Set objectives were to further develop the CHLT website as a vehicle of dissemination; publish papers on the results of CHLT workpackages; and give papers at international conferences on the work of CHLT. Objectives are on schedule. No deviations.
Issues description (issues or problems that might affect achievements, delay activities, deliverables or milestones) | Action items (corrective action envisaged by the project to overcome the issue, including the expected impact in terms of delays, quality and quantity of work) |
None | None |
Del. no. | Deliverable name | WP no. | Lead participant | Del. Type | Security* | Delivery (proj. month) | Status |
D 1.1 | Prototype Document Cluster Visualization Tool | 1 | ICSTM | IT | PUB | 10 | Delivered |
D 1.2 | Document Cluster and Visualization Tool | 1 | ICSTM | IT | PUB | 20 | Delivered |
D 1.3 | Report on Cluster Visualization Tool | 1 | ICSTM | IT | PUB | 24 | |
D 2.1 | Word Profile Tool | 2 | UMKC/CAM | IT | PUB | 12 | Delivered |
D 2.2 | Facilities to Accept and Preserve Feedback | 2 | UMKC/CAM | IT | PUB | 12 | Delivered |
D 2.3 | Tool to Extract Corpus Based Thesauri from Corpus | 2 | UMKC/CAM | IT | PUB | 16 | Delivered |
D 2.4 | Multi-lingual Information Retrieval Tool | 2 | UMKC/CAM | IT | PUB | 24 | |
D 2.5 | Syntactic Parsing Toolbox | 2 | UMKC/CAM | IT | PUB | 35 | |
D 3.1 | Text Processing System | 3 | PERSEUS | IT | PUB | 1 | Delivered |
D 3.2 | General Data Provider Routine for Metadata Sharing | 3 | PERSEUS | IT | PUB | 10 | Delivered |
D 3.3 | Metadata Harvester | 3 | PERSEUS | IT | PUB | 12 | Delivered |
D 3.4 | Report on Naming Conventions | 3 | PERSEUS | IT | PUB | 18 | Delivered |
D 3.5 | Maintenance Procedures for Naming Conventions | 3 | PERSEUS | IT | PUB | 18 | Delivered |
D 3.6 | Prototype Metadata Sharing System Between Two Libraries | 3 | PERSEUS | Proto | PUB | 24 | |
D 3.7 | Use of Metadata Sharing System by CHLT Partners | 3 | PERSEUS | IT | PUB | 35 | |
D 4.1 | Report on Old Norse Morphological Analyzer | 4 | UCLA/KU | IT/Report | PUB | 12 | Delivered |
D 4.2 | Report on Electronic Editions of Old Norse Texts | 4 | UCLA/KU | IT/Report | PUB | 12 | Delivered |
D 4.3 | Report on Digitization of Old Norse MSS | 4 | KU | Report | PUB | 24 | |
D 4.4 | Images of Old Norse MS Linked to Tagged Text | 4 | UCLA/KU | IT | PUB | 30 | |
D 4.5 | Prototype Reading Environment for Old Norse Texts | 4 | UCLA/KU | Proto | PUB | 30 | |
D 4.6 | Complete Digitization of Old Norse MSS linked to Tagged Text | 4 | UCLA/KU | IT | PUB | 35 | |
D 4.7 | Integration of Old Norse Texts with Morphological Analyzer in Integrated Reading Environment | 4 | UCLA/KU | IT | PUB | | |
D 5.1 | Report on Neo-Latin Morphological Analyzer | 5 | ILC-PISA | IT/Report | PUB | 10 | Delivered |
D 5.2 | Word Segmentation System for Latin Analyzer | 5 | ILC-PISA | IT | PUB | 20 | Delivered |
D 5.3 | Lemmatization Module for Early Modern Latin | 5 | ILC-PISA | IT | PUB | 30 | |
D 6.1 | Tagged Early Modern Texts Integrated with Latin Morphological Analyzer in Integrated Reading Environment | 6 | PERSEUS/UK/ICSTM | IT | PUB | 35 | |
D 6.2 | Integration of Early Modern Texts from Partners in Reading Environment | 6 | PERSEUS/UK/ICSTM | IT | PUB | 35 | |
D 7.1 | Report on Kick-off Meeting and Consortium Agreement | 7 | ICSTM/UMKC | Report | PUB | 4 | Delivered |
D 7.2 | Bi-Monthly Reports (Revised to Quarterly Reports, June 2003) | 7 | ICSTM/UMKC | Report | PUB | 2, 4, 8, 10, 15, 21 | Delivered |
D 7.3 | Bi-Yearly Periodic Progress Reports | 7 | ICSTM/UMKC | Report | PUB | 6, 12, 18 | Delivered |
D 7.4 | Bi-Yearly Financial Cost Statement | 7 | ICSTM | Report | PUB | 6, 12, 18 | Delivered |
D 7.5 | Yearly Consortium Report | 7 | ICSTM/UMKC | Report | PUB | 12 | Delivered |
D 8.1 | Bi-Yearly Integration Meeting Report | 8 | ICSTM/UMKC PERS | Report | PUB | 6, 12, 18 | Delivered |
D 8.2 | Version Control System for Core Digital Library | 8 | ICSTM/UMKC PERS | Report | PUB | 6, 12, 18 | Delivered |
D 8.3 | Indexing and Input/Output Formats for Integration and Interoperability | 8 | ICSTM/UMKC PERS | Report | PUB | 6, 12, 18 | Delivered |
D 9.1 | CHLT Website | 9 | ICSTM/UMKC | WEB | PUB | 3, 6, 12, 18 | Delivered |
D 9.2 | Dissemination and Use Strategy | 9 | ICSTM/UMKC | Report | PUB | 6 | Delivered |
D 9.3 | Dissemination and Exploitation Reports | 9 | ICSTM/UMKC | Report | PUB | 12 | Delivered |
*Int.: Internal circulation within project (and Commission Project Officer if requested)
Rest.: Restricted circulation list (BBC as External User) and Commission PO only
IST: Circulation within IST Programme participants
FP5: Circulation within Framework Programme participants
Pub.: Public document
WP1: The main concern will be seamless integration of the Old Norse and Ancient Greek letter rendering modules into the existing visualisation.
WP2: Continue work to refine the extraction of translation equivalents based on chi-squared (Chi2) scores; integrate this tool with the visualisation tool developed by WP1; and prepare the Word Tool for final release.
WP3: Continue to develop the Digital Library System to provide metadata sharing and integration between two digital libraries.
WP4: Continue to expand the Old Norse database; continue with refinement and integration of the WP1 Visualisation Tool for Old Norse into the PERSEUS/CHLT Digital Library System.
WP5: Implementation of LE management algorithms; software testing and validation; software documentation.
WP6: XML mark-up of 100-150 pages of new text per month.
WP7: Maintain timely reporting schedule and ensure workpackage developments and targets proceed according to schedule.
WP8: Maintain integration of (i) code standards, (ii) data sharing routines and (iii) metadata harvester for all CHLT workpackages, and integrate results into the Digital Library System.
WP9: Continue to implement web-based dissemination of CHLT; publish papers on the results of CHLT workpackages; attend conferences relevant to the work of CHLT; and organise conferences, meetings and seminars for dissemination of results of CHLT.
CHLT has been offered technical support by experts in digital library technology, and we wish to acknowledge their continued interest in and practical support of the aims of CHLT.
Dr. Carl Lagoze, Cornell University
Dr. Brian Fuchs, Senior Programmer, Archimedes Project, Max Planck, Berlin
Julia Flanders, Director, Scholarly Technology Group, Brown University
Professor Susan Hockey, University College London
Dr Peter Walters, UKISHELP
Dr Hamish Cunningham, University of Sheffield
Mr Michael Hawkins, Imperial College London
Martin Doerr, FORTH, Heraklion, Crete
No changes.