4 February: fyi -- lexicology, NL
Index of February 2009 | Index of year: 2009 | Full index
computational linguist for the development of Named Entity Processing tools
as part of the IMPACT project.
IMPACT, started in January 2008, is a European research project in the
field of informatics for the humanities, in which 15 national libraries and
research institutes from Europe, Israel and Russia are working together.
The main objective of IMPACT is to significantly improve access to
historical text. To achieve this, the following tasks will be dealt with:
1. Current OCR software is not suitable for mass digitization of historical
documents. Within the project, OCR software will be developed that
significantly improves the accuracy of state-of-the-art systems, enabling
for the first time reliable full-text mass digitization of historical
documents.
2. Information in historical documents is not easily accessed by modern
users because of the historical language barrier. Within the project,
historical lexica and linguistic processing tools will be developed to
enable enriched indexing, thus providing access to historical material with
contemporary query techniques.
In order to be effective, the lexica will also have to contain Named Entity
data, and tools will have to be developed for NE recognition and NE
classification for historical language material.
Tasks:
The computational linguist will be responsible for the development of a
toolbox for NE lexicon building and NE lexicon deployment to tackle
historical language material to be used for the improvement of OCR of
historical texts and for better retrieval on historical text material. The
work will imply the implementation as well as the design of relevant
algorithms.
Profile:
- relevant background in computational linguistics, computer science or
applied mathematics (master level, preferably Ph.D. level)
- sufficient knowledge of and experience with the development and
implementation of NLP algorithms, preferably in the field of NE processing
- sufficient experience in developing complex software systems; preferably
proficiency in C, C++ and/or Java
- knowledge of Dutch is required, preferably also knowledge of historical
Dutch language material
Offer:
An INL contract for two years. The salary is dependent on relevant
experience and skills, with a maximum of ? 4326, - gross per month (scale
11) on a full-time basis. In addition you will be entitled to 42 days
holiday per year plus holiday pay, according to the cao-Onderzoekinstellingen.
Application Deadline: 22-Feb-2009
Mailing Address for Applications:
Dr. Jeannine Beeken
Postbus 9515
Leiden 2300 RA
Netherlands
Email Address for Applications: secretariaat@inl.nl
Contact Information:
Katrien Depuydt
Email: katrien.depuydt@inl.nl
Phone: 0031715272479
Fax: 0031715272115
Index of February 2009 | Index of year: 2009 | Full index