Poetrylab tools and Poetry Ontology are the most remarkable achievements of Postdata project. Multiple activities have been carried out to achieve projct outcomes:
1. Analysis of the model of poetic repertories.
2. Analysis of a survey addressed to the final users of poetic resources in order to understand the data needs of the users of poetry databases.
3. Analysis of the graphical user interface on the Web of Documents of repertoires to retrieve the informational needs of specific poetic repertoires.
4. Analysis of poems from different traditions to create use cases applying the data model.
5. Identification of the properties of the data model that need to be defined with a controlled vocabulary.
6. Query of multiple databases looking for LOD vocabularies that could contain vocabulary terms that could be incorporated into the ontology.
7. Build of new approached to automated scansion, stanza detection and enjambment detection.
8. Train the model with a new developed corpus (PULPO). The resulting model, Alberti, was evaluated on the MLM’ metric aforementioned for English and Spanish.
As a result, we have developed three PoetryLab TOOLS:
- Rantanplan, on top of the industrial-strength NLP framework spaCy for speed.
- Stanza detection, a classical solution based on extracting the information needed and then composing a knowledge base curated by experts with the proper rules that identify the different stanza types.
- Postdata Jollyjumper, new tool that replaced ANJA, annotates enjambment and its type based on previous typologies. There are three broad categories (below) with some subcategories each:
1. Lexical enjambment.
2. Phrase-bounded enjambment.
3. Cross-clause enjambment.
We have obtained three intellectual properties based on them: Averell, Rantamplam and Poetry Lab API.
Regarding Postdata Poetry Ontology, it facilitates a set of concepts for describing poetic works (poems, poetic drama or plays written in verse and songs). It is the product of a homogenization effort that considers different literary traditions, periods, poetic genres, and authorship. Additionally, this will enable the comparison of the characteristics and data in this poetry and thus carry out invaluable research in Comparative Literature and Comparative Metrical Studies quantitatively. Two potential cases of use used to define:
-Bibliographic information and sources search and indexing:
- OntoPoetry Core module represents the abstract or conceptual side of the bibliographic information.
- OntoPoetry Transmission module represents the more tangible side of bibliographic information related to poetic works.
-Poetic information annotation and searching:
- OntoPoetry Poetic Analysis module, which represents different phenomena associated with metrics and prosody, including the textual elements or parts of a poem and the different metrical patterns that analyze those elements.
Finally, Postdata project has been actively working on different communication and dissemination activities during the project. Project members have been involved in the dissemination of project results though different activities as: >15 publications in scientific journals, 2 chapters in books, > 60 contributions in conference proceedings, organization of 14 workshops, participation in more than 15 workshops, etc. Furthermore, the Postdatda community has been very involved in different communication activities aimed at the general public, such as, radio interviews, publications at project website, publication of informative articles at general public magazines, etc.