ENEOLI Wikibase
ENEOLI Wikibase is a Wikibase instance hosted on Wikibase Cloud. It contains bibliographical and terminological data compiled and used by the European Network on Lexical Innovation, a COST research network funded by the EU (ENEOLI, CA 22126, 2024-2027).
Entry in Wikibase World
The entity https://wikibase.world/entity/Q420 describes ENEOLI Wikibase.
Background
As stated on the ENEOLI website, "the European Network on Lexical Innovation connects researchers, educators, students, translators, journalists, language policymakers, and other interested parties in the field of neology. The network aims to refine terminology within the field, showcase advanced methodologies, perform comparative studies on lexical innovation across languages, and provide targeted training for various stakeholders."
Main Functions
ENEOLI Wikibase is a collaborative workbench for the network members, who edit entities of four types: conceptual entries, lexical entries, bibliographical records, and researchers. These entity types are related to each other through the ENEOLI Wikibase Ontology. At the same time, ENEOLI Wikibase provides access to the project results through SPARQL queries offered on different pages.
Content
ENEOLI Wikibase content can be classified as follows:
NeoCorpus bibliographical records
Research articles from the field of Neology are collected collaboratively in a Zotero group and provided with structured publication metadata. When the publication metadata is transferred to Wikibase using the ZotWb tool, it is processed in the following way:
- Author and editor name literals are linked to entities describing persons. The main approach here is to reconcile author name literals against Wikidata entities describing persons; matching Wikidata entities are cloned on ENEOLI Wikibase, unmatched ENEOLI person entities are candidates for transfer to Wikidata.
- The article language is expressed through an entity describing the language.
- Article DOIs are used to check whether an article is already described on Wikidata.
- Journal ISSNs are looked up on Wikidata.
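The DOI and ISSN lookups in the list above can be illustrated with a minimal sketch. It is not the ZotWb implementation; it only assumes that the lookup is done with SPARQL against the public Wikidata query service, using the Wikidata properties P356 (DOI) and P236 (ISSN). The function names, the `LIMIT 5`, and the User-Agent string are hypothetical choices made for this example.

```python
import json
import urllib.parse
import urllib.request

# Public Wikidata SPARQL endpoint.
WIKIDATA_SPARQL = "https://query.wikidata.org/sparql"


def build_lookup_query(property_id: str, value: str) -> str:
    """Return a SPARQL query for items that carry `value` in `property_id`."""
    # Escape backslashes and double quotes so the value is a safe string literal.
    literal = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'SELECT ?item WHERE {{ ?item wdt:{property_id} "{literal}" . }} LIMIT 5'


def doi_query(doi: str) -> str:
    # P356 is the Wikidata DOI property; DOIs are stored there in upper case.
    return build_lookup_query("P356", doi.strip().upper())


def issn_query(issn: str) -> str:
    # P236 is the Wikidata ISSN property.
    return build_lookup_query("P236", issn.strip())


def run_query(query: str) -> list[str]:
    """Send the query to Wikidata and return the URIs of matching items."""
    params = urllib.parse.urlencode({"query": query, "format": "json"})
    request = urllib.request.Request(
        WIKIDATA_SPARQL + "?" + params,
        headers={"User-Agent": "lookup-sketch/0.1 (illustrative example)"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        data = json.load(response)
    return [row["item"]["value"] for row in data["results"]["bindings"]]
```

An empty result from `run_query(doi_query(...))` would suggest that the article is not yet described on Wikidata; a non-empty result gives the candidate items to link to.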
NeoVoc concept and lexical entries
The research literature on Neology in French builds on a comparatively long and rich tradition. Monolexical and polylexical terms used in these articles are extracted, represented on ENEOLI Wikibase as concept entries, and provided with a short definition. Draft equivalents and short definitions for these concept entries are generated by re-using the labels and descriptions of matching concepts on Wikidata. ENEOLI members provide equivalent terms and short definitions in their working languages and add them to the concept entries. The next step is the creation of lexical entries for all validated term equivalents in all working languages.
Article term indexation
The full texts belonging to NeoCorpus bibliographical records are enriched with NeoVoc term occurrence information. The workflow can be described as follows:
- The article text body is isolated, either using the TEI XML representation of the full text obtained using GROBID, or manually.
- The text body is then lemmatized using spaCy (for those languages for which spaCy provides lemmatization).
- NeoVoc terms in the article language are located in the text, and the information about term mentions is attached to the article metadata.
This allows advanced queries, e.g. for histories of term usage, terms used by authors, terms used together in the same text, etc.
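The term-matching step and one of the queries mentioned above (terms used together in the same text) can be sketched as follows. This is an illustration, not the project's actual code: it assumes the text body has already been lemmatized (for example with spaCy) into a list of lower-cased lemmas, and that each NeoVoc term is available as a sequence of lemmas. The identifiers `T1`, `T2`, `A1`, `A2` and the sample data are hypothetical.

```python
from collections import Counter
from itertools import combinations
from typing import Dict, List, Tuple


def find_term_mentions(
    lemmas: List[str], terms: Dict[str, List[str]]
) -> List[Tuple[str, int]]:
    """Return (term_id, token_index) for each occurrence of a term's lemma sequence."""
    mentions = []
    for term_id, term_lemmas in terms.items():
        n = len(term_lemmas)
        if n == 0:
            continue
        # Slide a window of the term's length over the lemmatized text.
        for i in range(len(lemmas) - n + 1):
            if lemmas[i:i + n] == term_lemmas:
                mentions.append((term_id, i))
    # Order mentions by their position in the text.
    return sorted(mentions, key=lambda mention: mention[1])


def cooccurrence_counts(article_terms: Dict[str, List[str]]) -> Counter:
    """Count in how many articles each pair of terms occurs together."""
    counts: Counter = Counter()
    for term_ids in article_terms.values():
        # Each pair is counted once per article, regardless of frequency.
        for pair in combinations(sorted(set(term_ids)), 2):
            counts[pair] += 1
    return counts


# Hypothetical sample data: one monolexical and one polylexical term.
terms = {"T1": ["blend"], "T2": ["lexical", "innovation"]}
articles = {
    "A1": ["the", "blend", "be", "a", "lexical", "innovation"],
    "A2": ["a", "blend", "be", "not", "a", "loanword"],
}

mentions = {aid: find_term_mentions(lem, terms) for aid, lem in articles.items()}
term_ids = {aid: [tid for tid, _ in found] for aid, found in mentions.items()}
pairs = cooccurrence_counts(term_ids)
```

Matching on lemma sequences rather than surface forms lets inflected variants of a term count as mentions, which is why the lemmatization step precedes the lookup.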
NeoVoc lexical innovation process descriptors
NeoVoc contains concepts describing lexical innovation processes (e.g. blending), and concepts describing the results of these processes (e.g. blend).
Lexical entries describing neologisms
Neologisms are collected and described as lexical entries. Part of each description is a link to the corresponding neologism type.