ENEOLI Wikibase

From DHWiki

ENEOLI Wikibase is a Wikibase instance hosted on Wikibase Cloud. It contains bibliographical and terminological data compiled and used in the European Network on Lexical Innovation, a COST research network funded by the EU (ENEOLI, CA 22126, 2024-2027).

Entry in Wikibase World

The entity https://wikibase.world/entity/Q420 describes ENEOLI Wikibase.

Background

As stated on the ENEOLI website, "the European Network on Lexical Innovation connects researchers, educators, students, translators, journalists, language policymakers, and other interested parties in the field of neology. The network aims to refine terminology within the field, showcase advanced methodologies, perform comparative studies on lexical innovation across languages, and provide targeted training for various stakeholders."

Main Functions

ENEOLI Wikibase is a shared workbench for the network members, who collaboratively edit four types of entities that are related to each other through the ENEOLI Wikibase Ontology: conceptual entries, lexical entries, bibliographical records, and researchers. It also provides access to the project results through SPARQL queries provided on different pages.

Content

ENEOLI Wikibase content can be classified as follows:

NeoCorpus bibliographical records

Research articles from the field of neology are collected collaboratively in a Zotero group, where they are provided with structured publication metadata. When the publication metadata are transferred to Wikibase using the ZotWb tool, they are processed as follows:

  • Author and editor name literals are linked to entities describing persons. The main approach is to reconcile name literals against Wikidata entities describing persons: matching Wikidata entities are cloned on ENEOLI Wikibase, while ENEOLI person entities without a Wikidata match are candidates for transfer to Wikidata.
  • The article language is expressed through an entity describing the language.
  • Article DOIs are used to check whether an article is already described on Wikidata.
  • Journal ISSNs are looked up on Wikidata.
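The DOI and ISSN lookups in the list above can be sketched as SPARQL queries against Wikidata. This is a minimal illustration, not the ZotWb implementation: the helper names `normalize_doi` and `lookup_query` are hypothetical, and the sketch only builds the query strings. It relies on the Wikidata properties P356 (DOI) and P236 (ISSN), on the `wdt:` prefix that the Wikidata Query Service predefines, and on the assumption that DOIs are conventionally stored in upper case on Wikidata.

```python
# Wikidata property IDs: P356 is "DOI", P236 is "ISSN".
WIKIDATA_DOI = "P356"
WIKIDATA_ISSN = "P236"

# Resolver prefixes that are commonly found in front of a DOI.
DOI_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def normalize_doi(raw: str) -> str:
    """Strip a resolver prefix and upper-case the DOI.

    Upper-casing is an assumption about how Wikidata stores DOIs.
    """
    doi = raw.strip()
    for prefix in DOI_PREFIXES:
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    return doi.upper()


def lookup_query(property_id: str, value: str) -> str:
    """Build a SPARQL query for items whose property equals the given string."""
    # Escape backslashes and double quotes so the value is a valid literal.
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'SELECT ?item WHERE {{ ?item wdt:{property_id} "{escaped}" . }} LIMIT 5'


# Example values are made up for illustration.
doi_query = lookup_query(WIKIDATA_DOI, normalize_doi("https://doi.org/10.1000/abc.def"))
issn_query = lookup_query(WIKIDATA_ISSN, "1234-5678")
```

An empty result for the DOI query would suggest that the article is not yet described on Wikidata; a non-empty result gives candidate items to link to.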

NeoVoc concept and lexical entries

The research literature on neology in French builds on a comparatively long and rich tradition. Monolexical and polylexical terms used in these articles are extracted, represented on ENEOLI Wikibase as concept entries, and provided with a short definition. ENEOLI members provide equivalent terms and short definitions in their working languages and add these to the concept entries; draft equivalents and short definitions for these entries re-use labels and descriptions found on Wikidata for matching concepts. The next step is the creation of lexical entries for all validated term equivalents in all working languages.

Article term indexation

The full texts belonging to NeoCorpus bibliographical records are enriched with NeoVoc term occurrence information. The workflow can be described as follows:

  • The article text body is isolated, either using the TEI XML representation of the full text obtained using GROBID, or manually.
  • The text body is then lemmatized using spaCy (for those languages for which spaCy provides lemmatization).
  • NeoVoc terms of the article language are found in the text, and the information about term mentions is attached to the article metadata.
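The matching step of this workflow can be sketched as follows. This is a simplified illustration under stated assumptions, not the project's actual code: the function `find_term_mentions` and the term identifiers are hypothetical, and the input is assumed to be a list of lemmas such as the `token.lemma_` values that a spaCy pipeline yields. Terms are given as lemma sequences, so that polylexical terms can be matched as well.

```python
from typing import Dict, List, Tuple


def find_term_mentions(
    lemmas: List[str], terms: Dict[str, Tuple[str, ...]]
) -> List[Tuple[str, int]]:
    """Return (term_id, token_index) for each occurrence of a term's lemma sequence."""
    lowered = [lemma.lower() for lemma in lemmas]
    mentions = []
    for term_id, term_lemmas in terms.items():
        needle = [part.lower() for part in term_lemmas]
        width = len(needle)
        if width == 0:
            continue  # skip terms without any lemma
        # Slide a window of the term's length over the lemmatized text.
        for start in range(len(lowered) - width + 1):
            if lowered[start:start + width] == needle:
                mentions.append((term_id, start))
    # Order mentions by position in the text, then by term identifier.
    return sorted(mentions, key=lambda mention: (mention[1], mention[0]))


# Lemmas as a lemmatizer might return them for
# "Blends and lexical innovations are studied."
lemmas = ["blend", "and", "lexical", "innovation", "be", "study", "."]
# Hypothetical term identifiers mapped to their lemma sequences.
terms = {
    "term-blend": ("blend",),
    "term-lexical-innovation": ("lexical", "innovation"),
}
mentions = find_term_mentions(lemmas, terms)
```

Matching on lemmas rather than surface forms means that inflected occurrences (here the plurals "Blends" and "innovations") are counted as mentions of the term.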

This allows advanced queries, e.g. for histories of term usage, terms used by authors, terms used together in the same text, etc.
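As an illustration of one such query, the sketch below counts how often two terms are used together in the same text. It assumes that the term mention data has already been retrieved as a mapping from article identifiers to sets of term identifiers; the function name and all identifiers are hypothetical, and on ENEOLI Wikibase itself the same question would be asked through SPARQL.

```python
from collections import Counter
from itertools import combinations
from typing import Dict, Set


def cooccurrence_counts(article_terms: Dict[str, Set[str]]) -> Counter:
    """Count, for each pair of terms, the number of articles that mention both."""
    counts: Counter = Counter()
    for term_ids in article_terms.values():
        # Sort so that each pair has one canonical order, e.g. ("blend", "neologism").
        for pair in combinations(sorted(term_ids), 2):
            counts[pair] += 1
    return counts


# Made-up example data: two articles and the terms found in them.
article_terms = {
    "article-1": {"blend", "neologism"},
    "article-2": {"blend", "neologism", "loanword"},
}
counts = cooccurrence_counts(article_terms)
```

Grouping the same data by publication year or by author instead of by term pair would give term usage histories and per-author term profiles.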

NeoVoc lexical innovation process descriptors

NeoVoc contains concepts describing lexical innovation processes (e.g. blending), and concepts describing the results of these processes (e.g. blend).

Lexical entries describing neologisms

Neologisms are collected and described as lexical entries. Part of each description is a link to the corresponding neologism type.