GlossaryEntry | responsible | state | since | description | references | lang | master |
---|
Acronym | | | | An acronym is a word or name formed from the initial components of a longer name or phrase. Acronyms are usually formed from the initial letters of words, as in NATO (the North Atlantic Treaty Organization) and EU (the European Union), but sometimes use syllables, as in Benelux (short for Belgium, the Netherlands, and Luxembourg). They can also be a mixture, as in radar (RAdio Detection And Ranging). |
| en | |
CERMINE | https://github.com/CeON/CERMINE | | | CERMINE is a Java library and a web service (cermine.ceon.pl) for extracting metadata and content from PDF files containing academic publications. CERMINE is written in Java |
| en | |
ChatGPT | | | 2022-11-30 | is an artificial intelligence chatbot developed by OpenAI and launched in November 2022. It is built on top of OpenAI's GPT-3.5 and GPT-4 families of large language models (LLMs) and has been fine-tuned (an approach to transfer learning) using both supervised and reinforcement learning techniques. | https://www.wikidata.org/wiki/Q115564437 | en | |
Entity Linking | | | | Entity Linking |
| en | |
FAIR | https://www.force11.org/ | | 2016 |
- Findable
- Accessible
- Interoperable
- Reusable
|
| en | FAIR |
GROBID | | | 2009 | GROBID (or Grobid, but not GroBid nor GroBiD) means GeneRation Of BIbliographic Data. |
| en | GROBID |
Link rot | wikipedia | draft | 2024-06-26 | Link rot (also called link death, link decay, link breaking, or reference rot) is the phenomenon of hyperlinks tending over time to cease to point to their originally targeted file, web page, or server due to that resource being relocated to a new address or becoming permanently unavailable. A link that no longer points to its target, often called a broken, dead, or orphaned link, is a specific form of dangling pointer. | https://en.wikipedia.org/wiki/Link_rot | en | |
NER | | | | Named entity recognition |
| en | NER |
PID | | | | A persistent identifier (PI or PID) is a long-lasting reference to a document, file, web page, or other object. |
https://en.wikipedia.org/wiki/Persistent_identifier
https://www.pidforum.org/t/persistent-identifier-pid-definition/1502/11 A PID is a digital identifier that is globally unique, persistent, machine resolvable, has an associated metadata schema, identifies an entity (e.g., individual researcher, publication, award, digital research output, organization) in perpetuity, and is frequently used to disambiguate between entities.
https://www.cms.hu-berlin.de/de/dl/dataman/teilen/pid/persistente-identifikation
https://www.project-freya.eu/en/blogs/blogs/towards-persistent-identification-of-conferences
Ackermann2018
Birukou2019
zenodo3653755
| en | PID |
QEC | | draft | | Query Execution Context | | en | |
Query Rot | | | | Queries might get invalid over time to changes in the QEC | | | |
RDFa | http://www.w3.org/TR/rdfa-primer/ | | 2004 | Rich Structured Data Markup for Web Documents |
| en | de |
Retrieval Augmented Generation | | | | Retrieval augmented generation (RAG) is a technique that grants generative artificial intelligence models information retrieval capabilities. It modifies interactions with a large language model (LLM) so that the model responds to user queries with reference to a specified set of documents, using this information to augment information drawn from its own vast, static training data. This allows LLMs to use domain-specific and/or updated information. Use cases include providing chatbot access to internal company data, or giving factual information only from an authoritative source. | https://en.wikipedia.org/wiki/Retrieval-augmented_generation | en | |
Semantic Web | | | 1999 | A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities | Berners-Lee 2001Berners-Lee 1994DBLP:journals/ws/KhaliliA13
| en | Semantic Web |
TEI | https://tei-c.org/about/mission/ | | 1987 | The mission of the Text Encoding Initiative is to develop and maintain a set of high-quality guidelines for the encoding of humanities texts, and to support their use by a wide community of projects, institutions, and individuals. |
https://tei-c.org/about/mission/
https://en.wikipedia.org/wiki/Text_Encoding_Initiative
| en | TEI |
Wikidata | https://www.wikidata.org/ | | 2012/10/29 | Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation.[2] It is a common source of open data that Wikimedia projects such as Wikipedia,[3][4] and anyone else, can use under the CC0 public domain license. Wikidata is a wiki powered by the software MediaWiki, and is also powered by the set of knowledge graph MediaWiki extensions known as Wikibase. |
| en | Wikidata |