Workdocumentation 2023-12-22

From BITPlan cr Wiki
Revision as of 07:47, 22 December 2023 by Wf (talk | contribs) (→‎Prompt)
Jump to navigation Jump to search

Event Signature Extraction task

Based on Event Signature definition in https://doi.org/10.5334/dsj-2022-011

Persistent Identification for Conferences
Data Science Journal
2022-04-05 | Journal article
DOI: 10.5334/dsj-2022-011
Contributors: Julian Franken; Aliaksandr Birukou; Kai Eckert; Wolfgang Fahl; Christian Hauschke; Christoph Lange

Prompt

I'd like to extract event signatures from a couple of hundred home page texts of scientific events - what would be a prompt for you to get the information in structured format such as yaml or json ? Here is the definition of the event signature: Acronym: A short name for the conference often consisting of 3 to 8 upper case
letters trying to be unique but actually often being ambiguous. For instance, ISWC
may refer to the International Semantic Web Conference or to the International
Symposium on Wearable Computing.
Frequency: Annual, biennial, triennial – most events have an annual frequency, and
this is mostly not stated explicitly (not stated explicitly in this example).
Event reach: Target reach of the conference such as ‘International’, ‘European’, ‘East
Asian’ (International).
Event type: Such as Conference, Workshop, Symposium (Conference).
Year: A two or four-digit reference to the year in which the event took place – not to
be confused with the year of publication of the proceedings, which might be different
(2019).
Ordinal: Often used to enumerate the conference series instances (18th).
Date: Start date and end date or date range of the conference (October 26–30).
Location: Description of the location of the conference often consisting of country,
region, and city – sometimes with details about the exact venue. (Auckland, New
Zealand).
Title: The title often contains scope, type, and subject of the conference (International
Semantic Web Conference).
Subject: Description of what the conference is about often prefixed with ‘on’
(Semantic Web).
Delimiters: A variety of syntactic delimiters such as blanks, commas, colons, brackets
are used depending on the citation style

and an example of a homepage text:
QUATIC'1994
    1st International Conference on the Quality of Information and Communications Technology
    Hotel Penta
    27 May, 1994
    Lisbon, Portugal
    Toggle navigation
    PROCEEDINGS |
    DBLP |
    GOOGLE CITATIONS |
    QUATIC CONFERENCE SERIES
    QUATIC'1994
    Committees
    Program
    Conference Venue
    Sponsors
    QUATIC Conference Series

Answer

To extract event signatures from home page texts of scientific events in a structured format like YAML or JSON, you need to prompt the AI to identify and structure specific pieces of information based on the definition of the event signature you provided. The prompt should guide the AI to look for acronyms, frequencies, event reach, types, years, ordinals, dates, locations, titles, and subjects in the text. Here's how you could formulate such a prompt:

I have texts from home pages of scientific events. I need to extract structured information about each event in a clear, concise format. Here are the details I need for each event signature:

- Acronym: The short name of the conference, often in uppercase.
- Frequency: How often the event occurs, like annual or biennial.
- Event reach: The geographical or demographic reach of the event, like International or European.
- Event type: The format of the event, such as Conference, Workshop, or Symposium.
- Year: The year in which the event takes place.
- Ordinal: The instance number of the event, like 18th or 1st.
- Date: The start and end date or date range of the event.
- Location: The country, region, and city of the event, and sometimes specific venue details.
- Title: The full title of the event, often indicating the scope and subject.
- Subject: The main topic or focus of the event.

Given a homepage text of an event, please identify and extract this information in a structured format (either YAML or JSON). Here's an example of a homepage text:

"QUATIC'1994
1st International Conference on the Quality of Information and Communications Technology
Hotel Penta
27 May, 1994
Lisbon, Portugal
Toggle navigation
PROCEEDINGS |
DBLP |
GOOGLE CITATIONS |
QUATIC CONFERENCE SERIES
QUATIC'1994
Committees
Program
Conference Venue
Sponsors
QUATIC Conference Series"

Based on this text, please extract the event signature information and present it in a structured format.

Example Signature

Based on https://ceur-ws.org/Vol-2531/ pointing to https://ase.in.tum.de/seuh20/

Prompt

I'd like to know which of the following event signature elements:
- Acronym: The short name of the conference, often in uppercase.
- Frequency: How often the event occurs, like annual or biennial.
- Event reach: The geographical or demographic reach of the event, like International or European.
- Event type: The format of the event, such as Conference, Workshop, or Symposium.
- Year: The year in which the event takes place.
- Ordinal: The instance number of the event, like 18th or 1st.
- Date: The start and end date or date range of the event.
- Location: The country, region, and city of the event, and sometimes specific venue details.
- Title: The full title of the event, often indicating the scope and subject.
- Subject: The main topic or focus of the event.

are available in the following homepage text:
  #seuh20
    Software Engineering im Unterricht der Hochschulen
    Skip to content
    Start
    Programm
    Keynote
    Registrierung
    Call for Paper
    Organisation
    Mailingliste
    SE 20 Tagung
    SEUH Geschichte
    Tagungsband
    Software Engineering im Unterricht der Hochschulen
    Die SEUH ist seit vielen Jahren das Forum im deutschsprachigen Raum, auf dem Lehrende aus Universitäten, Hochschulen für angewandte Wissenschaften sowie dualen Hochschulen ihre Erfolge, Misserfolge und Erfahrungen in der Software Engineering Ausbildung vorstellen, diskutieren und gemeinsam die Qualität der Lehre verbessern. Der Workshop bietet viel Raum für Diskussionen, sowohl während der Sitzungen als auch in den Pausen. Kurze Vorträge mit anschließenden Diskussionsphasen fördern den Austausch der Teilnehmer. Dies haben die Teilnehmer in der Vergangenheit ausgiebig in lebhaften und konstruktiven Gesprächen genutzt. Viele Lehrende haben von der SEUH entscheidende Impulse für ihre Arbeit erhalten.
    Der Workshop findet zusammen mit der Tagung Software Engineering (SE) der Gesellschaft für Informatik (GI) am 26. und 27. Februar 2020 in Innsbruck, Österreich statt. Tagungsort ist das Hotel Grauer Bär in der Innsbrucker Innenstadt.
    Wir suchen Originalbeiträge, die sich konzeptionell oder empirisch mit der Ausbildung im Software Engineering befassen. Im Fokus stehen neue Erkenntnisse und Erfahrungen in der Vermittlung von Entwicklungsmethoden. Werkzeuge zur Verbesserung der Lehre sind ebenso wollkommen, genauso wie neue Ideen zur Lösung wohlbekannter Probleme in der Ausbildung.
    Weitere Details sind im Call for Paper zu finden. Einreichungsfrist ist der 04.11.2019 (siehe auch wichtige Termine rechts).
    Kontakt
    Stephan Krusche, Technische Universität Münchenkrusche@in.tum.de
    Stefan Wagner, Universität Stuttgartstefan.wagner@iste.uni-stuttgart.de
    Wichtige Termine
    Einreichungsfrist
    04.11.2019 (verlängert)
    Benachrichtigung
    08.12.2019
    Finale Version
    08.01.2020
    SEUH Workshop
    26.-27.02.2020
    Einreichung
    EasyChair
    Follow @seuh20
    Tweets by seuh20
    #seuh20
    Proudly powered by WordPress.

Prompt

provide the event signature elements:
- Acronym: The short name of the conference, often in uppercase.
- Frequency: How often the event occurs, like annual or biennial.
- Event reach: The geographical or demographic reach of the event, like International or European.
- Event type: The format of the event, such as Conference, Workshop, or Symposium.
- Year: The year in which the event takes place.
- Ordinal: The instance number of the event, like 18th or 1st.
- Date: The start and end date or date range of the event.
- Location: The country, region, and city of the event, and sometimes specific venue details.
- Title: The full title of the event, often indicating the scope and subject.
- Subject: The main topic or focus of the event.

in YAML Format.
use lowercase/underscore for the element names and leave out elements hat are not found. Use ISO date format for dates. Give the location as country/region and city using iso codes
by extracting from the following homepage text:

Answer

Based on the provided homepage text, here's the information available for the event signature elements you mentioned:

    Acronym: "SEUH" (found in the text as "#seuh20").
    Frequency: The frequency of the event is not explicitly mentioned in the text.
    Event reach: The geographical or demographic reach of the event is not explicitly mentioned in the text.
    Event type: The event type is mentioned as a "Workshop" and "Tagung" (Conference) in the text.
    Year: The year of the event is mentioned as "26.-27.02.2020" (February 26-27, 2020).
    Ordinal: The instance number of the event is not explicitly mentioned in the text.
    Date: The start and end date of the event are mentioned as "26.-27.02.2020" (February 26-27, 2020).
    Location: The location of the event is mentioned as "Innsbruck, Österreich" (Innsbruck, Austria) at the "Hotel Grauer Bär in der Innsbrucker Innenstadt."
    Title: The full title of the event is "Software Engineering im Unterricht der Hochschulen."
    Subject: The main topic or focus of the event is software engineering education and the exchange of experiences among educators in the German-speaking region.

Please note that some elements like "Frequency," "Event reach," and "Ordinal" are not explicitly mentioned in the provided text, so they may not be applicable to this specific event.