Property:Task objective
Jump to navigation
Jump to search
Property | |
---|---|
name | objective |
label | objective |
type | Special:Types/Text |
index | 3 |
sortPos | |
primary key | False |
mandatory | False |
namespace | |
size | 400 |
uploadable | False |
default | |
inputType | textarea |
allowedValues | |
documentation | the objective of the task |
values from | |
showInGrid | False |
isLink | False |
allow nulls? | |
topic | Concept:Task |
This is a Property with type Special:Types/Text
T
The metadata of the CEUR-WS related entities as depicted in the figure should be made available as triples in a triplestore.
http://diagrams.bitplan.com/render/png/0x1a095a5a.png
The progress is tracked per Entity/Attribute with [https://docs.google.com/spreadsheets/d/1rDyzmiphqrnwugid9-y4B6nRUUBjxusHXvmxzkEeGKg/edit#gid=0 a spreadsheet] and with a [https://github.com/users/WolfgangFahl/projects/2 github project] +
Common questions related to the quality of a scientific workshop or conference include whether a researcher should submit a paper to it or accept an invitation
to its program committee, whether a publisher should publish its proceedings,
or whether a company should sponsor it [2]. Moreover, knowing the quality of
an event helps to assess the quality of the papers accepted there. In the 2014
Challenge, we had designed Task 1 to extract from selected CEUR-WS.org work-
shop proceedings volumes RDF that would enable the computation of certain
indicators for the workshops’ quality [10]. The second objective of this effort was
to bootstrap the publication of all CEUR-WS.org workshops – more than 1,400
at the time of this writing – as linked data. As discussed above in Section 2,
we reused the 2014 queries, with two exceptions. As only one of the three 2014
submissions had addressed the two Task 1 queries that required metadata ex-
traction from the PDF full text of the papers (cf. [7]), and as Task 2 focused
on full-text extraction anyway, we replaced these queries (Q1.19 and Q1.20) by
similar queries that only relied on information available from HTML sources. +
Task 2 was designed to test the ability to extract data from the full text of the
papers. It follows last year’s Task 2, which focused on extracting information
about citations. The rationale was that the network of citations of a paper –
including papers citing it or cited by that paper – is an important dimension to
assess its relevance and to contextualise it within a research area.
This year we included further contextual information. Scientific papers are
not isolated units. Factors that directly or indirectly contribute to the origin
and development of a paper include citations, the institutions the authors are
affiliated to, funding agencies, and the venue where a paper was presented. Participants
had to make such information explicit and exploit it to answer queries
providing a deeper understanding of the context in which papers were written.
The dataset’s format is another difference from 2014. Instead of XML sources,
we used PDF this year, taken from CEUR-WS.org. PDF is still the predominant
format for publishing scientific papers, despite being designed for printing. The
internal structure of a PDF paper does not correspond to the logical structure
of its content, rather to a sequence of layouting and formatting commands.
The challenge for participants was to recover the logical structure, to extract
contextual information, and to represent it as semantic assertions. +
Task 3 was newly designed to assess the ability to identify same entities acrossdifferent datasets of the same domain, thus establishing links between these
datasets. Participants had to make such links explicit and exploit them to answer comprehensive queries about events and persons. The CEUR-WS.org data
in itself provide incomplete information about conferences and persons. This
information can be complemented by interlinking the dataset with others to
broaden the context and to allow for more reliable conclusions about the quality
of scientific events and the qualification of researchers. +