Snapquery EKAW 2024 paper

Introduction

★★★★★ Parameterized queries
★★★☆☆ https://web.archive.org/web/20150512231123/http://answers.semanticweb.com:80/questions/12147/whats-the-best-way-to-parameterize-sparql-queries
★★★☆☆ https://jena.apache.org/documentation/query/parameterized-sparql-strings.html
★★★★☆ Scholia Jinja templates
★★★★☆ Technical debt and accidental complexity
★★★☆☆ How to deal with aspects that do not (usually) influence the execution of a SPARQL query, like whitespace, comments, capitalization and variable names?
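
A minimal sketch of one possible answer to the question above: reduce each query to a canonical token sequence before comparing or hashing it. This is an assumption-laden illustration, not snapquery's implementation. The names KEYWORDS, TOKEN, normalize and fingerprint are hypothetical, the keyword list is partial, and the tokenizer is deliberately rough (no triple-quoted strings, only ASCII variable names, and an expression such as `?a<?b&&?c>?d` written without spaces could be misread as an IRI). The output is meant only as a comparison key: renaming variables changes result column names, so the normalized text should not be executed in place of the original query.

```python
import hashlib
import re

# Partial keyword list, for illustration only. SPARQL keywords are
# case-insensitive, whereas the shorthand "a" (rdf:type) is not a keyword here.
KEYWORDS = {
    "select", "distinct", "where", "prefix", "base", "optional", "filter",
    "union", "minus", "graph", "service", "bind", "values", "as", "from",
    "group", "order", "by", "having", "limit", "offset",
    "ask", "construct", "describe",
}

# IRIs and string literals are tried before comments, so that a '#'
# inside them is not mistaken for the start of a comment.
TOKEN = re.compile(r"""
    (?P<iri><[^<>\s]*>)
  | (?P<string>"(?:[^"\\\n]|\\.)*"|'(?:[^'\\\n]|\\.)*')
  | (?P<comment>\#[^\n]*)
  | (?P<var>[?$][A-Za-z_][A-Za-z0-9_]*)
  | (?P<pname>(?:[A-Za-z][\w\-]*)?:[\w\-]*)
  | (?P<word>[A-Za-z_]\w*)
  | (?P<number>\d+(?:\.\d+)?)
  | (?P<space>\s+)
  | (?P<other>.)
""", re.VERBOSE | re.DOTALL)


def normalize(query: str) -> str:
    """Return a canonical token sequence for comparison purposes only."""
    names = {}  # original variable name -> canonical name (?v0, ?v1, ...)
    out = []
    for match in TOKEN.finditer(query):
        kind, text = match.lastgroup, match.group()
        if kind in ("comment", "space"):
            continue  # comments and whitespace carry no meaning here
        if kind == "var":
            # ?x and $x denote the same variable, so key on the bare name
            text = names.setdefault(text[1:], f"?v{len(names)}")
        elif kind == "word" and text.lower() in KEYWORDS:
            text = text.upper()
        out.append(text)
    return " ".join(out)


def fingerprint(query: str) -> str:
    """Hash of the normalized form, usable as a lookup key."""
    return hashlib.sha256(normalize(query).encode("utf-8")).hexdigest()


a = "SELECT ?item WHERE { ?item wdt:P31 wd:Q5 . } # humans"
b = "select ?h\nwhere {\n  ?h wdt:P31 wd:Q5 .\n}"
print(normalize(a) == normalize(b))
```

Two queries that differ only in layout, comments, keyword case or variable names then share one fingerprint, while any change to an IRI, prefixed name or literal yields a different one.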

★★★★☆ Wikidata example queries
★★★★★ Scholia and Wikidata graph split
★★★☆☆ Other knowledge graphs, e.g., DBLP, OpenStreetMap
★★☆☆☆ Perhaps also some NFDI examples or some custom knowledge graphs like FAIRJupyter
★★★★★ Quality criteria https://github.com/WolfgangFahl/snapquery/issues/26
★★★★☆ List of standard refactoring activities and the support by this approach
★★★★☆ Getting your own copy of Wikidata; the infrastructure effort needs to be mentioned
★★★☆☆ Usability evaluation https://www.nngroup.com/articles/why-you-only-need-to-test-with-5-users/
★★★★☆ https://github.com/ad-freiburg/qlever/wiki/QLever-performance-evaluation-and-comparison-to-other-SPARQL-engines
★★★★☆ A closed issue should have at least one example that runs

★★★★★ Hypothesis by Stefan Decker: Query rot is more prominent in KG environments than in relational database environments
★★★☆☆ Ambiguity of names

★★★☆☆ Link rot
★★★★☆ Information Hiding and Dependency Inversion Principles
★★★☆☆ Federated Queries
★★★☆☆ grlc
★★☆☆☆ querypulator

W3C test set: why did we not use it as an example?

DOI: 10.5281/zenodo.4035223

Focuses on evaluating performance of graph pattern matching in SPARQL engines
Uses a subset of Wikidata as the dataset
Provides a large set of SPARQL basic graph patterns
Designed to test the benefits of worst-case optimal join algorithms
Exhibits a variety of increasingly complex join patterns
Allows for systematic testing of query optimization techniques
Offers insights into the performance characteristics of different SPARQL engines on complex graph patterns

Wolfgang Fahl; Tim Holzheim; Christoph Lange; Stefan Decker. (2023) "Semantification of CEUR-WS with Wikidata as a Target Knowledge Graph". url: https://ceur-ws.org/Vol-3447/Text2KG_Paper_13.pdf
Christoph Lange; Angelo Di Iorio. (2014) "Semantic Publishing Challenge – Assessing the Quality of Scientific Output". pages 61-76. doi: 10.1007/978-3-319-12024-9_8
Paul Warren; Paul Mulholland. (2020) "A Comparison of the Cognitive Difficulties Posed by SPARQL Query Constructs". pages 3-19. doi: 10.1007/978-3-030-61244-3_1. at: EKAW 2022
Paul Warren; Paul Mulholland. (2018) "Using SPARQL – The Practitioners’ Viewpoint". pages 485-500. doi: 10.1007/978-3-030-03667-6_31
Muhammad Saleem; Muhammad Intizar Ali; Aidan Hogan; Qaiser Mehmood; Axel-Cyrille Ngonga Ngomo. (2015) "LSQ: The Linked SPARQL Queries Dataset". pages 261-269. doi: 10.1007/978-3-319-25010-6_15
Johannes Lorey; Felix Naumann. (2013) "Detecting SPARQL Query Templates for Data Prefetching". pages 124-139. doi: 10.1007/978-3-642-38288-8_9
Angela Bonifati; Wim Martens; Thomas Timm. (2020) "An analytical study of large SPARQL query logs". pages 655-679. doi: 10.1007/s00778-019-00558-9