Acronym paper: Difference between revisions
Jump to navigation
Jump to search
| Line 17: | Line 17: | ||
{{Link|target=AcronymHistograms}} | {{Link|target=AcronymHistograms}} | ||
==== WikiCFP ==== | ==== WikiCFP ==== | ||
===== Standard case ==== | ===== Standard case ===== | ||
60% of all WikiCFP acronyms extracted are matching the regular expression | 60% of all WikiCFP acronyms extracted are matching the regular expression | ||
<pre>[A-Z]+\s*[12][0-9]{3}</pre> e.g. ISWC 2012 | <pre>[A-Z]+\s*[12][0-9]{3}</pre> e.g. ISWC 2012 | ||
| Line 24: | Line 24: | ||
654/43989 ( 1.5%) year different | 654/43989 ( 1.5%) year different | ||
</pre> | </pre> | ||
===== Corner cases ===== | |||
long acronyms tend to indicate the extraction has not worked or there | |||
is some other issue with the acronym such as indicating a joint / colocated situation | |||
<source lang='sql'> | <source lang='sql'> | ||
SELECT acronym | SELECT acronym | ||
| Line 42: | Line 44: | ||
... | ... | ||
</pre> | </pre> | ||
===== Exotic cases / Outliers ===== | |||
<source lang='sql'> | |||
SELECT acronym,url | |||
FROM "event_wikicfp" | |||
where length(acronym)>50 | |||
</source> | |||
call for chapters - images of female aggression 2016 http://www.wikicfp.com/cfp/servlet/event.showcfp?eventid=52302 | |||
Revision as of 15:23, 1 March 2023
- Acronym definition see Acronym
Research questions
- What do acronyms for scientific events and event series look like and how formal can they be described?
- How well do acronyms disambiguate scientific events and event series?
- How well is the acronym information curated in metadata sources for events and event series
- How well are acronyms used in citations of scientific events and event series?
- Acronym checker - does the Acronym fit the long version ...
Method
What do acronyms for scientific events and event series look like and how formal can they be described?
- Try regular expressions see Acronym_-_Regular_Expressions
- Check length histograms see https://github.com/WolfgangFahl/ConferenceCorpus/blob/main/tests/testAcronymCategory.py
Results
What do acronyms look like
Length distribution
WikiCFP
Standard case
60% of all WikiCFP acronyms extracted are matching the regular expression
[A-Z]+\s*[12][0-9]{3}e.g. ISWC 2012
43990/73731 ( 59.7%) matches for [A-Z]+\s*[12][0-9]{3}
654/43989 ( 1.5%) year different
Corner cases
long acronyms tend to indicate the extraction has not worked or there is some other issue with the acronym such as indicating a joint / colocated situation
SELECT acronym
FROM "event_wikicfp"
where length(acronym)=40
The acroynm entries with a length of 40 are mostly not acronyms ...
... Political Theology Agenda Symposium 2010 Knowledge Engineering Special Issue 2010 CFP MapReduce Special Issue of CCPE 2010 AOSD - Student Research Competition 2011 special session for Wireless VITAE 2011 Political Theology Agenda Symposium 2011 12th EANN / 7th AIAI Joint Congress 2011 ...
Exotic cases / Outliers
SELECT acronym,url
FROM "event_wikicfp"
where length(acronym)>50
call for chapters - images of female aggression 2016 http://www.wikicfp.com/cfp/servlet/event.showcfp?eventid=52302