GROBID

GlossaryEntry
since: 2009
description: GROBID (or Grobid, but not GroBid nor GroBiD) means GeneRation Of BIbliographic Data.
lang: en
master: GROBID


Installation

git clone https://github.com/kermitt2/grobid.git
cd grobid
./gradlew clean install test
# ...
BUILD SUCCESSFUL in 8m 55s

Running service

./gradlew run
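
Once the service is running, it can be probed from the command line. The following is a minimal sketch, assuming the service listens on http://localhost:8070 (the address used in the proxy configuration on this page) and that the `/api/isalive` and `/api/processHeaderDocument` endpoints behave as described in the GROBID service documentation. The variable `GROBID_URL` and the helper names `grobid_endpoint`, `grobid_is_alive` and `grobid_header` are hypothetical and exist only for illustration.

```shell
#!/bin/sh
# Base URL of the running service. Port 8070 is an assumption taken from the
# proxy configuration on this page; override GROBID_URL if yours differs.
GROBID_URL="${GROBID_URL:-http://localhost:8070}"

# grobid_endpoint NAME: print the full URL of an API endpoint.
# A trailing slash on GROBID_URL is removed to avoid "//" in the result.
grobid_endpoint() {
  printf '%s/api/%s\n' "${GROBID_URL%/}" "$1"
}

# grobid_is_alive: succeed only if the service answers "true" on /api/isalive.
grobid_is_alive() {
  [ "$(curl --silent --max-time 5 "$(grobid_endpoint isalive)")" = "true" ]
}

# grobid_header PDF: upload a PDF for header extraction and print the response.
grobid_header() {
  curl --silent --form "input=@$1" "$(grobid_endpoint processHeaderDocument)"
}
```

A possible invocation, with `paper.pdf` standing in for any local PDF file, would be `grobid_is_alive && grobid_header paper.pdf`.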

Apache Configuration

a2ensite grobid
systemctl reload apache2
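
The site configuration on this page uses the ProxyPass, ProxyPassReverse and Header directives and an SSL virtual host, so the corresponding Apache modules have to be enabled before the reload succeeds. The following is a sketch for a Debian-style Apache installation; the module list is an assumption derived from those directives, and the names `GROBID_APACHE_MODULES` and `prepare_apache_for_grobid` are hypothetical.

```shell
#!/bin/sh
# Modules assumed to be needed: proxy and proxy_http for ProxyPass,
# headers for the Header directive, ssl for the port 443 virtual host.
GROBID_APACHE_MODULES="proxy proxy_http headers ssl"

# prepare_apache_for_grobid: enable the modules, check the configuration
# syntax and reload Apache only if every earlier step succeeded (needs root).
prepare_apache_for_grobid() {
  # The variable is left unquoted on purpose so that each module
  # becomes a separate argument to a2enmod.
  a2enmod $GROBID_APACHE_MODULES &&
  apache2ctl configtest &&
  systemctl reload apache2
}
```

Chaining the steps with `&&` keeps a broken configuration from being reloaded into a running server.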

With domain names

see https://grobid.bitplan.com

#
# grobid.bitplan.com Server configuration
#  2020-07-04 see https://grobid.readthedocs.io/en/latest/Grobid-service/ 
#  
# see  http://stackoverflow.com/a/13089668/1497139
<VirtualHost *:80> 
  ServerAlias www.grobid.bitplan.com
  ProxyPreserveHost On
  ProxyRequests Off
  ServerName grobid.bitplan.com 
  ProxyPass / http://localhost:8070/
  ProxyPassReverse / http://localhost:8070/
  Header add "Access-Control-Allow-Origin" "*"
</VirtualHost>
<VirtualHost *:443> 
  ServerAlias secure.grobid.bitplan.com
  ProxyPreserveHost On
  ProxyRequests Off
  ServerName grobid.bitplan.com 
  ProxyPass / http://localhost:8070/
  ProxyPassReverse / http://localhost:8070/
  Header add "Access-Control-Allow-Origin" "*"
  Include ssl.conf
</VirtualHost>


Install as service with Ansible playbook

https://github.com/TIBHannover/confiDent-Ansible-VMs/tree/master/playbooks/grobid


Service file

/etc/systemd/system/grobid.service

[Unit]
Description=Grobid
After=network.target
Wants=network.target

[Service]
WorkingDirectory=/usr/local/bin/
ExecStart=/usr/local/bin/grobid-installation/grobid-service/bin/grobid-service server /usr/local/bin/grobid-installation/grobid-service/config/config.yaml
Restart=on-abort


[Install]
WantedBy=multi-user.target
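
After the unit file is in place, systemd has to be told about it before the service can be started. The following is a minimal sketch, assuming the file was saved as /etc/systemd/system/grobid.service as stated above; the names `GROBID_UNIT` and `activate_grobid_service` are hypothetical.

```shell
#!/bin/sh
# Unit name, derived from the file name /etc/systemd/system/grobid.service.
GROBID_UNIT="grobid.service"

# activate_grobid_service: register, enable and start the unit (needs root).
activate_grobid_service() {
  # Make systemd re-read its unit files so the new one is known.
  systemctl daemon-reload &&
  # Start the service now and on every boot.
  systemctl enable --now "$GROBID_UNIT" &&
  # Print the current state without opening a pager.
  systemctl --no-pager status "$GROBID_UNIT"
}
```

If the service does not come up, `journalctl -u grobid.service` shows the log output of the unit.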

Apache2 proxy pass

  Redirect /r/grobid /r/grobid/
  ProxyPass /r/grobid/ http://localhost:8070/
  ProxyPassReverse /r/grobid/ http://localhost:8070/

References

  1. Patrice Lopez (2009). "GROBID: Combining Automatic Bibliographic Data Recognition and Term Extraction for Scholarship Publications". ECDL 2009: Research and Advanced Technology for Digital Libraries, pp. 473-474. doi: 10.1007/978-3-642-04346-8_62. url: https://core.ac.uk/download/pdf/38300913.pdf