#CEURWS #OPENACCESS : Are we stuck with PDF forever? Is there a chance for semantically enriched papers?

Practically all papers at CEUR-WS.org are published in PDF. This is by now quite an old format, originally developed for printing. I still regard it as a good format for papers, in particular when the PDF file supports navigation and easy lookup of references.

However, it is not a format that allows easy searching or querying of papers, because a typical PDF carries no meaningful semantic annotation of its contents.

There are a few papers in CEUR-WS that were actually written in a semantically enriched HTML format. Will such a format become more popular for academic publishing? Or are we stuck with PDF for the foreseeable future?

I personally believe that papers should also be data, i.e. it should be possible to query the contents of a paper. There are initiatives like the Open Research Knowledge Graph (ORKG) at TIB Hannover that represent research questions, methods and results in a way that facilitates semantic queries across a large collection of papers. I find that very promising, but PDF as it is commonly produced does not seem to carry such machine-readable content.
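To make "papers as data" a bit more concrete, here is a minimal sketch, assuming a paper written in HTML with RDFa-style `property` attributes. The vocabulary (`ex:researchQuestion`, `ex:result`), the sample paper and the helper names are made up for illustration; this is not a format that CEUR-WS prescribes. It uses only the Python standard library.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag; they are skipped entirely.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class PropertyExtractor(HTMLParser):
    """Collect the text of every element that carries a `property` attribute."""

    def __init__(self):
        super().__init__()
        self._open = []   # one (property or None, text parts) entry per open element
        self.found = []   # (property, text) pairs, recorded when an element closes

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        self._open.append((dict(attrs).get("property"), []))

    def handle_data(self, data):
        # Text belongs to every annotated element that is currently open.
        for prop, parts in self._open:
            if prop is not None:
                parts.append(data)

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self._open:
            return
        prop, parts = self._open.pop()
        if prop is not None:
            # Join the pieces and normalise whitespace.
            self.found.append((prop, " ".join("".join(parts).split())))


def query(html, prop):
    """Return the text of all elements annotated with the given property."""
    parser = PropertyExtractor()
    parser.feed(html)
    parser.close()
    return [text for name, text in parser.found if name == prop]


# Hypothetical annotated paper, invented for this example.
SAMPLE = """
<article>
  <h1 property="schema:name">A Toy Paper</h1>
  <p property="ex:researchQuestion">Does annotation help retrieval?</p>
  <p>Some unannotated prose.</p>
  <p property="ex:result">Annotated papers were found <em>faster</em>.</p>
</article>
"""

print(query(SAMPLE, "ex:researchQuestion"))
# prints ['Does annotation help retrieval?']
```

The sketch assumes well-formed markup and does not resolve prefixes or build real RDF triples; a proper RDFa processor would be the tool for that. The point is only that a question like "what is the research question of this paper?" becomes a simple lookup once the annotation is there.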
